Publications

Machine Learning Predicts Reach-Scale Channel Types From Coarse-Scale Geospatial Data in a Large River Basin

Hervé Guillon, University of California, Davis
Colin F. Byrne, University of California, Davis
Belize A. Lane, Utah State UniversityFollow
Samuel Sandoval Solis, University of California, Davis
Gregory B. Pasternack, University of California, Davis

Document Type

Article

Journal/Book Title/Conference

Water Resources Research

Volume

Issue

Publisher

Wiley-Blackwell Publishing, Inc.

Publication Date

2-27-2020

First Page

Last Page

Abstract

Hydrologic and geomorphic classifications have gained traction in response to the increasing need for basin-wide water resources management. Regardless of the selected classification scheme, an open scientific challenge is how to extend information from limited field sites to classify tens of thousands to millions of channel reaches across a basin. To address this spatial scaling challenge, this study leverages machine learning to predict reach-scale geomorphic channel types using publicly available geospatial data. A bottom-up machine learning approach selects the most accurate and stable model among∼20,000 combinations of 287 coarse geospatial predictors, preprocessing methods, and algorithms in a three-tiered framework to (i) define a tractable problem and reduce predictor noise, (ii) assess model performance in statistical learning, and (iii) assess model performance in prediction. This study also addresses key issues related to the design, interpretation, and diagnosis of machine learning models in hydrologic sciences. In an application to the Sacramento River basin (California, USA), the developed framework selects a Random Forest model to predict 10 channel types previously determined from 290 field surveys over 108,943 two hundred-meter reaches. Performance in statistical learning is reasonable with a 61% median cross-validation accuracy, a sixfold increase over the 10% accuracy of the baseline random model, and the predictions coherently capture the large-scale geomorphic organization of the landscape. Interestingly, in the study area, the persistent roughness of the topography partially controls channel types and the variation in the entropy-based predictive performance is explained by imperfect training information and scale mismatch between labels and predictors.

Recommended Citation

Guillon, H., Byrne, C. F., Lane, B. A., Sandoval Solis, S., & Pasternack, G. B. (2020). Machine learning predicts reach‐scale channel types from coarse‐scale geospatial data in a large river basin. Water Resources Research, 56, e2019WR026691. https://doi.org/10.1029/2019WR026691

Download

Included in

Water Resource Management Commons

COinS

DOI

https://doi.org/10.1029/2019WR026691

Publications

Machine Learning Predicts Reach-Scale Channel Types From Coarse-Scale Geospatial Data in a Large River Basin

Document Type

Journal/Book Title/Conference

Volume

Issue

Publisher

Publication Date

First Page

Last Page

Abstract

Recommended Citation

Included in

DOI

Browse

For Authors

Scholarly Communication

Research Data

Publications

Machine Learning Predicts Reach-Scale Channel Types From Coarse-Scale Geospatial Data in a Large River Basin

Authors

Document Type

Journal/Book Title/Conference

Volume

Issue

Publisher

Publication Date

First Page

Last Page

Abstract

Recommended Citation

Included in

Share

DOI

Browse

For Authors

Scholarly Communication

Research Data