Machine learning methods for empirical streamflow simulation: a comparison of model accuracy, interpretability, and uncertainty in seasonal watersheds

被引:196
作者
Shortridge, Julie E. [1 ]
Guikema, Seth D. [2 ]
Zaitchik, Benjamin F. [3 ]
机构
[1] Johns Hopkins Univ, Dept Geog & Environm Engn, Baltimore, MD 21218 USA
[2] Univ Michigan, Dept Ind & Operat Engn, Ann Arbor, MI 48109 USA
[3] Johns Hopkins Univ, Dept Earth & Planetary Sci, Baltimore, MD USA
基金
美国国家科学基金会;
关键词
ARTIFICIAL NEURAL-NETWORKS; SUPPORT VECTOR MACHINES; BLUE-NILE; PREDICTIVE CAPABILITIES; NASH VALUES; RAINFALL; FLOW; CATCHMENT; DISCHARGE; REGRESSION;
D O I
10.5194/hess-20-2611-2016
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
In the past decade, machine learning methods for empirical rainfall-runoff modeling have seen extensive development and been proposed as a useful complement to physical hydrologic models, particularly in basins where data to support process-based models are limited. However, the majority of research has focused on a small number of methods, such as artificial neural networks, despite the development of multiple other approaches for non-parametric regression in recent years. Furthermore, this work has often evaluated model performance based on predictive accuracy alone, while not considering broader objectives, such as model interpretability and uncertainty, that are important if such methods are to be used for planning and management decisions. In this paper, we use multiple regression and machine learning approaches (including generalized additive models, multivariate adaptive regression splines, artificial neural networks, random forests, and M5 cubist models) to simulate monthly streamflow in five highly seasonal rivers in the highlands of Ethiopia and compare their performance in terms of predictive accuracy, error structure and bias, model interpretability, and uncertainty when faced with extreme climate conditions. While the relative predictive performance of models differed across basins, data-driven approaches were able to achieve reduced errors when compared to physical models developed for the region. Methods such as random forests and generalized additive models may have advantages in terms of visualization and interpretation of model structure, which can be useful in providing insights into physical watershed function. However, the uncertainty associated with model predictions under extreme climate conditions should be carefully evaluated, since certain models (especially generalized additive models and multivariate adaptive regression splines) become highly variable when faced with high temperatures.
引用
收藏
页码:2611 / 2628
页数:18
相关论文
共 66 条
[1]   Neural network modelling of non-linear hydrological relationships [J].
Abrahart, R. J. ;
See, L. M. .
HYDROLOGY AND EARTH SYSTEM SCIENCES, 2007, 11 (05) :1563-1579
[2]  
Achenef H., 2013, TANA SUB BASIN INITI, P8
[3]  
Alemayehu Tadesse, 2010, Ecohydrology & Hydrobiology, V10, P211, DOI 10.2478/v10104-011-0023-6
[4]  
[Anonymous], 2020, R PACKAGE VERSION 6
[5]  
[Anonymous], 1996, Pattern Recognition and Neural Networks, DOI DOI 10.1017/CBO9780511812651.006
[6]   Rainfall-runoff modelling using artificial neural networks technique: a Blue Nile catchment case study [J].
Antar, MA ;
Elassiouti, I ;
Allam, MN .
HYDROLOGICAL PROCESSES, 2006, 20 (05) :1201-1216
[7]   Neural networks for real time catchment flow modeling and prediction [J].
Aqil, Muhammad ;
Kita, Ichiro ;
Yano, Akira ;
Nishiyama, Soichi .
WATER RESOURCES MANAGEMENT, 2007, 21 (10) :1781-1796
[8]   Multi-time scale stream flow predictions: The support vector machines approach [J].
Asefa, T ;
Kemblowski, M ;
McKee, M ;
Khalil, A .
JOURNAL OF HYDROLOGY, 2006, 318 (1-4) :7-16
[9]  
Beven K. J., 2011, RAINFALL RUNOFF MODE, p[83, 307]
[10]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32