Super ensemble based streamflow simulation using multi-source remote sensing and ground gauged rainfall data fusion

被引:4
作者
Wegayehu, Eyob Betru [1 ]
Muluneh, Fiseha Behulu [1 ]
机构
[1] Addis Ababa Univ, Addis Ababa Inst Technol, Sch Civil & Environm Engn, Addis Ababa, Ethiopia
关键词
Streamflow prediction; Super ensemble learning; Remote sensing; Gauge-rainfall; Ethiopian river basins; NEURAL-NETWORK ANN; PRECIPITATION PRODUCTS; PERFORMANCE; SELECTION; MODELS; FLOW;
D O I
10.1016/j.heliyon.2023.e17982
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Traditional data-driven streamflow predictions usually apply a single model with inconsistent performance in different variability conditions. These days model ensembles or merging the benefits of different models without losing the general character of the data are becoming a trend in hydrology. This study compared three super ensemble learners with eight base models. Twelve years of monthly rolled daily time series data in three river catchments of Ethiopia (Borkena watershed: Awash river basin), (Gummera watershed: Abay river basin), and (Sore watershed: Baro Akobo river basin) is used for single-step daily streamflow simulation using previous thirty day input timesteps. Five input scenarios are applied: three vegetation indices, three remote sensing-based precipitation products, ground-gauged rainfall, all fused inputs, and selected inputs with the Recursive Feature Elimination (RFE) algorithm. The time series is then divided into training and testing datasets with a ratio of 80:20. The performance of the proposed models was evaluated using the Root Mean Squared Error (RMSE), coefficient of determination (R2), Mean Absolute Error (MAE), and Median Absolute Error (MEDAE). Finally, the result is presented with the corresponding five input scenarios. The catchment's and input scenario's average performance indicated that the three super ensemble learners outperformed the eight base models with relatively stable performance. The top-ranked WASE model exceeded the linear regression baseline by 13.3%. XGB, CNN-GRU, and LSTM proved the highest performance of the base models. This study also revealed that LSTM's key downside is its performance drop in the absence of feature selection criteria. In comparison, XGB showed its superior performance after controlling redundant inputs internally. Moreover, this study uniquely highlights the potential of remote sensing-based vegetation indices in the science of data-driven streamflow modelling for non gauged catchments with no meteorological time series.
引用
收藏
页数:25
相关论文
共 61 条
[1]   Remote-Sensing-Based Streamflow Forecasting Using Artificial Neural Network and Support Vector Machine Models [J].
Alquraish, Mohammed M. ;
Khadr, Mosaad .
REMOTE SENSING, 2021, 13 (20)
[2]   Principled Machine Learning Using the Super Learner: An Application to Predicting Prison Violence [J].
Bacak, Valerio ;
Kennedy, Edward H. .
SOCIOLOGICAL METHODS & RESEARCH, 2019, 48 (03) :698-721
[3]  
Basak D., 2007, Efficient learning machines: Theories, concepts, and applications for engineers and system designers, V11, P203, DOI DOI 10.1007/978-1-4302-5990-9_4
[4]   COMBINATION OF FORECASTS [J].
BATES, JM ;
GRANGER, CWJ .
OPERATIONAL RESEARCH QUARTERLY, 1969, 20 (04) :451-&
[5]   MSWEP V2 Global 3-Hourly 0.1° Precipitation: Methodology and Quantitative Assessment [J].
Beck, Hylke E. ;
Wood, Eric F. ;
Pan, Ming ;
Fisher, Colby K. ;
Miralles, Diego G. ;
van Dijk, Albert I. J. M. ;
McVicar, Tim R. ;
Adler, Robert F. .
BULLETIN OF THE AMERICAN METEOROLOGICAL SOCIETY, 2019, 100 (03) :473-502
[6]   Global-scale evaluation of 22 precipitation datasets using gauge observations and hydrological modeling [J].
Beck, Hylke E. ;
Vergopolan, Noemi ;
Pan, Ming ;
Levizzani, Vincenzo ;
van Dijk, Albert I. J. M. ;
Weedon, Graham P. ;
Brocca, Luca ;
Pappenberger, Florian ;
Huffman, George J. ;
Wood, Eric F. .
HYDROLOGY AND EARTH SYSTEM SCIENCES, 2017, 21 (12) :6201-6217
[7]   Feature selection in machine learning: A new perspective [J].
Cai, Jie ;
Luo, Jiawei ;
Wang, Shulin ;
Yang, Sheng .
NEUROCOMPUTING, 2018, 300 :70-79
[8]   Predicting failure in the US banking sector: An extreme gradient boosting approach [J].
Carmona, Pedro ;
Climent, Francisco ;
Momparler, Alexandre .
INTERNATIONAL REVIEW OF ECONOMICS & FINANCE, 2019, 61 :304-323
[9]   Merging ground and satellite-based precipitation data sets for improved hydrological simulations in the Xijiang River basin of China [J].
Chen, Tao ;
Ren, Liliang ;
Yuan, Fei ;
Tang, Tiantian ;
Yang, Xiaoli ;
Jiang, Shanhu ;
Liu, Yi ;
Zhao, Chongxu ;
Zhang, Limin .
STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2019, 33 (10) :1893-1905
[10]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794