Hybrid multi-model ensemble learning for reconstructing gridded runoff of Europe for 500 years

被引:11
作者
Singh, Ujjwal [1 ]
Maca, Petr [1 ]
Hanel, Martin [1 ]
Markonis, Yannis [1 ]
Nidamanuri, Rama Rao [2 ]
Nasreen, Sadaf [1 ]
Blocher, Johanna Ruth [1 ]
Strnad, Filip [1 ]
Vorel, Jiri [3 ]
Riha, Lubomir [4 ]
Raghubanshi, Akhilesh Singh [5 ]
机构
[1] Czech Univ Life Sci Prague, Fac Environm Sci, Kamycka 129, Praha Suchdol 16500, Czech Republic
[2] Indian Inst Space Sci & Technol, Dept Earth & Space Sci, Thiruvananthapuram 695547, Kerala, India
[3] CESNET ZSPO, Zikova 1903-4, Prague 16000, Czech Republic
[4] VSB Tech Univ Ostrava, Innovat Natl Supercomp Ctr IT4, Ostrava, Czech Republic
[5] Banaras Hindu Univ, Inst Environm & Sustainable Dev, Varanasi 221005, UP, India
关键词
Machine learning; Ensemble machine learning; Spatial auto-correlation; Hybrid Ensemble Multi-Model Framework; (HEMMF); ARTIFICIAL NEURAL-NETWORKS; REGRESSION; PRECIPITATION; SERIES; EVAPORATION; TEMPERATURE; PERFORMANCE; ALGORITHMS; PREDICTION; FRAMEWORK;
D O I
10.1016/j.inffus.2023.101807
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Runoff is a crucial water cycle component that contributes to the water resources to sustain human life. Historical trends in runoff, when examining climate change scenarios, provide vital information about past variability and support the design of adaptation measures. However, hydrological models based on climate data, such as the Budyko model, can be biased in estimating annual runoff due to input data uncertainty. Therefore, it is vital to utilize advanced machine learning-based computing models to reduce uncertainty and reconstruct climate variables over a long period of time and sufficiently large spatial coverage, preferably at a continental scale. We propose and test a novel machine learning-based framework called Hybrid Ensemble Multi-Model Framework (HEMMF) to reconstruct the gridded runoff of Europe over a 500-year historical period (1500 to 1999). The HEMMF combines non-parametric extended data pattern recognition and data-driven methods. The extended data patterns are computed using Moran's spatial autocorrelation (SPA) index of the climate variable fields and the Budyko models output, whereas the data-driven methods contain nine different machine learning (ML) algorithms and four ensembles of ML. The extended data patterns are jointly ingested with climate-reconstructed data (precipitation, temperature, Palmer's drought severity index) as predictor variables, which serve as input for the data-driven methods. To assess the impact and contribution of SPA, the runoff is simulated based on three different input training datasets in the HEMMF: (1) a dataset containing only precipitation, temperature, Palmer's drought severity index, and four different estimates of runoff from the Budyko model, (2) a dataset containing only SPA of the first input datasets, and (3) a dataset created by merging the first and second datasets. The HEMMF offers the best reconstruction performance when using the third input dataset. This reconstructed runoff helps to explain the runoff trend, drought propagation, and runoff's link with the climate variables. The proposed methodology has the potential to be applied to past hydroclimatic data and related analyses across different temporal periods, climate scenarios, and geographical scales.
引用
收藏
页数:17
相关论文
共 128 条
[1]   Hybrid Machine Learning Ensemble Techniques for Modeling Dissolved Oxygen Concentration [J].
Abba, Sani Isah ;
Linh, Nguyen Thi Thuy ;
Abdullahi, Jazuli ;
Ali, Shaban Ismael Albrka ;
Pham, Quoc Bao ;
Abdulkadir, Rabiu Aliyu ;
Costache, Romulus ;
Nam, Van Thai ;
Anh, Duong Tran .
IEEE ACCESS, 2020, 8 :157218-157237
[2]   Least square support vector machine and multivariate adaptive regression splines for streamflow prediction in mountainous basin using hydro-meteorological data as inputs [J].
Adnan, Rana Muhammad ;
Liang, Zhongmin ;
Heddam, Salim ;
Zounemat-Kermani, Mohammad ;
Kisi, Ozgur ;
Li, Binquan .
JOURNAL OF HYDROLOGY, 2020, 586 (586)
[3]   Development of multivariate adaptive regression spline integrated with differential evolution model for streamflow simulation [J].
Al-Sudani, Zainab Abdulelah ;
Salih, Sinan Q. ;
Sharafati, Ahmad ;
Yaseen, Zaher Mundher .
JOURNAL OF HYDROLOGY, 2019, 573 :1-12
[4]   POINTS OF SIGNIFICANCE Ensemble methods: bagging and random forests [J].
Altman, Naomi ;
Krzywinski, Martin .
NATURE METHODS, 2017, 14 (10) :933-934
[5]   LOCAL INDICATORS OF SPATIAL ASSOCIATION - LISA [J].
ANSELIN, L .
GEOGRAPHICAL ANALYSIS, 1995, 27 (02) :93-115
[6]   Comparing instrumental, palaeoclimate, and projected rainfall data: Implications for water resources management and hydrological modelling [J].
Armstrong, Matthew S. ;
Kiem, Anthony S. ;
Vance, Tessa R. .
JOURNAL OF HYDROLOGY-REGIONAL STUDIES, 2020, 31
[7]   A review and case study of Artificial intelligence and Machine learning methods used for ground condition prediction ahead of tunnel boring Machines [J].
Ayawah, Prosper E. A. ;
Sebbeh-Newton, Sylvanus ;
Azure, Jessica W. A. ;
Kaba, Azupuri G. A. ;
Anani, Angelina ;
Bansah, Samuel ;
Zabidi, Hareyani .
TUNNELLING AND UNDERGROUND SPACE TECHNOLOGY, 2022, 125
[8]   Prediction of Neuropeptides from Sequence Information Using Ensemble Classifier and Hybrid Features [J].
Bin, Yannan ;
Zhang, Wei ;
Tang, Wending ;
Dai, Ruyu ;
Li, Menglu ;
Zhu, Qizhi ;
Xia, Junfeng .
JOURNAL OF PROTEOME RESEARCH, 2020, 19 (09) :3732-3740
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]  
Breiman L., 2017, Classification and regression trees