A robust Bayesian Multi-Machine learning ensemble framework for probabilistic groundwater level forecasting

被引:0
作者
Zhu, Feilin [1 ]
Sun, Yimeng [2 ]
Han, Mingyu [1 ]
Hou, Tiantian [1 ]
Zeng, Yurou [1 ]
Lin, Meiyan [1 ]
Wang, Yaqin [1 ]
Zhong, Ping-an [1 ]
机构
[1] Hohai Univ, Coll Hydrol & Water Resources, Nanjing 210098, Peoples R China
[2] Hydrol Bur Changjiang Water Resources Commiss, Lower Changjiang River Bur Hydrol & Water Resource, Nanjing 210009, Peoples R China
基金
中国国家自然科学基金;
关键词
Groundwater level prediction; Machine learning; Multi-model ensemble; Probabilistic forecasting; Uncertainty analysis; Hyperparameter optimization;
D O I
10.1016/j.jhydrol.2024.132567
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Accurate prediction of groundwater levels is crucial for effective water resource management in agricultural regions, where groundwater is a vital resource. In these areas, the complex nonlinear relationships between groundwater storage, agricultural water demand, climate, and surface water delivery pose challenges for traditional physically-based models. As an alternative, data-driven machine learning methods are often adopted as surrogate to capture such intricate relationships due to their high computational efficiency. Inevitably, using only one machine learning model is prone to underestimate prediction uncertainty and subjected to poor accuracy. This study proposes a robust Bayesian multi-machine learning ensemble framework for probabilistic groundwater level forecasting. The framework incorporates a comprehensive set of input factors, including autocorrelation, meteorological, hydrological, and human activity variables, to capture the lag effects and driving mechanisms of groundwater depth variations. Five rule-based constraints are applied to identify the optimal combination of predictive factors and their time lags. Five machine learning models are employed, and their hyperparameters are optimized using Bayesian optimization algorithm. The ensemble of models is constructed using the stacking algorithm, leveraging the strengths of different modeling approaches. The uncertainty analysis within the framework involves three key components: fitting and selecting marginal distributions, constructing joint distributions using Copula functions, and calculating Bayesian posterior distributions. This enables a quantitative assessment of the uncertainties associated with groundwater depth predictions. A case study in the YingGuo region of the Huaihe River Basin, China, demonstrates the effectiveness of the proposed framework. The ensemble predictions outperform individual models, and the probabilistic forecasts provide reliable confidence intervals. The flexibility of the uncertainty analysis framework in describing complex groundwater depth variables is also highlighted. The proposed approach offers a robust framework for analyzing the uncertainties in groundwater depth predictions, serving as a valuable tool for comprehensive groundwater resource assessment and effective risk management strategies in agricultural regions.
引用
收藏
页数:16
相关论文
共 41 条
[1]   Groundwater levels forecasting using machine learning models: A case study of the groundwater region 10 at Karst Belt, South Africa [J].
Aderemi, Banjo A. ;
Olwal, Thomas O. ;
Ndambuki, Julius M. ;
Rwanga, Sophia S. .
SYSTEMS AND SOFT COMPUTING, 2023, 5
[2]   Groundwater resources assessment in the Densu Basin: A review [J].
Akurugu, Bismark Awinbire ;
Obuobie, Emmanuel ;
Yidana, Sandow Mark ;
Stisen, Simon ;
Seidenfaden, Ida Karlsson ;
Chegbeleh, Larry Pax .
JOURNAL OF HYDROLOGY-REGIONAL STUDIES, 2022, 40
[3]   A Spatially Enhanced Data-Driven Multimodel to Improve Groundwater Forecasts in the High Plains Aquifer, USA [J].
Amaranto, A. ;
Munoz-Arriola, F. ;
Solomatine, D. P. ;
Corzo, G. .
WATER RESOURCES RESEARCH, 2019, 55 (07) :5941-5961
[4]   Sensitivity analysis of data-driven groundwater forecasts to hydroclimatic controls in irrigated croplands [J].
Amaranto, Alessandro ;
Pianosi, Francesca ;
Solomatine, Dimitri ;
Corzo, Gerald ;
Munoz-Arriola, Francisco .
JOURNAL OF HYDROLOGY, 2020, 587
[5]   Semi-seasonal groundwater forecast using multiple data-driven models in an irrigated cropland [J].
Amaranto, Alessandro ;
Munoz-Arriola, Francisco ;
Corzo, Gerald ;
Solomatine, Dimitri P. ;
Meyer, George .
JOURNAL OF HYDROINFORMATICS, 2018, 20 (06) :1227-1246
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[8]   A parsimonious methodological framework for short-term forecasting of groundwater levels [J].
Collados-Lara, A. J. ;
Pulido-Velazquez, D. ;
Ruiz, L. G. B. ;
Pegalajar, M. C. ;
Pardo-Iguzquiza, E. ;
Baena-Ruiz, L. .
SCIENCE OF THE TOTAL ENVIRONMENT, 2023, 881
[9]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[10]   A stacked machine learning model for multi-step ahead prediction of lake surface water temperature [J].
Di Nunno, Fabio ;
Zhu, Senlin ;
Ptak, Mariusz ;
Sojka, Mariusz ;
Granata, Francesco .
SCIENCE OF THE TOTAL ENVIRONMENT, 2023, 890