Bayesian machine learning ensemble approach to quantify model uncertainty in predicting groundwater storage change

被引:77
|
作者
Yin, Jina [1 ,2 ,3 ]
Medellin-Azuara, Josue [3 ]
Escriva-Bou, Alvar [4 ]
Liu, Zhu [1 ]
机构
[1] Hohai Univ, State Key Lab Hydrol Water Resources & Hydraul En, Nanjing 210098, Jiangsu, Peoples R China
[2] Hohai Univ, Yangtze Inst Conservat & Dev, Nanjing 210098, Jiangsu, Peoples R China
[3] Univ Calif Merced, Civil & Environm Engn, Merced, CA 95343 USA
[4] Publ Policy Inst Calif, Water Policy Ctr, 500 Washington St,Suite 600, San Francisco, CA 94111 USA
关键词
Machine learning ensemble; Bayesian model averaging; Uncertainty quantification; Groundwater storage change; Irrigation pumping; ARTIFICIAL NEURAL-NETWORK; SAN-JOAQUIN VALLEY; OPTIMIZATION; PARAMETER; FRAMEWORK; ALGORITHMS; SIMULATION; SUPPORT; INPUT; SWAT;
D O I
10.1016/j.scitotenv.2020.144715
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Agricultural water demand, groundwater extraction, surface water delivery and climate have complex nonlinear relationships with groundwater storage in agricultural regions. As an alternative to elaborate computationally intensive physical models, machine learning methods are often adopted as surrogate to capture such complex relationships due to their high computational efficiency. Inevitably, using only one machine learning model is prone to underestimate prediction uncertainty and subjected to poor accuracy. This study presents a novel machine learning-based groundwater ensemble modeling framework in conjunct ion with a Bayesian model averaging approach to predict groundwater storage change and improve overall model predicting reliability. Three different machine learning models have been developed namely artificial neural network, support vector machine and response surface regression. To explicitly quantify uncertainty from machine learning model parameter and structure, Bayesian model averaging is employed to produce a forecast distribution associated with each machine learning prediction. Model weights and variances are obtained based on model performance to construct ensemble models. Then, the developed individual and Bayesian model averaging machine learning ensemble models are applied, evaluated and validated at different spatial scales including subregional and regional scales in an overdrafted agricultural region-the San Joaquin River Basin, through independent training and testing dataset. Results shows the machine learning models have remarkable predicting capability without sacrificing accuracy but with higher computational efficiency. Compared to a single-model approach, the ensemble model is able to produce consistently reliable predictions across the basin, yet it does not always outperform the best model in the ensemble. Additionally, model results suggest that groundwater pumping for agricultural irrigation is the primary driving force of groundwater storage change across the region. The modeling framework can serve as an alternative approach to simulating groundwater response, especially in those agricultural regions where lack of subsurface data hinders physically-based irrigation is the primary driving force of groundwater storage change across the region. The modeling framework can serve as an alternative approach to simulating groundwater response, especially in those agricultural regions where lack of subsurface data hinders physically-based modeling. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Unraveling uncertainty drivers of the maize yield response to nitrogen: A Bayesian and machine learning approach
    Correndo, Adrian A.
    Tremblay, Nicolas
    Coulter, Jeffrey A.
    Ruiz-Diaz, Dorivar
    Franzen, David
    Nafziger, Emerson
    Prasad, Vara
    Rosso, Luiz H. Moro
    Steinke, Kurt
    Du, Juan
    Messina, Carlos D.
    Ciampitti, Ignacio A.
    AGRICULTURAL AND FOREST METEOROLOGY, 2021, 311
  • [32] Predicting inmate suicidal behavior with an interpretable ensemble machine learning approach in smart prisons
    Akhtar, Khayyam
    Yaseen, Muhammad Usman
    Imran, Muhammad
    Khattak, Sohaib Bin Altaf
    Nasralla, Moustafa M.
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [33] The trauma severity model: An ensemble machine learning approach to risk prediction
    Gorczyca, Michael T.
    Toscano, Nicole C.
    Cheng, Julius D.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2019, 108 : 9 - 19
  • [34] A Bayesian approach for quantification of model uncertainty
    Park, Inseok
    Amarchinta, Hemanth K.
    Grandhi, Ramana V.
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2010, 95 (07) : 777 - 785
  • [35] Groundwater potential zoning using Logistics Model Trees based novel ensemble machine learning model
    Bien, Tran Xuan
    Trinh, Pham The
    Duong, Luu Thuy
    Phong, Tran Van
    Nhat, Vuong Hong
    Lei, Hiep Van
    Nguyen, Dam Duc
    Prakash, Indra
    Tam, Pham Thanh
    Pham, Binh Thai
    VIETNAM JOURNAL OF EARTH SCIENCES, 2024, 46 (02): : 272 - 281
  • [36] Bayesian-optimized ensemble learning for predicting stiffness modulus of asphalt mixtures: A data-driven approach with model interpretation
    Qiu, Pengfei
    Lu, Jiakai
    Ma, Jie
    Hou, Xiangdao
    Wang, Tao
    Li, Jin
    MATERIALS LETTERS, 2025, 382
  • [37] A Hierarchical Bayesian Model to Quantify Uncertainty of Stream Water Temperature Forecasts
    Bal, Guillaume
    Rivot, Etienne
    Bagliniere, Jean-Luc
    White, Jonathan
    Prevost, Etienne
    PLOS ONE, 2014, 9 (12):
  • [38] Machine Learning Ensemble Modelling for Predicting Unemployment Duration
    Gabrikova, Barbora
    Svabova, Lucia
    Kramarova, Katarina
    APPLIED SCIENCES-BASEL, 2023, 13 (18):
  • [39] Dealing with Drift Uncertainty: A Bayesian Learning Approach
    De Franco, Carmine
    Nicolle, Johann
    Pham, Huyen
    RISKS, 2019, 7 (01):
  • [40] Predicting unrecognized enhancer-mediated genome topology by an ensemble machine learning model
    Tang, Li
    Hill, Matthew C.
    Wang, Jun
    Wang, Jianxin
    Martin, James F.
    Li, Min
    GENOME RESEARCH, 2020, 30 (12) : 1835 - 1845