Improving GALDIT-based groundwater vulnerability predictive mapping using coupled resampling algorithms and machine learning models

被引:64
作者
Barzegar, Rahim [1 ,2 ]
Razzagh, Siamak [3 ]
Quilty, John [4 ]
Adamowski, Jan [1 ]
Pour, Homa Kheyrollah [2 ]
Booij, Martijn J. [5 ]
机构
[1] McGill Univ, Dept Bioresource Engn, 21111 Lakeshore, Ste Anne De Bellevue, PQ H9X 3V9, Canada
[2] Wilfrid Laurier Univ, Dept Geog & Environm Studies, Waterloo, ON, Canada
[3] Univ Tabriz, Fac Nat Sci, Dept Earth Sci, Tabriz, Iran
[4] Univ Waterloo, Dept Civil & Environm Engn, Waterloo, ON, Canada
[5] Univ Twente, Fac Engn Technol, Dept Water Engn & Management, Enschede, Netherlands
关键词
Resampling algorithm; Groundwater vulnerability; Coastal aquifer; Machine learning; Hybrid model; MODIFIED DRASTIC MODEL; COASTAL AQUIFER; LANDSLIDE SUSCEPTIBILITY; ARTIFICIAL-INTELLIGENCE; SALTWATER INTRUSION; CONTAMINATION RISK; PLAIN AQUIFER; POLLUTION; CITY; AREA;
D O I
10.1016/j.jhydrol.2021.126370
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Developing accurate groundwater vulnerability maps is important for the sustainable management of groundwater resources. In this research, resampling methods [e.g., Bootstrap Aggregating (BA) and Disjoint Aggregating (DA)] are combined with machine learning (ML) models, namely eXtreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LGBM), Adaptive Boosting (AdaBoost), Categorical Boosting (CatBoost), and Random Forest (RF), to improve the GALDIT groundwater vulnerability mapping framework that considers Groundwater occurrence (G) (i.e., aquifer type), Aquifer hydraulic conductivity (A), depth to groundwater Level (L), Distance from the seashore (D), Impact of existing seawater intrusion status (I), and aquifer Thickness (T). The proposed approach overcomes the subjectivity of the weights and ratings given to the six variables in the GALDIT framework (via the ML methods) and helps address the small dataset issue (via resampling methods) common to groundwater vulnerability predictive mapping. Considering the Shabestar Plain aquifer, situated in the northeast of Lake Urmia (Iran), the predicted vulnerability indices from GALDIT were adjusted using total dissolved solid (TDS, an indicator of drinking water quality) concentrations, and were modeled by the ML models. Pearson's correlation coefficient (r) and distance correlation (DC) between the predicted vulnerability indices and TDS were used to validate the models. Using a validation set, the GALDIT framework (r = 0.447 and DC = 0.511) was compared against the best performing standalone (XGBoost-GALDIT, r = 0.613, DC = 0.647) and coupled resampling (BA-XGBoost-GALDIT, r = 0.659, DC = 0.699 and DA-RF-GALDIT, r = 0.616, DC = 0.662) ML models, revealing that the proposed framework significantly increases r and DC metrics. In general, the BA resampling method led to better performing ML models than DA. However, in all cases, it was found that integrating resampling methods and ML models are promising tools to improve the accuracy of GALDIT vulnerability models.
引用
收藏
页数:15
相关论文
共 86 条
  • [1] Saltwater intrusion modelling in Jorf coastal aquifer, South-eastern Tunisia: geochemical, geoelectrical and geostatistical application
    Agoubi, Belgacem
    Kharroubi, Adel
    Abida, Habib
    [J]. HYDROLOGICAL PROCESSES, 2013, 27 (08) : 1191 - 1199
  • [2] Aller L., 1985, DRASTIC STANDARDIZED, P455
  • [3] [Anonymous], 2014, CARTOGRAPHY POLE POL
  • [4] APHP AWWA WEF, 2005, Standard Methods for Examination of Water and Wastewater, VFirst, DOI DOI 10.2105/AJPH.56.4.684-A
  • [5] An approach to aquifer vulnerability including uncertainty in a spatial random function framework
    Armengol, S.
    Sanchez-Vila, X.
    Folch, A.
    [J]. JOURNAL OF HYDROLOGY, 2014, 517 : 889 - 900
  • [6] Asadian O., 2007, GEOLOGICAL QUADRANGL
  • [7] Asghari Moghaddam A., 1991, THESIS U COLL LONDON
  • [8] AI-based prediction of independent construction safety outcomes from universal attributes
    Baker, Henrietta
    Hallowell, Matthew R.
    Tixier, Antoine J-P
    [J]. AUTOMATION IN CONSTRUCTION, 2020, 118
  • [9] Coupling a hybrid CNN-LSTM deep learning model with a Boundary Corrected Maximal Overlap Discrete Wavelet Transform for multiscale Lake water level forecasting
    Barzegar, Rahim
    Aalami, Mohammad Taghi
    Adamowski, Jan
    [J]. JOURNAL OF HYDROLOGY, 2021, 598
  • [10] Heavy Metal(loid)s in the Groundwater of Shabestar Area (NW Iran): Source Identification and Health Risk Assessment
    Barzegar, Rahim
    Asghari Moghaddam, Asghar
    Soltani, Shahla
    Fijani, Elham
    Tziritis, Evangelos
    Kazemian, Naeimeh
    [J]. EXPOSURE AND HEALTH, 2019, 11 (04) : 251 - 265