Improving GALDIT-based groundwater vulnerability predictive mapping using coupled resampling algorithms and machine learning models

被引:70
作者
Barzegar, Rahim [1 ,2 ]
Razzagh, Siamak [3 ]
Quilty, John [4 ]
Adamowski, Jan [1 ]
Pour, Homa Kheyrollah [2 ]
Booij, Martijn J. [5 ]
机构
[1] McGill Univ, Dept Bioresource Engn, 21111 Lakeshore, Ste Anne De Bellevue, PQ H9X 3V9, Canada
[2] Wilfrid Laurier Univ, Dept Geog & Environm Studies, Waterloo, ON, Canada
[3] Univ Tabriz, Fac Nat Sci, Dept Earth Sci, Tabriz, Iran
[4] Univ Waterloo, Dept Civil & Environm Engn, Waterloo, ON, Canada
[5] Univ Twente, Fac Engn Technol, Dept Water Engn & Management, Enschede, Netherlands
关键词
Resampling algorithm; Groundwater vulnerability; Coastal aquifer; Machine learning; Hybrid model; MODIFIED DRASTIC MODEL; COASTAL AQUIFER; LANDSLIDE SUSCEPTIBILITY; ARTIFICIAL-INTELLIGENCE; SALTWATER INTRUSION; CONTAMINATION RISK; PLAIN AQUIFER; POLLUTION; CITY; AREA;
D O I
10.1016/j.jhydrol.2021.126370
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Developing accurate groundwater vulnerability maps is important for the sustainable management of groundwater resources. In this research, resampling methods [e.g., Bootstrap Aggregating (BA) and Disjoint Aggregating (DA)] are combined with machine learning (ML) models, namely eXtreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LGBM), Adaptive Boosting (AdaBoost), Categorical Boosting (CatBoost), and Random Forest (RF), to improve the GALDIT groundwater vulnerability mapping framework that considers Groundwater occurrence (G) (i.e., aquifer type), Aquifer hydraulic conductivity (A), depth to groundwater Level (L), Distance from the seashore (D), Impact of existing seawater intrusion status (I), and aquifer Thickness (T). The proposed approach overcomes the subjectivity of the weights and ratings given to the six variables in the GALDIT framework (via the ML methods) and helps address the small dataset issue (via resampling methods) common to groundwater vulnerability predictive mapping. Considering the Shabestar Plain aquifer, situated in the northeast of Lake Urmia (Iran), the predicted vulnerability indices from GALDIT were adjusted using total dissolved solid (TDS, an indicator of drinking water quality) concentrations, and were modeled by the ML models. Pearson's correlation coefficient (r) and distance correlation (DC) between the predicted vulnerability indices and TDS were used to validate the models. Using a validation set, the GALDIT framework (r = 0.447 and DC = 0.511) was compared against the best performing standalone (XGBoost-GALDIT, r = 0.613, DC = 0.647) and coupled resampling (BA-XGBoost-GALDIT, r = 0.659, DC = 0.699 and DA-RF-GALDIT, r = 0.616, DC = 0.662) ML models, revealing that the proposed framework significantly increases r and DC metrics. In general, the BA resampling method led to better performing ML models than DA. However, in all cases, it was found that integrating resampling methods and ML models are promising tools to improve the accuracy of GALDIT vulnerability models.
引用
收藏
页数:15
相关论文
共 86 条
[1]   Saltwater intrusion modelling in Jorf coastal aquifer, South-eastern Tunisia: geochemical, geoelectrical and geostatistical application [J].
Agoubi, Belgacem ;
Kharroubi, Adel ;
Abida, Habib .
HYDROLOGICAL PROCESSES, 2013, 27 (08) :1191-1199
[2]  
Aller L., 1985, DRASTIC STANDARDIZED, P455
[3]  
American Public Health Association (APHA), 2005, Standard Methods for the Examination of Water and Wastewater, V21st ed.
[4]   An approach to aquifer vulnerability including uncertainty in a spatial random function framework [J].
Armengol, S. ;
Sanchez-Vila, X. ;
Folch, A. .
JOURNAL OF HYDROLOGY, 2014, 517 :889-900
[5]  
Asghari Moghaddam A., 1991, THESIS U COLL LONDON
[6]   AI-based prediction of independent construction safety outcomes from universal attributes [J].
Baker, Henrietta ;
Hallowell, Matthew R. ;
Tixier, Antoine J-P .
AUTOMATION IN CONSTRUCTION, 2020, 118
[7]   Coupling a hybrid CNN-LSTM deep learning model with a Boundary Corrected Maximal Overlap Discrete Wavelet Transform for multiscale Lake water level forecasting [J].
Barzegar, Rahim ;
Aalami, Mohammad Taghi ;
Adamowski, Jan .
JOURNAL OF HYDROLOGY, 2021, 598
[8]   Heavy Metal(loid)s in the Groundwater of Shabestar Area (NW Iran): Source Identification and Health Risk Assessment [J].
Barzegar, Rahim ;
Asghari Moghaddam, Asghar ;
Soltani, Shahla ;
Fijani, Elham ;
Tziritis, Evangelos ;
Kazemian, Naeimeh .
EXPOSURE AND HEALTH, 2019, 11 (04) :251-265
[9]   Mapping groundwater contamination risk of multiple aquifers using multi-model ensemble of machine learning algorithms [J].
Barzegar, Rahim ;
Moghaddam, Asghar Asghari ;
Deo, Ravinesh ;
Fijani, Elham ;
Tziritis, Evangelos .
SCIENCE OF THE TOTAL ENVIRONMENT, 2018, 621 :697-712
[10]   A supervised committee machine artificial intelligent for improving DRASTIC method to assess groundwater contamination risk: a case study from Tabriz plain aquifer, Iran [J].
Barzegar, Rahim ;
Moghaddam, Asghar Asghari ;
Baghban, Hamed .
STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2016, 30 (03) :883-899