Optimization of statistical and machine learning hybrid models for groundwater potential mapping

被引:72
作者
Yariyan, Peyman [1 ]
Avand, Mohammadtaghi [2 ]
Omidvar, Ebrahim [3 ]
Pham, Quoc Bao [4 ,5 ]
Linh, Nguyen Thi Thuy [6 ,7 ]
Tiefenbacher, John P. [8 ]
机构
[1] Islamic Azad Univ, Dept Surveying Engn, Saghez Branch, Saghez, Iran
[2] TarbiatModares Univ, Coll Nat Resources, Dept Watershed Management Engn, Tehran, Iran
[3] Univ Kashan, Fac Nat Resources & Earth Sci, Dept Rangeland & Watershed Management, Kashan, Iran
[4] Ton Duc Thang Univ, Environm Qual Atmospher Sci & Climate Change Res, Ho Chi Minh City, Vietnam
[5] Ton Duc Thang Univ, Fac Environm & Labour Safety, Ho Chi Minh City, Vietnam
[6] Duy Tan Univ, Inst Res & Dev, Danang, Vietnam
[7] Duy Tan Univ, Fac Environm & Chem Engn, Danang, Vietnam
[8] Texas State Univ, Dept Geog, San Marcos, TX USA
关键词
Groundwater potential mapping; ensemble frameworks; statistical and machine learning models; Iran; ENSEMBLE; FOREST;
D O I
10.1080/10106049.2020.1870164
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Determining areas of high groundwater potential is important for exploitation, management, and protection of water resources. This study assesses the spatial distribution of groundwater potential in the Zarrinehroud watershed of Kurdistan Province, Iran using combinations of five statistical and machine learning algorithms - frequency ratio (FR), radial basis function (RBF), index of entropy (IOE), evidential belief function (EBF) and fuzzy art map (FAM). To accomplish this, 1448 well locations in the study area were randomly divided into two data sets for training (70%= 1013 locations) and validation (30%= 435 locations) based on the holdout method. Fourteen factors that can affect the presence or absence of groundwater were identified, measured, and mapped using ArcGIS and SAGA-GIS software. The models were used to predict the locations of groundwater based on suitable combinations of the conditioning factors to produce groundwater potential maps. The probability of groundwater at any location was classified as low, moderate, high, or very high based on natural breaks in the data spectrum. The model predictions were tested for validity and their success was determined using receiver operating characteristic (ROC) curves, standard errors (SE), positive predictive value (PPV), negative predictive value (NPV), sensitivity (SST), specificity (SPF) and accuracy (ACC), and the Friedman test. The performance assessments of groundwater potential predictions using the area under the curve (AUC) and accuracy (ACC) showed that the FR-RBF model had very good performance (AUC= 0.889, ACC= 87.51). FR-FAM (AUC= 0.869, ACC= 84.67), EBF-FAM (AUC= 0.864, ACC= 84.42), EBF-RBF (AUC= 0.854, ACC= 83.94), FR-IOE (AUC= 0.836, ACC= 83.62), and EBF-IOE (AUC= 0.833, ACC= 80.42) also had acceptable performance. The results of the Friedman test also show that there are significant differences between the models and the highest mean rank was generated by the FR-FAM model (3.642). Therefore, the hybrid models can be used to increase the accuracy of groundwater-prediction models in the study region and perhaps in similar settings.
引用
收藏
页码:3877 / 3911
页数:35
相关论文
共 95 条
[1]   Application of probabilistic-based frequency ratio model in groundwater potential mapping using remote sensing data and GIS [J].
Abd Manap, Mohamad ;
Nampak, Haleh ;
Pradhan, Biswajeet ;
Lee, Saro ;
Sulaiman, Wan Nor Azmin ;
Ramli, Mohammad Firuz .
ARABIAN JOURNAL OF GEOSCIENCES, 2014, 7 (02) :711-724
[2]   A knowledge-driven GIS modeling technique for groundwater potential mapping at the Upper Langat Basin, Malaysia [J].
Abd Manap, Mohamad ;
Sulaiman, Wan Nor Azmin ;
Ramli, Mohammad Firuz ;
Pradhan, Biswajeet ;
Surip, Noraini .
ARABIAN JOURNAL OF GEOSCIENCES, 2013, 6 (05) :1621-1637
[3]   How do machine learning techniques help in increasing accuracy of landslide susceptibility maps? [J].
Achour, Yacine ;
Pourghasemi, Hamid Reza .
GEOSCIENCE FRONTIERS, 2020, 11 (03) :871-883
[4]   GIS-based spatial prediction of debris flows using logistic regression and frequency ratio models for Zezere River basin and its surrounding area, Northwest Covilha, Portugal [J].
Achour, Yacine ;
Garcia, Sonia ;
Cavaleiro, Victor .
ARABIAN JOURNAL OF GEOSCIENCES, 2018, 11 (18)
[5]   Multi-criteria decision analysis for delineation of groundwater potential zones in a tropical river basin using remote sensing, GIS and analytical hierarchy process (AHP) [J].
Achu, A. L. ;
Thomas, Jobin ;
Reghunath, Rajesh .
GROUNDWATER FOR SUSTAINABLE DEVELOPMENT, 2020, 10
[6]   A comparison between index of entropy and catastrophe theory methods for mapping groundwater potential in an arid region [J].
Al-Abadi, Alaa M. ;
Shahid, Shamsuddin .
ENVIRONMENTAL MONITORING AND ASSESSMENT, 2015, 187 (09)
[7]  
Ali Abbaspour R., 2019, INT ARCH PHOTOGRAMME, V18
[8]   Multi-influencing factor method for delineation of groundwater potential zones using remote sensing and GIS techniques in the western part of Perambalur district, southern India [J].
Anbarasu, S. ;
Brindha, K. ;
Elango, L. .
EARTH SCIENCE INFORMATICS, 2020, 13 (02) :317-332
[9]  
ARABAMERI A, 2020, REMOTE SENS-BASEL, V20
[10]  
ARABAMERI A, 2019, REMOTE SENS-BASEL, V11