GIS-based groundwater potential mapping using boosted regression tree, classification and regression tree, and random forest machine learning models in Iran

被引:499
作者
Naghibi, Seyed Amir [1 ]
Pourghasemi, Hamid Reza [2 ]
Dixon, Barnali [3 ]
机构
[1] Tarbiat Modares Univ, Coll Nat Resources, Dept Watershed Management Engn, Noor, Mazandaran, Iran
[2] Shiraz Univ, Coll Agr, Dept Nat Resources & Environm Engn, Shiraz, Iran
[3] Univ S Florida, Dept Environm Sci Policy & Geog, St Petersburg, FL 33701 USA
关键词
Spring potential mapping; Boosted regression tree; Classification and regression tree; Random forest; GIS; Iran; GEOGRAPHIC INFORMATION-SYSTEMS; SULTAN MOUNTAINS; FREQUENCY RATIO; SUSCEPTIBILITY; UNCERTAINTY; WEIGHTS; ZONES; AREA; TOOL; IDENTIFICATION;
D O I
10.1007/s10661-015-5049-6
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Groundwater is considered one of the most valuable fresh water resources. The main objective of this study was to produce groundwater spring potential maps in the Koohrang Watershed, Chaharmahal-e-Bakhtiari Province, Iran, using three machine learning models: boosted regression tree (BRT), classification and regression tree (CART), and random forest (RF). Thirteen hydrological-geological-physiographical (HGP) factors that influence locations of springs were considered in this research. These factors include slope degree, slope aspect, altitude, topographic wetness index (TWI), slope length (LS), plan curvature, profile curvature, distance to rivers, distance to faults, lithology, land use, drainage density, and fault density. Subsequently, groundwater spring potential was modeled and mapped using CART, RF, and BRT algorithms. The predicted results from the three models were validated using the receiver operating characteristics curve (ROC). From 864 springs identified, 605 (approximate to 70 %) locations were used for the spring potential mapping, while the remaining 259 (approximate to 30 %) springs were used for the model validation. The area under the curve (AUC) for the BRT model was calculated as 0.8103 and for CART and RF the AUC were 0.7870 and 0.7119, respectively. Therefore, it was concluded that the BRT model produced the best prediction results while predicting locations of springs followed by CART and RF models, respectively. Geospatially integrated BRT, CART, and RF methods proved to be useful in generating the spring potential map (SPM) with reasonable accuracy.
引用
收藏
页码:1 / 27
页数:27
相关论文
共 96 条
[31]  
Geology Survey of Iran (GSI), 1997, GEOL MAP CHAH E BAKH
[32]   Application of GIS techniques to determine areas most suitable for artificial groundwater recharge in a coastal aquifer in southern Iran [J].
Ghayoumian, J. ;
Saravi, M. Mohseni ;
Feiznia, S. ;
Nouri, B. ;
Malekian, A. .
JOURNAL OF ASIAN EARTH SCIENCES, 2007, 30 (02) :364-374
[33]   Using and comparing two nonparametric methods (CART and MARS) to model the potential distribution of gullies [J].
Gomez Gutierrez, Alvaro ;
Schnabel, Susanne ;
Lavado Contador, J. Francisco .
ECOLOGICAL MODELLING, 2009, 220 (24) :3630-3637
[34]   Modelling the occurrence of gullies in rangelands of southwest Spain [J].
Gomez Gutierrez, Alvaro ;
Schnabel, Susanne ;
Felicisimo, Angel M. .
EARTH SURFACE PROCESSES AND LANDFORMS, 2009, 34 (14) :1894-1902
[35]   Predicting species distribution: offering more than simple habitat models [J].
Guisan, A ;
Thuiller, W .
ECOLOGY LETTERS, 2005, 8 (09) :993-1009
[36]   Integrating GIS and remote sensing for identification of groundwater potential zones in the hilly terrain of Pavagarh, Gujarat, India [J].
Gupta, Manika ;
Srivastava, Prashant K. .
WATER INTERNATIONAL, 2010, 35 (02) :233-245
[37]  
Harrell FE, 1996, STAT MED, V15, P361, DOI 10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO
[38]  
2-4
[39]  
Hastie T., 2001, Springer Series in Statistics, DOI [10.1111/j.1467-985X.2010.00646_6.x, DOI 10.1111/J.1467-985X.2010.00646_6.X, DOI 10.1007/978-0-387-84858-7, 10.1007/978-0-387-21606-5, DOI 10.1007/978-0-387-21606-5]
[40]  
Ho TK, 1998, IEEE T PATTERN ANAL, V20, P832, DOI 10.1109/34.709601