Assessing the performance of GIS-based machine learning models with different accuracy measures for determining susceptibility to gully erosion

被引:165
作者
Garosi, Younes [1 ]
Sheklabadi, Mohsen [1 ]
Conoscenti, Christian [2 ]
Pourghasemi, Hamid Reza [3 ,4 ]
Van Oost, Kristof [5 ,6 ]
机构
[1] Bu Ali Sina Univ, Dept Soil Sci, Fac Agr, Ahmadi Roshan Ave, Hamadan 6517838695, Iran
[2] Univ Palermo, Dept Earth & Sea Sci DISTEM, Via Archirafi 22, I-90123 Palermo, Italy
[3] Nanjing Normal Univ, Coll Marine Sci & Engn, Nanjing 210023, Jiangsu, Peoples R China
[4] Shiraz Univ, Dept Nat Resources & Environm Engn, Coll Agr, Shiraz, Iran
[5] A Fonds Rech Sci, Rue Egmont 5, B-1000 Brussels, Belgium
[6] Catholic Univ Louvain, B TECLIM Georges Lemaitre Ctr Earth & Climate Res, BE-1348 Louvain La Neuve, Belgium
关键词
Discrimination; Gully erosion susceptibility; Machine learning models; Reliability; Latin hypercube sampling technique (cLHS); Topographic attributes; LANDSLIDE SUSCEPTIBILITY; LOGISTIC-REGRESSION; RANDOM FORESTS; NONPARAMETRIC METHODS; SOIL-EROSION; RISK MAP; WATER; CATCHMENT; BIVARIATE; CLASSIFICATION;
D O I
10.1016/j.scitotenv.2019.02.093
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The main purpose was to compare discrimination and reliability of four machine learning models to create gully erosion susceptibility map (GESM) in a part of Ekbatan Dam Basin, Hamedan, western Iran. Extensive field surveys using GPS, and the visual interpretation of satellite images, used to prepare a digital map of the spatial distribution of gullies. 130 locations were sampled to elucidate the spatial distribution of the soil surface properties. Topographic attributes were provided from digital elevation model (DEM). The land use and normalized difference vegetation index (NDVI) maps were created by satellite image ry.The functional relationships between gully erosion and controlling factors were calculated using the random forest (RF), support vector machine (SVM), Na ve Bayes (NB), and generalized additive model (GAM) models. The performance of models was evaluated by 10-fold cross-validation based on efficiency, Kappa coefficient, receiver operating characteristic curve (ROC), mean absolute error (MAE), and root mean square error (RMSE). The results showed that the RF model had the highest amount of efficiency, Kappa coefficient, and AUC and the lowest amounts of MAE and RMSE compared with SVM, NB, and GAM. The RF model showed the highest predictive performance (mean AUC 924%), followed by SVM (mean AUC 90.9%), GAM (mean AUC 89.9%), and NB (mean AUC 87.2%) models. Overall accuracy of the models ranged from excellent (NB, GAM) to outstanding (RF, SVM) classes. The capacity of all models for creating GESM was quite stable when the calibration and validation samples were changed through10-fold cross-validation technique. According to variable importance analysis performed by RF model, the most important variables are distance from rivers, calcium carbonate equivalent (CCE), and topographic position index (TPI). The obtained maps can help identifying areas at risk of gully erosion and facilitate the implementation of plans for soil conservation and sustainable management. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:1117 / 1132
页数:16
相关论文
共 143 条
[21]  
[Anonymous], 2000, NATURE STAT LEARNING, DOI DOI 10.1007/978-1-4757-3264-1
[22]  
[Anonymous], MACH LEARN MACH LEARN
[23]  
[Anonymous], 2005, MORPHO CLIMATIC CLAS
[24]  
[Anonymous], PACKAGE CLHS
[25]  
[Anonymous], 2011, Landforms Analysis
[26]  
[Anonymous], 2601 CSIRO LAND WAT
[27]   GIS-based gully erosion susceptibility mapping: a comparison among three data-driven models and AHP knowledge-based technique [J].
Arabameri, Alireza ;
Rezaei, Khalil ;
Pourghasemi, Hamid Reza ;
Lee, Saro ;
Yamani, Mojtaba .
ENVIRONMENTAL EARTH SCIENCES, 2018, 77 (17)
[28]  
Ash A, 1999, STAT MED, V18, P375, DOI 10.1002/(SICI)1097-0258(19990228)18:4<375::AID-SIM20>3.0.CO
[29]  
2-J
[30]  
Auguie B., 2017, gridExtra: Miscellaneous Functions for"Grid" GraphicsR package version 2.3