Assessment of the importance of gully erosion effective factors using Boruta algorithm and its spatial modeling and mapping using three machine learning algorithms

被引:187
作者
Amiri, Mandis [1 ]
Pourghasemi, Hamid Reza [1 ]
Ghanbarian, Gholam Abbas [1 ]
Afzali, Sayed Fakhreddin [1 ]
机构
[1] Shiraz Univ, Coll Agr, Dept Nat Resources & Environm Engn, Shiraz, Iran
关键词
Gully erosion; Machine learning; Boruta algorithm; Evidential belief function; Integrating GIS and R; ANALYTICAL HIERARCHY PROCESS; BINARY LOGISTIC-REGRESSION; EVIDENTIAL BELIEF FUNCTION; SUPPORT VECTOR MACHINE; LANDSLIDE SUSCEPTIBILITY; LINEAR EROSION; DECISION TREES; ENSEMBLE; INITIATION; RESOLUTION;
D O I
10.1016/j.geoderma.2018.12.042
中图分类号
S15 [土壤学];
学科分类号
0903 ; 090301 ;
摘要
The Maharloo watershed has witnessed many gullies in the recent due to the specific topo-climatic conditions and man-made activities in that area. The present study is set out to address this issue by producing gully erosion prediction maps via three machine learning algorithms including RF, SVM and BRT in Maharloo watershed, Fars province, Iran. Also, this research attempted to consider the importance of effective factors in the occurrence of gully erosion using Boruta algorithm. To this end, gully erosion locations were identified by extensive field surveys as well as the use of already prepared gully raster map of Maharloo watershed. Then, sixteen causative factors of gully erosion such as elevation, slope degree, slope aspect, plan curvature, TWI, distance from rivers, distance from roads, drainage density, lithology, annual mean rainfall, NDVI, land use and some soil characteristics (pH, clay percent, electrical conductivity-EC, and silt percent) were identified and their maps were produced and classified in the GIS. In this study, the relationships among each agent and gully erosion were defined employing the evidential belief function (EBF) algorithm and the weight of each factor's classes was determined. On the other hand, the results of the collinearity test among the factors showed that sand percentage agent had a VIF > 5; therefore, this covariate was removed from the model. Also, the results of the importance of effective factors using Boruta algorithm indicated that three factors including land use, distance from river, and clay percent had the most noticeable importance in the occurrence of gully erosion in the study area. Finally, the gully erosion susceptibility maps were produced using the RF, BRT, and SVM models in the R statistical software. The results of machine learning techniques were evaluated employing 30% of unused locations in the modeling process as well as the receiver operating characteristic (ROC) curve. Also, in the current research, try to assess the fitting performance of models and their robustness using sensitivity rate, specificity rate, Cohen's Kappa, and 4-fold cross-validation measures. Results showed that the final gully erosion susceptibility maps had an excellent accuracy with AUC values of validation data sets by scenarios 7 and 9 independent factors on gully erosion, respectively (SVM = 0.957, 0.975, RF = 0.991, 0.986, BRT = 0.913, 0.913). The fitting performance measures and robustness technique (4-fold cross-validation) also confirmed the achieved validation results. In order to control and prevent this type of erosion in the Maharloo watershed, there should be protective actions and watershed management measures in place at the primary stages, especially at the beginning of the gully erosion, to control the development of the gully erosion.
引用
收藏
页码:55 / 69
页数:15
相关论文
共 92 条
[1]   Gully erosion in South Eastern Tanzania: spatial distribution and topographic thresholds [J].
Achten, Wouter M. J. ;
Dondeyne, Stefaan ;
Mugogo, Samweli ;
Kafiriti, Elly ;
Poesen, Jean ;
Deckers, Jozef ;
Muys, Bart .
ZEITSCHRIFT FUR GEOMORPHOLOGIE, 2008, 52 (02) :225-235
[2]   Comparison and ranking of different modelling techniques for prediction of site index in Mediterranean mountain forests [J].
Aertsen, Wim ;
Kint, Vincent ;
van Orshoven, Jos ;
Ozkan, Kuersad ;
Muys, Bart .
ECOLOGICAL MODELLING, 2010, 221 (08) :1119-1130
[3]   A novel ensemble bivariate statistical evidential belief function with knowledge-based analytical hierarchy process and multivariate statistical logistic regression for landslide susceptibility mapping [J].
Althuwaynee, Omar F. ;
Pradhan, Biswajeet ;
Park, Hyuck-Jin ;
Lee, Jung Hyun .
CATENA, 2014, 114 :21-36
[4]  
[Anonymous], THESIS
[5]   Spatial modelling of gully erosion using evidential belief function, logistic regression, and a new ensemble of evidential belief function-logistic regression algorithm [J].
Arabameri, Alireza ;
Pradhan, Biswajeet ;
Rezaei, Khalil ;
Yamani, Mojtaba ;
Pourghasemi, Hamid Reza ;
Lombardo, Luigi .
LAND DEGRADATION & DEVELOPMENT, 2018, 29 (11) :4035-4049
[6]   Spatial Modelling of Gully Erosion Using GIS and R Programing: A Comparison among Three Data Mining Algorithms [J].
Arabameri, Alireza ;
Pradhan, Biswajeet ;
Pourghasemi, Hamid Reza ;
Rezaei, Khalil ;
Kerle, Norman .
APPLIED SCIENCES-BASEL, 2018, 8 (08)
[7]   The role of vegetation on gully erosion stabilization at a severely degraded landscape: A case study from Calhoun Experimental Critical Zone Observatory [J].
Bastola, S. ;
Dialynas, Y. G. ;
Bras, R. L. ;
Noto, L. V. ;
Istanbulluoglu, E. .
GEOMORPHOLOGY, 2018, 308 :25-39
[8]   Landslide susceptibility assessment of the Kralovany-Liptovsky Mikulas railway case study [J].
Bednarik, Martin ;
Magulova, Barbora ;
Matys, Mirko ;
Marschalko, Marian .
PHYSICS AND CHEMISTRY OF THE EARTH, 2010, 35 (3-5) :162-171
[9]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[10]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32