Robustness of Optimized Decision Tree-Based Machine Learning Models to Map Gully Erosion Vulnerability

被引:11
作者
Eloudi, Hasna [1 ]
Hssaisoune, Mohammed [1 ,2 ,3 ]
Reddad, Hanane [4 ]
Namous, Mustapha [5 ]
Ismaili, Maryem [5 ]
Krimissa, Samira [5 ]
Ouayah, Mustapha [5 ]
Bouchaou, Lhoussaine [1 ,3 ]
机构
[1] Ibn Zohr Univ, Fac Sci, Appl Geol & Geoenvironm Lab, Agadir 80000, Morocco
[2] Ibn Zohr Univ, Fac Appl Sci, Ait Melloul 86150, Morocco
[3] Mohammed VI Polytech Univ, Int Water Res Inst, Ben Guerir 43150, Morocco
[4] Sultan Moulay Slimane Univ, Ecole Super Technol Beni Mellal, Lab Ingn & Technol Appl LITA, Beni Mellal 23000, Morocco
[5] Sultan Moulay Slimane Univ, Data Sci Sustainable Earth Lab Data4Earth, Beni Mellal 23000, Morocco
关键词
soil erosion; inventory data; performance; robustness; spatial prediction; LANDSLIDE SUSCEPTIBILITY ASSESSMENT; SOIL-EROSION; LOGISTIC-REGRESSION; SEDIMENT YIELD; CLIMATE-CHANGE; WATER EROSION; SLOPE ASPECT; HIGH-ATLAS; CLASSIFICATION; VEGETATION;
D O I
10.3390/soilsystems7020050
中图分类号
S15 [土壤学];
学科分类号
0903 ; 090301 ;
摘要
Gully erosion is a worldwide threat with numerous environmental, social, and economic impacts. The purpose of this research is to evaluate the performance and robustness of six machine learning ensemble models based on the decision tree principle: Random Forest (RF), C5.0, XGBoost, treebag, Gradient Boosting Machines (GBMs) and Adaboost, in order to map and predict gully erosion-prone areas in a semi-arid mountain context. The first step was to prepare the inventory data, which consisted of 217 gully points. This database was then randomly subdivided into five percentages of Train/Test (50/50, 60/40, 70/30, 80/20, and 90/10) to assess the stability and robustness of the models. Furthermore, 17 geo-environmental variables were used as potential controlling factors, and several metrics were examined to evaluate the performance of the six models. The results revealed that all of the models used performed well in terms of predicting vulnerability to gully erosion. The C5.0 and RF models had the best prediction performance (AUC = 90.8 and AUC = 90.1, respectively). However, according to the random subdivisions of the database, these models exhibit small but noticeable instability, with high performance for the 80/20% and 70/30% subdivisions. This demonstrates the significance of database refining and the need to test various splitting data in order to ensure efficient and reliable output results.
引用
收藏
页数:24
相关论文
共 71 条
[51]  
RENARD KG, 1991, J SOIL WATER CONSERV, V46, P30
[52]   Shallow landslide susceptibility assessment in a semiarid environment - A Quaternary catchment of KwaZulu-Natal, South Africa [J].
Romer, Clarice ;
Ferentinou, Maria .
ENGINEERING GEOLOGY, 2016, 201 :29-44
[53]   Novel Ensemble of Multivariate Adaptive Regression Spline with Spatial Logistic Regression and Boosted Regression Tree for Gully Erosion Susceptibility [J].
Roy, Paramita ;
Chandra Pal, Subodh ;
Arabameri, Alireza ;
Chakrabortty, Rabin ;
Pradhan, Biswajeet ;
Chowdhuri, Indrajit ;
Lee, Saro ;
Tien Bui, Dieu .
REMOTE SENSING, 2020, 12 (20) :1-35
[54]   Machine Learning-Based Gully Erosion Susceptibility Mapping: A Case Study of Eastern India [J].
Saha, Sunil ;
Roy, Jagabandhu ;
Arabameri, Alireza ;
Blaschke, Thomas ;
Dieu Tien Bui .
SENSORS, 2020, 20 (05)
[55]   Assessing the predictive capability of ensemble tree methods for landslide susceptibility mapping using XGBoost, gradient boosting machine, and random forest [J].
Sahin, Emrehan Kutlug .
SN APPLIED SCIENCES, 2020, 2 (07)
[56]   A comparative analysis of statistical and machine learning techniques for mapping the spatial distribution of groundwater salinity in a coastal aquifer [J].
Sahour, Hossein ;
Gholami, Vahid ;
Vazifedan, Mehdi .
JOURNAL OF HYDROLOGY, 2020, 591
[57]   Geomorphic threshold conditions for gully erosion in Southwestern Iran (Boushehr-Samal watershed) [J].
Samani, Aliakbar Nazari ;
Ahmadi, Hassan ;
Jafari, Mohammad ;
Boggs, Guy ;
Ghoddousi, Jamal ;
Malekian, Arash .
JOURNAL OF ASIAN EARTH SCIENCES, 2009, 35 (02) :180-189
[58]   Potential of airborne LiDAR data for terrain parameters extraction [J].
Sharma, Mayank ;
Garg, Rahul Dev ;
Badenko, Vladimir ;
Fedotov, Alexandre ;
Min, Liu ;
Yao, Ada .
QUATERNARY INTERNATIONAL, 2021, 575 :317-327
[59]   PREDICTION OF SEDIMENT YIELD FROM SOUTHERN PLAINS GRASSLANDS WITH THE MODIFIED UNIVERSAL SOIL LOSS EQUATION [J].
SMITH, SJ ;
WILLIAMS, JR ;
MENZEL, RG ;
COLEMAN, GA .
JOURNAL OF RANGE MANAGEMENT, 1984, 37 (04) :295-297
[60]   Predicting gully initiation: comparing data mining techniques, analytical hierarchy processes and the topographic threshold [J].
Svoray, Tal ;
Michailov, Evgenia ;
Cohen, Avraham ;
Rokach, Lior ;
Sturm, Arnon .
EARTH SURFACE PROCESSES AND LANDFORMS, 2012, 37 (06) :607-619