Performance Evaluation of the GIS-Based Data-Mining Techniques Decision Tree, Random Forest, and Rotation Forest for Landslide Susceptibility Modeling

被引:50
作者
Park, Soyoung [1 ]
Hamm, Se-Yeong [2 ]
Kim, Jinsoo [3 ]
机构
[1] Pukyong Natl Univ, Grad Sch Earth Environm Hazard Syst, BK21 Plus Project, Busan 48513, South Korea
[2] Pusan Natl Univ, Dept Geol Sci, Busan 46241, South Korea
[3] Pukyong Natl Univ, Dept Spatial Informat Engn, Busan 48513, South Korea
基金
新加坡国家研究基金会;
关键词
decision tree; ensemble learning; landslide susceptibility; random forest; rotation forest; ARTIFICIAL NEURAL-NETWORKS; LOGISTIC-REGRESSION; HIERARCHY PROCESS; FREQUENCY RATIO; ENSEMBLE; MACHINE; PREDICTION; AREA; CLASSIFIERS;
D O I
10.3390/su11205659
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This study analyzed and compared landslide susceptibility models using decision tree (DT), random forest (RF), and rotation forest (RoF) algorithms at Woomyeon Mountain, South Korea. Out of a total of 145 landslide locations, 102 locations (70%) were used for model training, and the remaining 43 locations (30%) were used for validation. Fourteen landslide conditioning factors were identified, and the contributions of each factor were evaluated using the RRelief-F algorithm with a 10-fold cross-validation approach. Three factors, timber diameter, age, and density had no contribution to landslide occurrence. Landslide susceptibility maps (LSMs) were produced using DT, RF, and RoF models with the 11 remaining landslide conditioning factors: altitude, slope, aspect, profile curvature, plan curvature, topographic position index, elevation-relief ratio, slope length and slope steepness, topographic wetness index, stream power index, and timber type. The performances of the LSMs were assessed and compared based on sensitivity, specificity, precision, accuracy, kappa index, and receiver operating characteristic curves. The results showed that the ensemble learning methods outperformed the single classifier (DT) and that the RoF model had the highest prediction capability compared to the DT and RF models. The results of this study may be helpful in managing areas vulnerable to landslides and establishing mitigation strategies.
引用
收藏
页数:20
相关论文
共 56 条
[1]  
[Anonymous], LANDSLIDES
[2]  
[Anonymous], P 9 INT WORKSH MACH
[3]  
[Anonymous], MACH LEARN MACH LEARN
[4]  
[Anonymous], STUD INV CAUS DEV RE
[5]  
[Anonymous], 1997, Machine Learning: Proceedings of the Fourteenth International Conference (ICML'97)
[6]  
[Anonymous], 1985, P 4 INT S LANDSL TOR
[7]  
[Anonymous], 2004, COMBINING PATTERN CL, DOI [DOI 10.1002/0471660264.CH7, 10.1002/0471660264.ch7]
[8]   Spatial Modelling of Gully Erosion Using GIS and R Programing: A Comparison among Three Data Mining Algorithms [J].
Arabameri, Alireza ;
Pradhan, Biswajeet ;
Pourghasemi, Hamid Reza ;
Rezaei, Khalil ;
Kerle, Norman .
APPLIED SCIENCES-BASEL, 2018, 8 (08)
[9]  
Beven K.J., 1979, Hydrological Sciences Bulletin, V24, P43, DOI [DOI 10.1080/02626667909491834, 10.1080/02626667909491834]
[10]   A hybrid machine learning ensemble approach based on a Radial Basis Function neural network and Rotation Forest for landslide susceptibility modeling: A case study in the Himalayan area, India [J].
Binh Thai Pham ;
Shirzadi, Ataollah ;
Dieu Tien Bui ;
Prakash, Indra ;
Dholakia, M. B. .
INTERNATIONAL JOURNAL OF SEDIMENT RESEARCH, 2018, 33 (02) :157-170