Comparing the Performance of a Logistic Regression and a Random Forest Model in Landslide Susceptibility Assessments. the Case of Wuyaun Area, China

被引:21
作者
Hong, Haoyuan [1 ]
Tsangaratos, Paraskevas [2 ]
Ilia, Ioanna [2 ]
Chen, Wei [3 ]
Xu, Chong [1 ]
机构
[1] China Earthquake Adm, Inst Geol, Key Lab Act Tecton & Volcano, 1 Huayanli, Beijing 100029, Peoples R China
[2] Natl Tech Univ Athens, Sch Min & Met Engn, Dept Geol Studies, Zografou Campus,Heroon Polytechniou 9, Zografos 15780, Greece
[3] Xian Univ Sci & Technol, Coll Geol & Environm, Xian 710054, Shaanxi, Peoples R China
来源
ADVANCING CULTURE OF LIVING WITH LANDSLIDES, VOL 2: ADVANCES IN LANDSLIDE SCIENCE | 2017年
基金
中国国家自然科学基金;
关键词
Landslide susceptibility assessment; Logistic regression; Random forest; Wuyaun area; China; CONDITIONAL-PROBABILITY; NEURAL-NETWORKS; RISK-ASSESSMENT; GIS; SENSITIVITY;
D O I
10.1007/978-3-319-53498-5_118
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The main objectives of the study was to apply a Logistic Regression and a Random Forest model for the construction of a landslide susceptibility map in the Wuyuan area, China, and to compare their results by performing non-parametric and linear regression analysis. Thirteen landslide variables were analyzed, namely: lithology, soil, slope, aspect, altitude, topographic wetness index, stream power index, stream transport index, plan curvature, profile curvature, distance to roads, distance to rivers and distance to faults, while 255 sites classified as landslide and 255 sites classified as non-landslide were separated into a training dataset (70%) and a validation dataset (30%). The comparison and validation of the outcomes of each model were achieved using statistical evaluation measures, the receiving operating characteristic and the area under the success and prediction rate curves. The presence of linear correlation between the two models was estimated by performing a simple linear regression analysis. The most accurate model was Random Forest, which identified correctly 98.32% of the instances during the training phase, followed by Logistic Regression (87.43%). During the validation phase, the Random Forest achieved a classification accuracy of 85.52%, while Logistic Regression model achieved an accuracy of 80.92%. The area under the success and prediction rate curves for the Random Forest were calculated to be 0.9805 and 0.9324, respectively, while the Logistic Regression model showed as slightly lower predictive performance, 0.9372 and 0.8903 respectively. Finally, by performing a non-parametric analysis, the two models were found to be significantly different. Strong evidence of linear relationship between the two models exist, having a p-value less than 0.0001 at a 95% confidence level and an R-2 value estimated to be 0.6993 indicating that 69.93% of the variability in the Logistic Regression model can be explained by variation in the Random Forest model.
引用
收藏
页码:1043 / 1050
页数:8
相关论文
共 21 条
[1]   Application of remote sensing data and GIS for landslide risk assessment as an environmental threat to Izmir city (west Turkey) [J].
Akgun, Aykut ;
Kincal, Cem ;
Pradhan, Biswajeet .
ENVIRONMENTAL MONITORING AND ASSESSMENT, 2012, 184 (09) :5453-5470
[2]   The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains, Central Japan [J].
Ayalew, L ;
Yamagishi, H .
GEOMORPHOLOGY, 2005, 65 (1-2) :15-31
[3]   A comparative study of different machine learning methods for landslide susceptibility assessment: A case study of Uttarakhand area (India) [J].
Binh Thai Pham ;
Pradhan, Biswajeet ;
Bui, Dieu Tien ;
Prakash, Indra ;
Dholakia, M. B. .
ENVIRONMENTAL MODELLING & SOFTWARE, 2016, 84 :240-250
[4]   Landslide susceptibility estimation by random forests technique: sensitivity and scaling issues [J].
Catani, F. ;
Lagomarsino, D. ;
Segoni, S. ;
Tofani, V. .
NATURAL HAZARDS AND EARTH SYSTEM SCIENCES, 2013, 13 (11) :2815-2831
[5]   Land use/land cover change and driving effects of water environment system in Dunhuang Basin, northwestern China [J].
Chen, Weitao ;
Wang, Yanxin ;
Li, Xianju ;
Zou, Yi ;
Liao, Yiwei ;
Yang, Juncang .
ENVIRONMENTAL EARTH SCIENCES, 2016, 75 (12)
[6]   Landslide risk assessment and management: an overview [J].
Dai, FC ;
Lee, CF ;
Ngai, YY .
ENGINEERING GEOLOGY, 2002, 64 (01) :65-87
[7]   Spatial prediction models for shallow landslide hazards: a comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree [J].
Dieu Tien Bui ;
Tran Anh Tuan ;
Klempe, Harald ;
Pradhan, Biswajeet ;
Revhaug, Inge .
LANDSLIDES, 2016, 13 (02) :361-378
[8]  
Gorsevski P. V., 2006, Transactions in GIS, V10, P395, DOI 10.1111/j.1467-9671.2006.01004.x
[9]   NEURAL NETWORK ENSEMBLES [J].
HANSEN, LK ;
SALAMON, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (10) :993-1001
[10]   GIS-based landslide spatial modeling in Ganzhou City, China [J].
Hong, Haoyuan ;
Naghibi, Seyed Amir ;
Pourghasemi, Hamid Reza ;
Pradhan, Biswajeet .
ARABIAN JOURNAL OF GEOSCIENCES, 2016, 9 (02) :1-26