Landslide susceptibility assessment in Western Henan Province based on a comparison of conventional and ensemble machine learning

被引:11
作者
Cao, Wen-geng [1 ,2 ]
Fu, Yu [3 ]
Dong, Qiu-yao [1 ]
Wang, Hai-gang [4 ]
Ren, Yu [1 ]
Li, Ze-yan [1 ]
Du, Yue-ying [3 ]
机构
[1] Chinese Acad Geol Sci, Inst Hydrogeol & Environm Geol, China Geol Survey, Minist Nat Resources, Shijiazhuang 050061, Peoples R China
[2] Minist Nat Resources, Key Lab Groundwater Sci & Engn, Shijiazhuang 050061, Peoples R China
[3] North China Univ Water Resources & Elect Power, Zhengzhou 450011, Peoples R China
[4] China Inst Geoenvironm Monitoring, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
Landslide susceptibility model; Risk assessment; Machine learning; Support vector machines; Logistic regression; Random forest; Extreme gradient boosting; Linear discriminant analysis; Ensemble modeling; Factor analysis; Geological disaster survey engineering; Middle mountain area; Yellow River Basin; CLASSIFICATION; HAZARDS;
D O I
10.31035/cg2023013
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Landslide is a serious natural disaster next only to earthquake and flood, which will cause a great threat to people 's lives and property safety. The traditional research of landslide disaster based on experience driven or statistical model and its assessment results are subjective , difficult to quantify, and no pertinence. As a new research method for landslide susceptibility assessment, machine learning can greatly improve the landslide susceptibility model's accuracy by constructing statistical models. Taking Western Henan for example, the study selected 16 landslide influencing factors such as topography, geological environment, hydrological conditions, and human activities, and 11 landslide factors with the most significant influence on the landslide were selected by the recursive feature elimination (RFE) method. Five machine learning methods [Support Vector Machines (SVM), Logistic Regression (LR), Random Forest (RF), Extreme Gradient Boosting (XGBoost), and Linear Discriminant Analysis (LDA)] were used to construct the spatial distribution model of landslide susceptibility. The models were evaluated by the receiver operating characteristic curve and statistical index. After analysis and comparison, the XGBoost model (AUC 0.8759) performed the best and was suitable for dealing with regression problems. The model had a high adaptability to landslide data. According to the landslide susceptibility map of the five models, the overall distribution can be observed. The extremely high and high susceptibility areas are distributed in the Funiu Mountain range in the southwest, the Xiaoshan Mountain range in the west, and the Yellow River Basin in the north. These areas have large terrain fluctuations, complicated geological structural environments and frequent human engineering activities. The extremely high and highly prone areas were 12043.3 km2 and 3087.45 km2, accounting for 47.61% and 12.20% of the total area of the study area, respectively. Our study reflects the distribution of landslide susceptibility in western Henan Province, which provides a scientific basis for regional disaster warning, prediction, and resource protection. The study has important practical significance for subsequent landslide disaster management. (c) 2023 China Geology Editorial Office.
引用
收藏
页码:409 / 419
页数:11
相关论文
共 26 条
[11]   Landslide susceptibility analysis in the Hoa Binh province of Vietnam using statistical index and logistic regression [J].
Dieu Tien Bui ;
Lofman, Owe ;
Revhaug, Inge ;
Dick, Oystein .
NATURAL HAZARDS, 2011, 59 (03) :1413-1444
[12]   Local Linear Discriminant Analysis Framework Using Sample Neighbors [J].
Fan, Zizhu ;
Xu, Yong ;
Zhang, David .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (07) :1119-1132
[13]   Greedy function approximation: A gradient boosting machine [J].
Friedman, JH .
ANNALS OF STATISTICS, 2001, 29 (05) :1189-1232
[14]   Stochastic gradient boosting [J].
Friedman, JH .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2002, 38 (04) :367-378
[15]  
Guzzetti F., 2006, Landslide Hazard and Risk Assessment
[16]  
Ministry of Natural Resources of the people's Republic of China, 2022, National Geological Disaster Situation in 2021 and Geological disaster Trend forec ast in 2022
[17]   Recursive Feature Elimination for Machine Learning-based Landslide Prediction Models [J].
Munasinghe, Kusala ;
Karunanayake, Piyumika .
3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (IEEE ICAIIC 2021), 2021, :126-129
[18]   What is a support vector machine? [J].
Noble, William S. .
NATURE BIOTECHNOLOGY, 2006, 24 (12) :1565-1567
[19]   An objective analysis of Support Vector Machine based classification for remote sensing [J].
Oommen, Thomas ;
Misra, Debasmita ;
Twarakavi, Navin K. C. ;
Prakash, Anupma ;
Sahoo, Bhaskar ;
Bandopadhyay, Sukumar .
MATHEMATICAL GEOSCIENCES, 2008, 40 (04) :409-424
[20]  
Savvaidis P.D., 2003, From Stars to Earth and Culture, P242