Sample size effects on landslide susceptibility models: A comparative study of heuristic, statistical, machine learning, deep learning and ensemble learning models with SHAP analysis

被引：1

作者：

Yang, Shilong ^{[1
]}

Tan, Jiayao ^{[1
]}

Luo, Danyuan ^{[1
]}

Wang, Yuzhou ^{[2
,3
]}

Guo, Xu ^{[1
]}

Zhu, Qiuyu ^{[1
,4
]}

Ma, Chuanming ^{[1
]}

Xiong, Hanxiang ^{[1
]}

机构：

[1] China Univ Geosci, Sch Environm Studies, Wuhan 430074, Peoples R China

[2] Eastern Inst Technol, Eastern Inst Adv Study, Ningbo 315200, Peoples R China

[3] Shanghai Jiao Tong Univ, Sch Environm Sci & Engn, Shanghai 200240, Peoples R China

[4] Hangzhou Yuhang Urban Dev Investment Grp Co Ltd, Hangzhou 311100, Peoples R China

来源：

COMPUTERS & GEOSCIENCES | 2024年 / 193卷

关键词：

Landslide susceptibility assessment; Model robustness; Inventory sample size; XGBoost and LightGBM; Explainable machine learning; ANALYTICAL HIERARCHY PROCESS; FREQUENCY RATIO MODEL; LOGISTIC-REGRESSION; NEURAL-NETWORKS; GIS; AREA; HAZARD; PROVINCE; BASIN; INDEX;

D O I：

10.1016/j.cageo.2024.105723

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

In landslide susceptibility assessment (LSA), inventory incompleteness impacts the accuracy of different models to varying degrees. However, this area remains under-researched. This study investigated six LSA models from heuristic, statistical, machine learning and ensemble learning models (analytical hierarchy process (AHP), frequency ratio (FR), logistic regression (LR), Keras based deep learning (KBDL), XGBoost, and LightGBM) across six different sample sizes (100%, 90%, 75%, 50%, 25%, and 10%). Results revealed that XGBoost and LightGBM consistently outperformed other models across all sample sizes. The LR and KBDL models followed, while FR model was the most affected by sample size variations. AHP, an empirical model, remained unaffected by sample size. Through SHapley Additive exPlanations (SHAP) analysis, elevation, NDVI, slope, land use, and distance to roads and rivers emerged as pivotal indicators for landslide occurrences in the study area, suggesting that human activities significantly influence these events. Five time-varying indicators regarding human activity and climate validated this inference, which provides a new method to identify landslide triggering factors, especially in areas of intense human activity. Based on the findings, a comprehensive framework for LSA is proposed to assist landslide managers in making informed decisions. Future research should focus on expanding model diversity to address the effects of sample size, enhancing the adaptability of the LSA framework, deepening the analysis of human activity impacts on landslides using explainable machine learning techniques, addressing temporal inventory incompleteness in LSA, and critically evaluating model sensitivity to sample size variations across multiple disciplines.

引用

页数：19

共 50 条

[21] A Comparative Assessment of Machine Learning Models for Landslide Susceptibility Mapping in the Rugged Terrain of Northern Pakistan
Shahzad, Naeem
Ding, Xiaoli
Abbas, Sawaid
APPLIED SCIENCES-BASEL, 2022, 12 (05):
[22] Using the rotation and random forest models of ensemble learning to predict landslide susceptibility
Zhao, Lingran
Wu, Xueling
Niu, Ruiqing
Wang, Ying
Zhang, Kaixiang
GEOMATICS NATURAL HAZARDS & RISK, 2020, 11 (01) : 1542 - 1564
[23] Landslide Susceptibility Mapping: Machine and Ensemble Learning Based on Remote Sensing Big Data
Kalantar, Bahareh
Ueda, Naonori
Saeidi, Vahideh
Ahmadi, Kourosh
Halin, Alfian Abdul
Shabani, Farzin
REMOTE SENSING, 2020, 12 (11)
[24] A comparison of different machine learning models for landslide susceptibility mapping in Rize (Türkiye)
Bilgilioglu, Hacer
BALTICA, 2023, 36 (02): : 115 - 132
[25] A Novel Heterogeneous Ensemble Framework Based on Machine Learning Models for Shallow Landslide Susceptibility Mapping
Tang, Haozhe
Wang, Changming
An, Silong
Wang, Qingyu
Jiang, Chenglin
REMOTE SENSING, 2023, 15 (17)
[26] Susceptibility Prediction of Groundwater Hardness Using Ensemble Machine Learning Models
Mosavi, Amirhosein
Hosseini, Farzaneh Sajedi
Choubin, Bahram
Abdolshahnejad, Mahsa
Gharechaee, Hamidreza
Lahijanzadeh, Ahmadreza
Dineva, Adrienn A.
WATER, 2020, 12 (10)
[27] A comparative study of the bivariate, multivariate and machine-learning-based statistical models for landslide susceptibility mapping in a seismic-prone region in China
Zhou S.
Zhang Y.
Tan X.
Abbas S.M.
Arabian Journal of Geosciences, 2021, 14 (6)
[28] Evaluation of linear, nonlinear and ensemble machine learning models for landslide susceptibility assessment in southwest China
Wang, Bingwei
Lin, Qigen
Jiang, Tong
Yin, Huaxiang
Zhou, Jian
Sun, Jinhao
Wang, Dongfang
Dai, Ran
GEOCARTO INTERNATIONAL, 2022,
[29] Effects of non-landslide sampling strategies on machine learning models in landslide susceptibility mapping
Gu, Tengfei
Duan, Ping
Wang, Mingguo
Li, Jia
Zhang, Yanke
SCIENTIFIC REPORTS, 2024, 14 (01)
[30] Landslide susceptibility and building exposure assessment using machine learning models and geospatial analysis techniques
Luu, Chinh
Ha, Hang
Tran, Xuan Thong
Ha Vu, Thai
Bui, Quynh Duy
ADVANCES IN SPACE RESEARCH, 2024, 74 (11) : 5489 - 5513

← 1 2 3 4 5 →