Sample size effects on landslide susceptibility models: A comparative study of heuristic, statistical, machine learning, deep learning and ensemble learning models with SHAP analysis

被引:1
作者
Yang, Shilong [1 ]
Tan, Jiayao [1 ]
Luo, Danyuan [1 ]
Wang, Yuzhou [2 ,3 ]
Guo, Xu [1 ]
Zhu, Qiuyu [1 ,4 ]
Ma, Chuanming [1 ]
Xiong, Hanxiang [1 ]
机构
[1] China Univ Geosci, Sch Environm Studies, Wuhan 430074, Peoples R China
[2] Eastern Inst Technol, Eastern Inst Adv Study, Ningbo 315200, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Environm Sci & Engn, Shanghai 200240, Peoples R China
[4] Hangzhou Yuhang Urban Dev Investment Grp Co Ltd, Hangzhou 311100, Peoples R China
关键词
Landslide susceptibility assessment; Model robustness; Inventory sample size; XGBoost and LightGBM; Explainable machine learning; ANALYTICAL HIERARCHY PROCESS; FREQUENCY RATIO MODEL; LOGISTIC-REGRESSION; NEURAL-NETWORKS; GIS; AREA; HAZARD; PROVINCE; BASIN; INDEX;
D O I
10.1016/j.cageo.2024.105723
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In landslide susceptibility assessment (LSA), inventory incompleteness impacts the accuracy of different models to varying degrees. However, this area remains under-researched. This study investigated six LSA models from heuristic, statistical, machine learning and ensemble learning models (analytical hierarchy process (AHP), frequency ratio (FR), logistic regression (LR), Keras based deep learning (KBDL), XGBoost, and LightGBM) across six different sample sizes (100%, 90%, 75%, 50%, 25%, and 10%). Results revealed that XGBoost and LightGBM consistently outperformed other models across all sample sizes. The LR and KBDL models followed, while FR model was the most affected by sample size variations. AHP, an empirical model, remained unaffected by sample size. Through SHapley Additive exPlanations (SHAP) analysis, elevation, NDVI, slope, land use, and distance to roads and rivers emerged as pivotal indicators for landslide occurrences in the study area, suggesting that human activities significantly influence these events. Five time-varying indicators regarding human activity and climate validated this inference, which provides a new method to identify landslide triggering factors, especially in areas of intense human activity. Based on the findings, a comprehensive framework for LSA is proposed to assist landslide managers in making informed decisions. Future research should focus on expanding model diversity to address the effects of sample size, enhancing the adaptability of the LSA framework, deepening the analysis of human activity impacts on landslides using explainable machine learning techniques, addressing temporal inventory incompleteness in LSA, and critically evaluating model sensitivity to sample size variations across multiple disciplines.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] A Comparative Assessment of Machine Learning Models for Landslide Susceptibility Mapping in the Rugged Terrain of Northern Pakistan
    Shahzad, Naeem
    Ding, Xiaoli
    Abbas, Sawaid
    APPLIED SCIENCES-BASEL, 2022, 12 (05):
  • [22] Using the rotation and random forest models of ensemble learning to predict landslide susceptibility
    Zhao, Lingran
    Wu, Xueling
    Niu, Ruiqing
    Wang, Ying
    Zhang, Kaixiang
    GEOMATICS NATURAL HAZARDS & RISK, 2020, 11 (01) : 1542 - 1564
  • [23] Landslide Susceptibility Mapping: Machine and Ensemble Learning Based on Remote Sensing Big Data
    Kalantar, Bahareh
    Ueda, Naonori
    Saeidi, Vahideh
    Ahmadi, Kourosh
    Halin, Alfian Abdul
    Shabani, Farzin
    REMOTE SENSING, 2020, 12 (11)
  • [24] A comparison of different machine learning models for landslide susceptibility mapping in Rize (Türkiye)
    Bilgilioglu, Hacer
    BALTICA, 2023, 36 (02): : 115 - 132
  • [25] A Novel Heterogeneous Ensemble Framework Based on Machine Learning Models for Shallow Landslide Susceptibility Mapping
    Tang, Haozhe
    Wang, Changming
    An, Silong
    Wang, Qingyu
    Jiang, Chenglin
    REMOTE SENSING, 2023, 15 (17)
  • [26] Susceptibility Prediction of Groundwater Hardness Using Ensemble Machine Learning Models
    Mosavi, Amirhosein
    Hosseini, Farzaneh Sajedi
    Choubin, Bahram
    Abdolshahnejad, Mahsa
    Gharechaee, Hamidreza
    Lahijanzadeh, Ahmadreza
    Dineva, Adrienn A.
    WATER, 2020, 12 (10)
  • [27] A comparative study of the bivariate, multivariate and machine-learning-based statistical models for landslide susceptibility mapping in a seismic-prone region in China
    Zhou S.
    Zhang Y.
    Tan X.
    Abbas S.M.
    Arabian Journal of Geosciences, 2021, 14 (6)
  • [28] Evaluation of linear, nonlinear and ensemble machine learning models for landslide susceptibility assessment in southwest China
    Wang, Bingwei
    Lin, Qigen
    Jiang, Tong
    Yin, Huaxiang
    Zhou, Jian
    Sun, Jinhao
    Wang, Dongfang
    Dai, Ran
    GEOCARTO INTERNATIONAL, 2022,
  • [29] Effects of non-landslide sampling strategies on machine learning models in landslide susceptibility mapping
    Gu, Tengfei
    Duan, Ping
    Wang, Mingguo
    Li, Jia
    Zhang, Yanke
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [30] Landslide susceptibility and building exposure assessment using machine learning models and geospatial analysis techniques
    Luu, Chinh
    Ha, Hang
    Tran, Xuan Thong
    Ha Vu, Thai
    Bui, Quynh Duy
    ADVANCES IN SPACE RESEARCH, 2024, 74 (11) : 5489 - 5513