Sample size effects on landslide susceptibility models: A comparative study of heuristic, statistical, machine learning, deep learning and ensemble learning models with SHAP analysis

被引:1
作者
Yang, Shilong [1 ]
Tan, Jiayao [1 ]
Luo, Danyuan [1 ]
Wang, Yuzhou [2 ,3 ]
Guo, Xu [1 ]
Zhu, Qiuyu [1 ,4 ]
Ma, Chuanming [1 ]
Xiong, Hanxiang [1 ]
机构
[1] China Univ Geosci, Sch Environm Studies, Wuhan 430074, Peoples R China
[2] Eastern Inst Technol, Eastern Inst Adv Study, Ningbo 315200, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Environm Sci & Engn, Shanghai 200240, Peoples R China
[4] Hangzhou Yuhang Urban Dev Investment Grp Co Ltd, Hangzhou 311100, Peoples R China
关键词
Landslide susceptibility assessment; Model robustness; Inventory sample size; XGBoost and LightGBM; Explainable machine learning; ANALYTICAL HIERARCHY PROCESS; FREQUENCY RATIO MODEL; LOGISTIC-REGRESSION; NEURAL-NETWORKS; GIS; AREA; HAZARD; PROVINCE; BASIN; INDEX;
D O I
10.1016/j.cageo.2024.105723
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In landslide susceptibility assessment (LSA), inventory incompleteness impacts the accuracy of different models to varying degrees. However, this area remains under-researched. This study investigated six LSA models from heuristic, statistical, machine learning and ensemble learning models (analytical hierarchy process (AHP), frequency ratio (FR), logistic regression (LR), Keras based deep learning (KBDL), XGBoost, and LightGBM) across six different sample sizes (100%, 90%, 75%, 50%, 25%, and 10%). Results revealed that XGBoost and LightGBM consistently outperformed other models across all sample sizes. The LR and KBDL models followed, while FR model was the most affected by sample size variations. AHP, an empirical model, remained unaffected by sample size. Through SHapley Additive exPlanations (SHAP) analysis, elevation, NDVI, slope, land use, and distance to roads and rivers emerged as pivotal indicators for landslide occurrences in the study area, suggesting that human activities significantly influence these events. Five time-varying indicators regarding human activity and climate validated this inference, which provides a new method to identify landslide triggering factors, especially in areas of intense human activity. Based on the findings, a comprehensive framework for LSA is proposed to assist landslide managers in making informed decisions. Future research should focus on expanding model diversity to address the effects of sample size, enhancing the adaptability of the LSA framework, deepening the analysis of human activity impacts on landslides using explainable machine learning techniques, addressing temporal inventory incompleteness in LSA, and critically evaluating model sensitivity to sample size variations across multiple disciplines.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Comparative Study among Bivariate Statistical Models in Landslide Susceptibility Map
    Arifianti, Yukni
    Pamela
    Agustin, Fitriani
    Muslim, Dicky
    INDONESIAN JOURNAL OF GEOSCIENCE, 2020, 7 (01): : 51 - 63
  • [42] Landslide Susceptibility Mapping Using Multi-Criteria Decision-Making (MCDM), Statistical, and Machine Learning Models in the Aube Department, France
    Jari, Abdessamad
    Khaddari, Achraf
    Hajaj, Soufiane
    Bachaoui, El Mostafa
    Mohammedi, Sabine
    Jellouli, Amine
    Mosaid, Hassan
    El Harti, Abderrazak
    Barakat, Ahmed
    EARTH, 2023, 4 (03): : 698 - 713
  • [43] Spatial prediction and mapping of landslide susceptibility using machine learning models
    Chen, Yu
    NATURAL HAZARDS, 2025, : 8367 - 8385
  • [44] Flash flood susceptibility mapping using stacking ensemble machine learning models
    Ilia, Loanna
    Tsangaratos, Paraskevas
    Tzampoglou, Ploutarchos
    Chen, Wei
    Hong, Haoyuan
    GEOCARTO INTERNATIONAL, 2022, 37 (27) : 15010 - 15036
  • [45] Influences of non-landslide sample selection strategies on landslide susceptibility mapping by machine learning
    Dou, Hongqiang
    He, Junbin
    Huang, Siyi
    Jian, Wenbin
    Guo, Chaoxu
    GEOMATICS NATURAL HAZARDS & RISK, 2023, 14 (01)
  • [46] Comparison between Deep Learning and Tree-Based Machine Learning Approaches for Landslide Susceptibility Mapping
    Saha, Sunil
    Roy, Jagabandhu
    Hembram, Tusar Kanti
    Pradhan, Biswajeet
    Dikshit, Abhirup
    Abdul Maulud, Khairul Nizam
    Alamri, Abdullah M.
    WATER, 2021, 13 (19)
  • [47] A comparative study on the landslide susceptibility mapping using logistic regression and statistical index models
    Wu, Zhiyong
    Wu, Yanli
    Yang, Yitian
    Chen, Fuwei
    Zhang, Na
    Ke, Yutian
    Li, Wenping
    ARABIAN JOURNAL OF GEOSCIENCES, 2017, 10 (08)
  • [48] Predictive analysis by ensemble classifier with machine learning models
    Chaya J.D.
    Usha R.N.
    International Journal of Computers and Applications, 2023, 45 (01) : 19 - 26
  • [49] Co-seismic landslide susceptibility mapping for the Luding earthquake area based on heterogeneous ensemble machine learning models
    Zhang, Rui
    Yang, Yunjie
    Wang, Tianyu
    Liu, Anmengyun
    Lv, Jichao
    He, Xu
    Fu, Yin
    Zhang, Bo
    Dai, Keren
    Liu, Guoxiang
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2024, 17 (01)
  • [50] Performance evaluation of machine learning and statistical techniques for modelling landslide susceptibility with limited field data
    Achu, Ashokan Laila
    Thomas, Jobin
    Aju, Chandrika Dhanapalan
    Remani, Praveen Kolappapillai
    Gopinath, Girish
    EARTH SCIENCE INFORMATICS, 2023, 16 (01) : 1025 - 1039