Sample size effects on landslide susceptibility models: A comparative study of heuristic, statistical, machine learning, deep learning and ensemble learning models with SHAP analysis

被引:1
作者
Yang, Shilong [1 ]
Tan, Jiayao [1 ]
Luo, Danyuan [1 ]
Wang, Yuzhou [2 ,3 ]
Guo, Xu [1 ]
Zhu, Qiuyu [1 ,4 ]
Ma, Chuanming [1 ]
Xiong, Hanxiang [1 ]
机构
[1] China Univ Geosci, Sch Environm Studies, Wuhan 430074, Peoples R China
[2] Eastern Inst Technol, Eastern Inst Adv Study, Ningbo 315200, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Environm Sci & Engn, Shanghai 200240, Peoples R China
[4] Hangzhou Yuhang Urban Dev Investment Grp Co Ltd, Hangzhou 311100, Peoples R China
关键词
Landslide susceptibility assessment; Model robustness; Inventory sample size; XGBoost and LightGBM; Explainable machine learning; ANALYTICAL HIERARCHY PROCESS; FREQUENCY RATIO MODEL; LOGISTIC-REGRESSION; NEURAL-NETWORKS; GIS; AREA; HAZARD; PROVINCE; BASIN; INDEX;
D O I
10.1016/j.cageo.2024.105723
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In landslide susceptibility assessment (LSA), inventory incompleteness impacts the accuracy of different models to varying degrees. However, this area remains under-researched. This study investigated six LSA models from heuristic, statistical, machine learning and ensemble learning models (analytical hierarchy process (AHP), frequency ratio (FR), logistic regression (LR), Keras based deep learning (KBDL), XGBoost, and LightGBM) across six different sample sizes (100%, 90%, 75%, 50%, 25%, and 10%). Results revealed that XGBoost and LightGBM consistently outperformed other models across all sample sizes. The LR and KBDL models followed, while FR model was the most affected by sample size variations. AHP, an empirical model, remained unaffected by sample size. Through SHapley Additive exPlanations (SHAP) analysis, elevation, NDVI, slope, land use, and distance to roads and rivers emerged as pivotal indicators for landslide occurrences in the study area, suggesting that human activities significantly influence these events. Five time-varying indicators regarding human activity and climate validated this inference, which provides a new method to identify landslide triggering factors, especially in areas of intense human activity. Based on the findings, a comprehensive framework for LSA is proposed to assist landslide managers in making informed decisions. Future research should focus on expanding model diversity to address the effects of sample size, enhancing the adaptability of the LSA framework, deepening the analysis of human activity impacts on landslides using explainable machine learning techniques, addressing temporal inventory incompleteness in LSA, and critically evaluating model sensitivity to sample size variations across multiple disciplines.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Evaluating the Performance of Individual and Novel Ensemble of Machine Learning and Statistical Models for Landslide Susceptibility Assessment at Rudraprayag District of Garhwal Himalaya
    Saha, Sunil
    Saha, Anik
    Hembram, Tusar Kanti
    Pradhan, Biswajeet
    Alamri, Abdullah M.
    APPLIED SCIENCES-BASEL, 2020, 10 (11):
  • [32] Uncertainty analysis of non-landslide sample selection in landslide susceptibility prediction using slope unit-based machine learning models
    Chang, Zhilu
    Huang, Jinsong
    Huang, Faming
    Bhuyan, Kushanav
    Meena, Sansar Raj
    Catani, Filippo
    GONDWANA RESEARCH, 2023, 117 : 307 - 320
  • [33] Application of statistical and machine learning techniques for landslide susceptibility mapping in the Himalayan road corridors
    Sarfraz, Yasir
    Basharat, Muhammad
    Riaz, Muhammad Tayyib
    Akram, Mian Sohail
    Xu, Chong
    Ahmed, Khawaja Shoaib
    Shahzad, Amir
    Al-Ansari, Nadhir
    Linh, Nguyen Thi Thuy
    OPEN GEOSCIENCES, 2022, 14 (01) : 1606 - 1635
  • [34] A comparative study of heterogeneous ensemble-learning techniques for landslide susceptibility mapping
    Fang, Zhice
    Wang, Yi
    Peng, Ling
    Hong, Haoyuan
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2021, 35 (02) : 321 - 347
  • [35] Landslide susceptibility assessment through TrAdaBoost transfer learning models using two landslide inventories
    Fu, Zhiyong
    Li, Changdong
    Yao, Wenmin
    CATENA, 2023, 222
  • [36] Landslide susceptibility mapping using deep learning models in Ardabil province, Iran
    Hamedi, Hossein
    Alesheikh, Ali Asghar
    Panahi, Mahdi
    Lee, Saro
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2022, 36 (12) : 4287 - 4310
  • [37] Novel ensemble machine learning models in flood susceptibility mapping
    Prasad, Pankaj
    Loveson, Victor Joseph
    Das, Bappa
    Kotha, Mahender
    GEOCARTO INTERNATIONAL, 2022, 37 (16) : 4571 - 4593
  • [38] Landslide Susceptibility Prediction Considering Regional Soil Erosion Based on Machine-Learning Models
    Huang, Faming
    Chen, Jiawu
    Du, Zhen
    Yao, Chi
    Huang, Jinsong
    Jiang, Qinghui
    Chang, Zhilu
    Li, Shu
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2020, 9 (06)
  • [39] Comparing classical statistic and machine learning models in landslide susceptibility mapping in Ardanuc (Artvin), Turkey
    Akinci, Halil
    Zeybek, Mustafa
    NATURAL HAZARDS, 2021, 108 (02) : 1515 - 1543
  • [40] Comparing classical statistic and machine learning models in landslide susceptibility mapping in Ardanuc (Artvin), Turkey
    Halil Akinci
    Mustafa Zeybek
    Natural Hazards, 2021, 108 : 1515 - 1543