Landslide Modeling in a Tropical Mountain Basin Using Machine Learning Algorithms and Shapley Additive Explanations

被引:9
作者
Vega, Johnny [1 ]
Sepulveda-Murillo, Fabio Humberto [2 ]
Parra, Melissa [1 ]
机构
[1] Univ Medellin, Fac Ingn, Medellin, Colombia
[2] Univ Medellin, Fac Ciencias Basicas, Medellin, Colombia
来源
AIR SOIL AND WATER RESEARCH | 2023年 / 16卷
关键词
Colombian Andes; landslides; machine learning; SHAP; statistical methods; susceptibility; DECISION TREE; FUZZY MULTICRITERIA; FREQUENCY RATIO; RANDOM FOREST; SUSCEPTIBILITY; SYSTEM; AREA;
D O I
10.1177/11786221231195824
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Landslides are a geological hazard commonly induced by rainfall, earthquakes, deforestation, or human activity causing loss of human life every year specially on highlands or mountain slopes with serious impacts that threaten communities and its infrastructure. The incidence and recurrence of landslides are conditioned by several aspects related to soil properties, geological structure, climatic conditions, soil cover, and water flow. Precisely, Colombia is one of the most affected by this type of natural hazard, as well as by floods, since they are the natural phenomena that bring with them the most severe risks for communities. In this work, we articulated the statistical approach of the landslide conditioning factors, Machine Learning Algorithms (MLA), and Geographic Information System (GIS), evaluating a flexible and agile methodology to estimate the landslide susceptibility defining areas prone to the landslide occurrence. The MLA were validated in a case study in the "La Liboriana" River basin, located in the Municipality of Salgar in the Colombian mountains Andes where Landslide Susceptibility Maps (LSMs) were obtained. The obtained MLA results hold immense potential in the field of regional landslide mapping, facilitating the development of effective strategies aimed at minimizing the devastating impacts on human lives, infrastructure, and the natural environment. By leveraging these findings, proactive measures can be devised to safeguard vulnerable areas, mitigate risks, and ensure the safety and well-being of communities. Seven supervised MLA were employed, two regression algorithms (Logistic) and five decision tree algorithms (Recursive Partitioning and Regression Trees [RPART], Conditional Inference Trees [CTREE], Random Forest [RF], Ranger, and Extreme Gradient Boosting Algorithm [XGBoost]). The LSMs were produced for each MLA. Considering different performance metrics, the RF model yields the best classification accuracy with an area under receiver operating characteristic (ROC) curve of 95% and 90% of accuracy, providing the most representative results. Finally, the contribution of each landslide conditioning factor on predictions with RF model is explained using the SHAP method.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Prediction of Biodiesel Yield Employing Machine Learning: Interpretability Analysis via Shapley Additive Explanations
    Agrawal, Pragati
    Gnanaprakash, R.
    Dhawane, Sumit H.
    FUEL, 2024, 359
  • [22] Spatial prediction and mapping of landslide susceptibility using machine learning models
    Chen, Yu
    NATURAL HAZARDS, 2025, : 8367 - 8385
  • [23] Understanding Arteriosclerotic Heart Disease Patients Using Electronic Health Records: A Machine Learning and Shapley Additive exPlanations Approach
    Miranda, Eka
    Adiarto, Suko
    Bhatti, Faqir M.
    Zakiyyah, Alfi Yusrotis
    Aryuni, Mediana
    Bernando, Charles
    HEALTHCARE INFORMATICS RESEARCH, 2023, 29 (03) : 228 - 238
  • [24] Analysis of factors influencing the energy efficiency in Chinese wastewater treatment plaents through machine learning and SHapley Additive exPlanations
    Li, Jinze
    Du, Zexuan
    Liu, Junyan
    Xu, Linji
    He, Li-ping
    Gu, Li
    Cheng, Hong
    He, Qiang
    SCIENCE OF THE TOTAL ENVIRONMENT, 2024, 920
  • [25] Machine Learning Models Based on Grid-Search Optimization and Shapley Additive Explanations (SHAP) for Early Stroke Prediction
    Al Mamlook, Rabia Emhamed
    Lahwal, Fathia
    Elgeberi, Najat
    Obeidat, Muhammad
    Al-Na'amneh, Qais
    Nasayreh, Ahmad
    Gharaibeh, Hasan
    Gharaibeh, Tasnim
    Bzizi, Hanin
    4TH INTERDISCIPLINARY CONFERENCE ON ELECTRICS AND COMPUTER, INTCEC 2024, 2024,
  • [26] Electricity Consumption Forecasting: An Approach Using Cooperative Ensemble Learning with SHapley Additive exPlanations
    Alba, Eduardo Luiz
    Oliveira, Gilson Adamczuk
    Ribeiro, Matheus Henrique Dal Molin
    Rodrigues, erick Oliveira
    FORECASTING, 2024, 6 (03): : 839 - 863
  • [27] Conditioning factors determination for mapping and prediction of landslide susceptibility using machine learning algorithms
    Al-Najjar, Husam A. H.
    Kalantar, Bahareh
    Pradhan, Biswjaeet
    Saeidi, Vahideh
    EARTH RESOURCES AND ENVIRONMENTAL REMOTE SENSING/GIS APPLICATIONS X, 2019, 11156
  • [28] A comparative study for landslide susceptibility assessment using machine learning algorithms based on grid unit and slope unit
    Deng, Niandong
    Li, Yuxin
    Ma, Jianquan
    Shahabi, Himan
    Hashim, Mazlan
    de Oliveira, Gabriel
    Chaeikar, Saman Shojae
    FRONTIERS IN ENVIRONMENTAL SCIENCE, 2022, 10
  • [29] Viscosity and melting temperature prediction of mold fluxes based on explainable machine learning and SHapley additive exPlanations
    Yan, Wei
    Shen, Yangyang
    Chen, Shoujie
    Wang, Yongyuan
    JOURNAL OF NON-CRYSTALLINE SOLIDS, 2024, 636
  • [30] Machine learning-based Shapley additive explanations approach for corroded pipeline failure mode identification
    Ben Seghier, Mohamed El Amine
    Mohamed, Osama Ahmed
    Ouaer, Hocine
    STRUCTURES, 2024, 65