Landslide Modeling in a Tropical Mountain Basin Using Machine Learning Algorithms and Shapley Additive Explanations

Cited by: 9
Authors
Vega, Johnny [1 ]
Sepulveda-Murillo, Fabio Humberto [2 ]
Parra, Melissa [1 ]
Affiliations
[1] Univ Medellin, Fac Ingn, Medellin, Colombia
[2] Univ Medellin, Fac Ciencias Basicas, Medellin, Colombia
Source
AIR SOIL AND WATER RESEARCH | 2023 / Vol. 16
Keywords
Colombian Andes; landslides; machine learning; SHAP; statistical methods; susceptibility; DECISION TREE; FUZZY MULTICRITERIA; FREQUENCY RATIO; RANDOM FOREST; SUSCEPTIBILITY; SYSTEM; AREA;
DOI
10.1177/11786221231195824
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science];
Discipline classification code
08 ; 0830 ;
Abstract
Landslides are a geological hazard commonly induced by rainfall, earthquakes, deforestation, or human activity. They cause loss of human life every year, especially in highlands and on mountain slopes, with serious impacts that threaten communities and their infrastructure. The incidence and recurrence of landslides are conditioned by several factors related to soil properties, geological structure, climatic conditions, soil cover, and water flow. Colombia is one of the countries most affected by this type of natural hazard, as well as by floods, since these are the natural phenomena that pose the most severe risks to communities. In this work, we combined a statistical analysis of the landslide conditioning factors, Machine Learning Algorithms (MLA), and Geographic Information Systems (GIS) to evaluate a flexible and agile methodology for estimating landslide susceptibility and delineating areas prone to landslide occurrence. The MLA were validated in a case study in the "La Liboriana" River basin, located in the Municipality of Salgar in the Colombian Andes, where Landslide Susceptibility Maps (LSMs) were obtained. The MLA results show strong potential for regional landslide mapping and can support strategies that minimize impacts on human lives, infrastructure, and the natural environment. With these findings, proactive measures can be devised to protect vulnerable areas, mitigate risks, and safeguard communities. Seven supervised MLA were employed: two regression algorithms (Logistic) and five decision tree algorithms (Recursive Partitioning and Regression Trees [RPART], Conditional Inference Trees [CTREE], Random Forest [RF], Ranger, and Extreme Gradient Boosting [XGBoost]). An LSM was produced for each MLA. Considering different performance metrics, the RF model yielded the best classification performance, with an area under the receiver operating characteristic (ROC) curve of 95% and an accuracy of 90%, providing the most representative results. Finally, the contribution of each landslide conditioning factor to the RF model's predictions is explained using the Shapley Additive Explanations (SHAP) method.
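As a rough illustration of the idea behind SHAP, the sketch below computes exact Shapley values for a toy model by enumerating every coalition of features. It is not the authors' code or data: the function `toy_score`, its coefficients, the three factor names (slope, rainfall, forest cover), and the baseline-replacement value function are all hypothetical assumptions chosen for readability. Practical SHAP implementations use faster approximations or tree-specific algorithms rather than full enumeration.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, x, baseline):
    """Exact Shapley values of predict(x) relative to predict(baseline).

    Assumption: a feature that is "absent" from a coalition is replaced by
    its baseline value. This is a simplification of how SHAP treats
    missing features.
    """
    n = len(x)
    features = range(n)

    def value(coalition):
        # Features in the coalition keep their observed value;
        # all others are set to the baseline.
        z = [x[i] if i in coalition else baseline[i] for i in features]
        return predict(z)

    phi = []
    for i in features:
        others = [j for j in features if j != i]
        total = 0.0
        for k in range(len(others) + 1):
            # Shapley weight for coalitions of size k that exclude feature i.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                s = set(subset)
                total += weight * (value(s | {i}) - value(s))
        phi.append(total)
    return phi


def toy_score(z):
    """Hypothetical susceptibility score with a slope-rainfall interaction."""
    slope, rainfall, forest = z
    return 0.5 * slope + 0.3 * rainfall - 0.2 * forest + 0.4 * slope * rainfall


# Hypothetical grid cell: steep, wet, deforested; baseline: flat, dry, forested.
x = [1.0, 1.0, 0.0]
baseline = [0.0, 0.0, 1.0]
phi = shapley_values(toy_score, x, baseline)

for name, contribution in zip(["slope", "rainfall", "forest"], phi):
    print(f"{name}: {contribution:+.3f}")
```

The contributions sum to the difference between the score at `x` and the score at the baseline, which is the additive property that lets each conditioning factor be assigned a share of an individual prediction.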
Pages: 20