Quantitative evaluation of uncertainty and interpretability in machine learning-based landslide susceptibility mapping through feature selection and explainable AI

被引:5
作者
Le, Xuan-Hien [1 ,2 ]
Choi, Chanul [1 ]
Eu, Song [3 ]
Yeon, Minho [1 ]
Lee, Giha [1 ]
机构
[1] Kyungpook Natl Univ, Dept Adv Sci & Technol Convergence, Sangju, South Korea
[2] Thuyloi Univ, Fac Water Resources Engn, Hanoi, Vietnam
[3] Natl Inst Forest Sci, Dept Forest Environm & Conservat, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Bayesian optimization; bootstrapping; landslide susceptibility; Monte Carlo; SHAP; uncertainty analysis; LOGISTIC-REGRESSION; STATISTICAL-METHODS; MODELS; PREDICTION; HAZARD; SOIL;
D O I
10.3389/fenvs.2024.1424988
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Landslide susceptibility mapping (LSM) is essential for determining risk regions and guiding mitigation strategies. Machine learning (ML) techniques have been broadly utilized, but the uncertainty and interpretability of these models have not been well-studied. This study conducted a comparative analysis and uncertainty assessment of five ML algorithms-Random Forest (RF), Light Gradient-Boosting Machine (LGB), Extreme Gradient Boosting (XGB), K-Nearest Neighbor (KNN), and Support Vector Machine (SVM)-for LSM in Inje area, South Korea. We optimized these models using Bayesian optimization, a method that refines model performance through probabilistic model-based tuning of hyperparameters. The performance of these algorithms was evaluated using accuracy, Kappa score, and F1 score, with accuracy in detecting landslide-prone locations ranging from 0.916 to 0.947. Among them, the tree-based models (RF, LGB, XGB) showed competitive performance and outperformed the other models. Prediction uncertainty was quantified using bootstrapping and Monte Carlo simulation methods, with the latter providing a more consistent estimate across models. Further, the interpretability of ML predictions was analyzed through sensitivity analysis and SHAP values. We also expanded our investigation to include both the inclusion and exclusion of predictors, providing insights into each significant variable through a comprehensive sensitivity analysis. This paper provides insights into the predictive uncertainty and interpretability of ML algorithms for LSM, contributing to future research in South Korea and beyond.
引用
收藏
页数:15
相关论文
共 75 条
[1]   A review of uncertainty quantification in deep learning: Techniques, applications and challenges [J].
Abdar, Moloud ;
Pourpanah, Farhad ;
Hussain, Sadiq ;
Rezazadegan, Dana ;
Liu, Li ;
Ghavamzadeh, Mohammad ;
Fieguth, Paul ;
Cao, Xiaochun ;
Khosravi, Abbas ;
Acharya, U. Rajendra ;
Makarenkov, Vladimir ;
Nahavandi, Saeid .
INFORMATION FUSION, 2021, 76 :243-297
[2]   Comparison of GIS-based landslide susceptibility models using frequency ratio, logistic regression, and artificial neural network in a tertiary region of Ambon, Indonesia [J].
Aditian, Aril ;
Kubota, Tetsuya ;
Shinohara, Yoshinori .
GEOMORPHOLOGY, 2018, 318 :101-111
[3]   Landslide Susceptibility Mapping Using Machine Learning: A Literature Survey [J].
Ado, Moziihrii ;
Amitab, Khwairakpam ;
Maji, Arnab Kumar ;
Jasinska, Elzbieta ;
Gono, Radomir ;
Leonowicz, Zbigniew ;
Jasinski, Michal .
REMOTE SENSING, 2022, 14 (13)
[4]   Comparing classical statistic and machine learning models in landslide susceptibility mapping in Ardanuc (Artvin), Turkey [J].
Akinci, Halil ;
Zeybek, Mustafa .
NATURAL HAZARDS, 2021, 108 (02) :1515-1543
[5]   The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains, Central Japan [J].
Ayalew, L ;
Yamagishi, H .
GEOMORPHOLOGY, 2005, 65 (1-2) :15-31
[6]   A comparative study of different machine learning methods for landslide susceptibility assessment: A case study of Uttarakhand area (India) [J].
Binh Thai Pham ;
Pradhan, Biswajeet ;
Bui, Dieu Tien ;
Prakash, Indra ;
Dholakia, M. B. .
ENVIRONMENTAL MODELLING & SOFTWARE, 2016, 84 :240-250
[7]  
Brownlee J., 2019, A gentle introduction to Monte Carlo sampling for probability
[8]   Qualitative landslide susceptibility assessment by multicriteria analysis: A case study from San Antonio del sur, Guantanamo, Cuba [J].
Castellanos Abella, Enrique A. ;
Van Westen, Cees J. .
GEOMORPHOLOGY, 2008, 94 (3-4) :453-466
[9]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[10]   GIS-based evaluation of landslide susceptibility using hybrid computational intelligence models [J].
Chen, Wei ;
Li, Yang .
CATENA, 2020, 195