Optimization of negative sample selection for landslide susceptibility mapping based on machine learning using K-means-KNN algorithm

被引:7
作者
Liu, Chao [1 ]
机构
[1] Chengdu Univ Technol, State Key Lab Geohazard Prevent & Geoenvironm Prot, Chengdu 610059, Sichuan, Peoples R China
关键词
Landslide susceptibility; Kmeans-KNN; Sample; Machine learning; SHAP; 3 GORGES RESERVOIR; LOGISTIC-REGRESSION; NEURAL-NETWORKS; CATCHMENT-AREA; DECISION TREE; RANDOM FOREST; VALIDATION; HIMALAYAN; PROVINCE; ANTALYA;
D O I
10.1007/s12145-023-01151-z
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The quality of the sample plays a vital role in developing accurate models using machine learning. This aspect is equally important when evaluating regional landslide susceptibility using machine learning. Previous studies have mostly employed random generation methods to select samples, which often fail to select representative samples. Therefore, this study proposes the KK-sampling method, which uses K-means and KNN algorithms to analyze relevant attributes of the study area and select samples. To evaluate the effectiveness of the proposed method, this study employed MLP, RF, and XGBoost models in conjunction with the KK-sampling method, with Zhong County, Chongqing serving as a case study. The results indicate that the KK-sampling method significantly improves the stability and accuracy of the model. Additionally, this study analyzed the importance of landslide factors in Zhong County using SHAP values. The findings provide a reference for establishing a reasonable and effective landslide susceptibility model in the region.
引用
收藏
页码:4131 / 4152
页数:22
相关论文
共 84 条
  • [1] Spatial modeling and susceptibility zonation of landslides using random forest, naive bayes and K-nearest neighbor in a complicated terrain
    Abu El-Magd, Sherif Ahmed
    Ali, Sk Ajim
    Pham, Quoc Bao
    [J]. EARTH SCIENCE INFORMATICS, 2021, 14 (03) : 1227 - 1243
  • [2] Comparison of machine-learning techniques for landslide susceptibility mapping using two-level random sampling (2LRS) in Alakir catchment area, Antalya, Turkey
    Ada, Metehan
    San, B. Taner
    [J]. NATURAL HAZARDS, 2018, 90 (01) : 237 - 263
  • [3] Improving Spatial Agreement in Machine Learning-Based Landslide Susceptibility Mapping
    Adnan, Mohammed Sarfaraz Gani
    Rahman, Md Salman
    Ahmed, Nahian
    Ahmed, Bayes
    Rabbi, Md. Fazleh
    Rahman, Rashedur M.
    [J]. REMOTE SENSING, 2020, 12 (20) : 1 - 23
  • [4] Basic concepts of artificial neural network (ANN) modeling and its application in pharmaceutical research
    Agatonovic-Kustrin, S
    Beresford, R
    [J]. JOURNAL OF PHARMACEUTICAL AND BIOMEDICAL ANALYSIS, 2000, 22 (05) : 717 - 727
  • [5] Comparing classical statistic and machine learning models in landslide susceptibility mapping in Ardanuc (Artvin), Turkey
    Akinci, Halil
    Zeybek, Mustafa
    [J]. NATURAL HAZARDS, 2021, 108 (02) : 1515 - 1543
  • [6] Landslide susceptibility mapping using an automatic sampling algorithm based on two level random sampling
    Aktas, Hakan
    San, Bekir Taner
    [J]. COMPUTERS & GEOSCIENCES, 2019, 133
  • [7] A comparison of slope units and grid cells as mapping units for landslide susceptibility assessment
    Ba, Qianqian
    Chen, Yumin
    Deng, Susu
    Yang, Jiaxin
    Li, Huifang
    [J]. EARTH SCIENCE INFORMATICS, 2018, 11 (03) : 373 - 388
  • [8] A GIS-based factor clustering and landslide susceptibility analysis using AHP for Gish River Basin, India
    Basu, Tirthankar
    Pal, Swades
    [J]. ENVIRONMENT DEVELOPMENT AND SUSTAINABILITY, 2020, 22 (05) : 4787 - 4819
  • [9] Hybrid integration of Multilayer Perceptron Neural Networks and machine learning ensembles for landslide susceptibility assessment at Himalayan area (India) using GIS
    Binh Thai Pham
    Dieu Tien Bui
    Prakash, Indra
    Dholakia, M. B.
    [J]. CATENA, 2017, 149 : 52 - 63
  • [10] Bishop Christopher M., 1995, Neural Network for Pattern Recognition