Random Cross-Validation Produces Biased Assessment of Machine Learning Performance in Regional Landslide Susceptibility Prediction

被引:1
作者
Kumar, Chandan [1 ,2 ]
Walton, Gabriel [1 ]
Santi, Paul [1 ]
Luza, Carlos [3 ]
机构
[1] Colorado Sch Mines, Dept Geol & Geol Engn, Golden, CO 80401 USA
[2] Univ Tennessee, Natl Inst Modeling Biol Syst, Knoxville, TN 37996 USA
[3] Univ Nacl San Agustin, Dept Geol Geophys & Mines, Arequipa 04000, Peru
关键词
landslide susceptibility mapping; machine learning; random cross-validation; spatial autocorrelation; spatial cross-validation; SUPPORT VECTOR MACHINE; DECISION TREE; RANDOM FOREST; MODELS; GIS; EARTHQUAKE; ENTROPY; INDEX;
D O I
10.3390/rs17020213
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Machine learning (ML) models are extensively used in spatial predictive modeling, including landslide susceptibility prediction. The performance statistics of these models are vital for assessing their reliability, which is typically obtained using the random cross-validation (R-CV) method. However, R-CV has a major drawback, i.e., it ignores the spatial autocorrelation (SAC) inherent in spatial datasets when partitioning the training and testing sets. We assessed the impact of SAC at three crucial phases of ML modeling: hyperparameter tuning, performance evaluation, and learning curve analysis. As an alternative to R-CV, we used spatial cross-validation (S-CV). This method considers SAC when partitioning the training and testing subsets. This experiment was conducted on regional landslide susceptibility prediction using different ML models: logistic regression (LR), k-nearest neighbor (KNN), linear discriminant analysis (LDA), artificial neural networks (ANN), support vector machine (SVM), random forest (RF), and C5.0. The experimental results showed that R-CV often produces optimistic performance estimates, e.g., 6-18% higher than those obtained using the S-CV. R-CV also occasionally fails to reveal the true importance of the hyperparameters of models such as SVM and ANN. Additionally, R-CV falsely portrays a considerable improvement in model performance as the number of variables increases. However, this was not the case when the models were evaluated using S-CV. The impact of SAC was more noticeable in complex models such as SVM, RF, and C5.0 (except for ANN) than in simple models such as LDA and LR (except for KNN). Overall, we recommend S-CV over R-CV for a reliable assessment of ML model performance in large-scale LSM.
引用
收藏
页数:23
相关论文
共 72 条
  • [1] Improving Spatial Agreement in Machine Learning-Based Landslide Susceptibility Mapping
    Adnan, Mohammed Sarfaraz Gani
    Rahman, Md Salman
    Ahmed, Nahian
    Ahmed, Bayes
    Rabbi, Md. Fazleh
    Rahman, Rashedur M.
    [J]. REMOTE SENSING, 2020, 12 (20) : 1 - 23
  • [2] The spatial leave-pair-out cross-validation method for reliable AUC estimation of spatial classifiers
    Airola, Antti
    Pohjankukka, Jonne
    Torppa, Johanna
    Middleton, Maarit
    Nykanen, Vesa
    Heikkonen, Jukka
    Pahikkala, Tapio
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 33 (03) : 730 - 747
  • [3] GIS-based landslide susceptibility modeling: A comparison between fuzzy multi-criteria and machine learning algorithms
    Ali, Sk Ajim
    Parvin, Farhana
    Vojtekova, Jana
    Costache, Romulus
    Nguyen Thi Thuy Linh
    Quoc Bao Pham
    Vojtek, Matej
    Gigovic, Ljubomir
    Ahmad, Ateeque
    Ghorbani, Mohammad Ali
    [J]. GEOSCIENCE FRONTIERS, 2021, 12 (02) : 857 - 876
  • [4] A permutation test and spatial cross-validation approach to assess models of interspecific competition between trees
    Allen, David
    Kim, Albert Y.
    [J]. PLOS ONE, 2020, 15 (03):
  • [5] A novel integrated model for assessing landslide susceptibility mapping using CHAID and AHP pair-wise comparison
    Althuwaynee, Omar F.
    Pradhan, Biswajeet
    Lee, Saro
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2016, 37 (05) : 1190 - 1209
  • [6] Ayodele TO., 2010, New advances in machine learning, V3, P19
  • [7] On the complexity of model complexity: Viewpoints across the geosciences
    Baartman, Jantiene E. M.
    Melsen, Lieke A.
    Moore, Demie
    van der Ploeg, Martine J.
    [J]. CATENA, 2020, 186
  • [8] Bergstra J, 2012, J MACH LEARN RES, V13, P281
  • [9] Bischl B, 2016, J MACH LEARN RES, V17
  • [10] Benchmark for filter methods for feature selection in high-dimensional classification data
    Bommert, Andrea
    Sun, Xudong
    Bischl, Bernd
    Rahnenfuehrer, Joerg
    Lang, Michel
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2020, 143