Random Cross-Validation Produces Biased Assessment of Machine Learning Performance in Regional Landslide Susceptibility Prediction

被引：1

作者：

Kumar, Chandan ^{[1
,2
]}

Walton, Gabriel ^{[1
]}

Santi, Paul ^{[1
]}

Luza, Carlos ^{[3
]}

机构：

[1] Colorado Sch Mines, Dept Geol & Geol Engn, Golden, CO 80401 USA

[2] Univ Tennessee, Natl Inst Modeling Biol Syst, Knoxville, TN 37996 USA

[3] Univ Nacl San Agustin, Dept Geol Geophys & Mines, Arequipa 04000, Peru

来源：

REMOTE SENSING | 2025年 / 17卷 / 02期

关键词：

landslide susceptibility mapping; machine learning; random cross-validation; spatial autocorrelation; spatial cross-validation; SUPPORT VECTOR MACHINE; DECISION TREE; RANDOM FOREST; MODELS; GIS; EARTHQUAKE; ENTROPY; INDEX;

D O I：

10.3390/rs17020213

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Machine learning (ML) models are extensively used in spatial predictive modeling, including landslide susceptibility prediction. The performance statistics of these models are vital for assessing their reliability, which is typically obtained using the random cross-validation (R-CV) method. However, R-CV has a major drawback, i.e., it ignores the spatial autocorrelation (SAC) inherent in spatial datasets when partitioning the training and testing sets. We assessed the impact of SAC at three crucial phases of ML modeling: hyperparameter tuning, performance evaluation, and learning curve analysis. As an alternative to R-CV, we used spatial cross-validation (S-CV). This method considers SAC when partitioning the training and testing subsets. This experiment was conducted on regional landslide susceptibility prediction using different ML models: logistic regression (LR), k-nearest neighbor (KNN), linear discriminant analysis (LDA), artificial neural networks (ANN), support vector machine (SVM), random forest (RF), and C5.0. The experimental results showed that R-CV often produces optimistic performance estimates, e.g., 6-18% higher than those obtained using the S-CV. R-CV also occasionally fails to reveal the true importance of the hyperparameters of models such as SVM and ANN. Additionally, R-CV falsely portrays a considerable improvement in model performance as the number of variables increases. However, this was not the case when the models were evaluated using S-CV. The impact of SAC was more noticeable in complex models such as SVM, RF, and C5.0 (except for ANN) than in simple models such as LDA and LR (except for KNN). Overall, we recommend S-CV over R-CV for a reliable assessment of ML model performance in large-scale LSM.

引用

页数：23

共 72 条

[1] Improving Spatial Agreement in Machine Learning-Based Landslide Susceptibility Mapping
Adnan, Mohammed Sarfaraz Gani
Rahman, Md Salman
Ahmed, Nahian
Ahmed, Bayes
Rabbi, Md. Fazleh
Rahman, Rashedur M.
[J]. REMOTE SENSING, 2020, 12 (20) : 1 - 23
[2] The spatial leave-pair-out cross-validation method for reliable AUC estimation of spatial classifiers
Airola, Antti
Pohjankukka, Jonne
Torppa, Johanna
Middleton, Maarit
Nykanen, Vesa
Heikkonen, Jukka
Pahikkala, Tapio
[J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 33 (03) : 730 - 747
[3] GIS-based landslide susceptibility modeling: A comparison between fuzzy multi-criteria and machine learning algorithms
Ali, Sk Ajim
Parvin, Farhana
Vojtekova, Jana
Costache, Romulus
Nguyen Thi Thuy Linh
Quoc Bao Pham
Vojtek, Matej
Gigovic, Ljubomir
Ahmad, Ateeque
Ghorbani, Mohammad Ali
[J]. GEOSCIENCE FRONTIERS, 2021, 12 (02) : 857 - 876
[4] A permutation test and spatial cross-validation approach to assess models of interspecific competition between trees
Allen, David
Kim, Albert Y.
[J]. PLOS ONE, 2020, 15 (03):
[5] A novel integrated model for assessing landslide susceptibility mapping using CHAID and AHP pair-wise comparison
Althuwaynee, Omar F.
Pradhan, Biswajeet
Lee, Saro
[J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2016, 37 (05) : 1190 - 1209
[6] Ayodele TO., 2010, New advances in machine learning, V3, P19
[7] On the complexity of model complexity: Viewpoints across the geosciences
Baartman, Jantiene E. M.
Melsen, Lieke A.
Moore, Demie
van der Ploeg, Martine J.
[J]. CATENA, 2020, 186
[8] Bergstra J, 2012, J MACH LEARN RES, V13, P281
[9] Bischl B, 2016, J MACH LEARN RES, V17
[10] Benchmark for filter methods for feature selection in high-dimensional classification data
Bommert, Andrea
Sun, Xudong
Bischl, Bernd
Rahnenfuehrer, Joerg
Lang, Michel
[J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2020, 143

← 1 2 3 4 5 6 7 8 →