共 63 条
Integrating Feature Selection with Machine Learning for Accurate Reservoir Landslide Displacement Prediction
被引:5
作者:
Ge, Qi
[1
]
Wang, Jingyong
[2
]
Liu, Cheng
[1
]
Wang, Xiaohong
[2
]
Deng, Yiyan
[3
]
Li, Jin
[3
]
机构:
[1] Nanjing Forestry Univ, Coll Civil Engn, Nanjing 210037, Peoples R China
[2] Powerchina Huadong Engn Corp, Hangzhou 310014, Peoples R China
[3] Nanjing Univ Informat Sci & Technol, Inst Artificial Intelligence, Nanjing 210044, Peoples R China
来源:
基金:
中国国家自然科学基金;
关键词:
landslide displacement prediction;
feature selection;
machine learning;
Three Gorges Reservoir Area;
TIME-SERIES ANALYSIS;
3 GORGES RESERVOIR;
BAISHUIHE LANDSLIDE;
ALGORITHMS;
MODEL;
D O I:
10.3390/w16152152
中图分类号:
X [环境科学、安全科学];
学科分类号:
08 ;
0830 ;
摘要:
Accurate prediction of reservoir landslide displacements is crucial for early warning and hazard prevention. Current machine learning (ML) paradigms for predicting landslide displacement demonstrate superior performance, while often relying on various feature engineering techniques, such as decomposing into different temporal lags and feature selection. This study investigates the impact of various feature selection techniques on the performance of ML algorithms for landslide displacement prediction. The Shuping and Baishuihe landslides in China's Three Gorges Reservoir Area are used to comprehensively benchmark four prevalent ML algorithms. Both static ML models, including backpropagation neural network (BPNN), support vector machine (SVM), and dynamic models, such as long short-term memory (LSTM), and gated recurrent unit (GRU), are included. Each ML model is evaluated under three feature engineering techniques: raw multivariate time series, and feature selection under maximal information coefficient-partial autocorrelation function (MIC-PACF), or grey relational analysis-PACF (GRA-PACF). The results demonstrate that appropriate feature selection methods could significantly improve the performance of static ML models. In contrast, dynamic models effectively leverage inherent capabilities in capturing temporal dynamics within raw multivariate time series, seeing marginal gains with extensive feature engineering compared to no feature selection strategy. The optimal feature selection approach varies based on the ML model and specific landslide, highlighting the importance of case-specific assessments. The findings in this study offer guidance on integrating feature selection techniques with different machine learning models to maximize the robustness and generalizability of data-driven landslide displacement prediction frameworks.
引用
收藏
页数:19
相关论文