Performance assessment of artificial neural network using chi-square and backward elimination feature selection methods for landslide susceptibility analysis

被引:23
作者
Pham, Binh Thai [1 ]
Van Dao, Dong [2 ]
Acharya, Tri Dev [3 ]
Van Phong, Tran [4 ]
Costache, Romulus [5 ,6 ]
Van Le, Hiep [7 ]
Nguyen, Hanh Bich Thi [7 ]
Prakash, Indra [8 ]
机构
[1] Univ Transport Technol, Geotech Engn & Artificial Intelligence Res Grp GE, 54 Trieu Khuc, Hanoi, Vietnam
[2] Transport Dev & Strategy Inst, 162 Tran Quang Khai, Hanoi, Vietnam
[3] Kangwon Natl Univ, Dept Civil Engn, Chunchon, South Korea
[4] Vietnam Acad Sci & Technol, Inst Geol Sci, 84 Chua Lang St, Hanoi, Vietnam
[5] Transilvania Univ Brasov, Dept Civil Engn, 5 Turnului Str, Brasov 500152, Romania
[6] Danube Delta Natl Inst Res & Dev, 165 Babadag St, Tulcea 820112, Romania
[7] Univ Transport Technol, Dept Civil Engn, 54 Trieu Khuc, Hanoi, Vietnam
[8] Geol Survey India, Dy Director Gen R, Gandhinagar 82010, India
关键词
Landslide susceptibility modeling; Machine learning; Feature selection; Chi square; Backward elimination; Artificial neural networks; MACHINE LEARNING TECHNIQUES; LOGISTIC-REGRESSION; PREDICTION MODEL; FREQUENCY RATIO; ENTROPY MODELS; ALGORITHMS; PROVINCE; FOREST; INDEX; BASIN;
D O I
10.1007/s12665-021-09998-5
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
In the machine learning models, it is desirable to remove most redundant features from the data set to reduce the data processing time and to improve accuracy of the models. In this paper, chi-square (CS) and backward elimination (BE), which are well-known feature selection methods, were used for the optimum selection of input features/factors for training artificial neural network (ANN) for landslide susceptibility modeling. Initially, seventeen landslide affecting factors were considered for the ANN model which were reduced to twelve and eleven based on the ANN optimized by CS (CSANN) and BE (BEANN), respectively. Accuracy (ACC), Kappa Index, root mean square error (RMSE), and area under the receiver operating characteristic (AUROC) curve were used to evaluate and validate performance of the models. Results show that both the feature selection methods (CS and BE) improved significantly performance of the hybrid BEANN and CSANN models in comparison to single ANN model. Results indicated that performance of the BEANN model (AUROC 0.963; ACC 91.31) is the best in comparison to CSANN (AUROC 0.950; ACC 89.80) and ANN (AUROC 0.949; ACC 76.40) models in the accurate prediction of landslide susceptible areas/zones. Therefore, it is reasonable to state that the BE is more effective feature selection method than the CS in improving performance of the ANN model and thus, it can be used for better landslide susceptibility analysis for the landslide management of the area.
引用
收藏
页数:13
相关论文
共 50 条