Regional landslide susceptibility assessment based on improved semi-supervised clustering and deep learning

被引:17
作者
Jiang, Yuhang [1 ,2 ]
Wang, Wei [1 ,2 ]
Zou, Lifang [3 ]
Cao, Yajun [1 ,2 ]
机构
[1] Hohai Univ, Key Lab Minist Educ Geomech & Embankment Engn, Nanjing 210098, Jiangsu, Peoples R China
[2] Hohai Univ, Geotech Res Inst, Nanjing 210098, Jiangsu, Peoples R China
[3] Hohai Univ, Sch Earth Sci & Engn, Nanjing 211100, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
1D convolutional neural network (1D CNN); Cosine Euclidean distance; Genetic particle swarm optimization (GAPSO); Gated recurrent unit (GRU); Long short-term memory neural network (LSTM); Semi-supervised fuzzy C-means clustering; ARTIFICIAL NEURAL-NETWORKS; SUPPORT VECTOR MACHINES; LOGISTIC-REGRESSION; 3; GORGES; AREA; SVM; INTEGRATION; MODELS; LSM;
D O I
10.1007/s11440-023-01950-0
中图分类号
P5 [地质学];
学科分类号
0709 ; 081803 ;
摘要
Data-driven model has been increasingly used in landslide susceptibility mapping (LSM). However, the traditional landslide susceptibility assessment lacks reasonable and effective criteria for the selection of negative (non-landslide) samples, which limit the modeling effect of machine learning models. This study proposes a semi-supervised fuzzy C-means clustering based on cosine Euclidean distance (C-SFCM) for reliable landslide negative sample selection, and the genetic particle swarm optimization (GAPSO) algorithm is used to optimize the initial clustering centers. Taking Yiliang County, Yunnan Province, China, as the study area, we selected nine factors, including topography, geological conditions, land cover and human activities as influencing factors for landslide susceptibility mapping, and construct the basic dataset with landslide catalogs. The C-SFCM is performed on the original data set containing a small number of landslide samples and unlabeled samples and to select the low affiliation data far away from the positive samples of landslides as reliable negative samples of landslides. Then, we construct three deep learning models, the 1D convolutional neural network gated recurrent unit (1D CNN-GRU), the 1D convolutional neural networks long short term memory (1D CNN-LSTM) and the deep neural network (DNN), and two traditional machine learning models, support vector machines (SVM) and logistic regression (LR) for training and testing, respectively, and used them for landslide susceptibility mapping. The results show that C-SFCM has a more reasonable negative sample area, while it can identify the spatial characteristics of landslide samples and negative samples effectively than SFCM under traditional Euclidean distance. Compared with the random selection of negative samples scheme, the classifier trained on the C-SFCM-selected dataset has higher accuracy and better prediction capability for potential landslides. Furthermore, 1D CNN-GRU achieved the highest prediction accuracy (AUC = 0.918) in the C-SFCM scenario, with a 9.55% improvement compared to the random selection scenario.
引用
收藏
页码:509 / 529
页数:21
相关论文
共 67 条
[1]   The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains, Central Japan [J].
Ayalew, L ;
Yamagishi, H .
GEOMORPHOLOGY, 2005, 65 (1-2) :15-31
[2]  
Bai F, 2018, COMMUN TECHNOL, V51, P1061, DOI [10.3969/j.issn.1002-0802.2018.05.013, DOI 10.3969/J.ISSN.1002-0802.2018.05.013]
[3]   Hybrid integration of Multilayer Perceptron Neural Networks and machine learning ensembles for landslide susceptibility assessment at Himalayan area (India) using GIS [J].
Binh Thai Pham ;
Dieu Tien Bui ;
Prakash, Indra ;
Dholakia, M. B. .
CATENA, 2017, 149 :52-63
[4]   Regional bias of landslide data in generating susceptibility maps using logistic regression: Case of Hong Kong Island [J].
Chau, KT ;
Chan, JE .
LANDSLIDES, 2005, 2 (04) :280-290
[5]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[6]   Integration of convolutional neural network and conventional machine learning classifiers for landslide susceptibility mapping [J].
Fang, Zhice ;
Wang, Yi ;
Peng, Ling ;
Hong, Haoyuan .
COMPUTERS & GEOSCIENCES, 2020, 139
[7]   Application of semi-supervised fuzzy c-means method in clustering multivariate geochemical data, a case study from the Dalli Cu-Au porphyry deposit in central Iran [J].
Fatehi, Moslem ;
Asadi, Hooshang H. .
ORE GEOLOGY REVIEWS, 2017, 81 :245-255
[8]   Mapping landslide susceptibility with logistic regression, multiple adaptive regression splines, classification and regression trees, and maximum entropy methods: a comparative study [J].
Felicisimo, Angel ;
Cuartero, Aurora ;
Remondo, Juan ;
Quiros, Elia .
LANDSLIDES, 2013, 10 (02) :175-189
[9]   Assessment of shallow landslide susceptibility using artificial neural networks in Jabonosa River Basin, Venezuela [J].
Gómez, H ;
Kavzoglu, T .
ENGINEERING GEOLOGY, 2005, 78 (1-2) :11-27
[10]   Interval prediction of landslide displacement with dual-output least squares support vector machine and particle swarm optimization algorithms [J].
Gong, Wenping ;
Tian, Shan ;
Wang, Lei ;
Li, Zhibin ;
Tang, Huiming ;
Li, Tianzheng ;
Zhang, Liang .
ACTA GEOTECHNICA, 2022, 17 (09) :4013-4031