Investigation of the influence of nonoccurrence sampling on landslide susceptibility assessment using Artificial Neural Networks

被引:68
作者
Lucchese, Luisa Vieira [1 ]
de Oliveira, Guilherme Garcia [2 ]
Pedrollo, Olavo Correa [1 ]
机构
[1] Univ Fed Rio Grande do Sul, Inst Pesquisas Hidraul, Av Bento Goncalves 9500, BR-91501970 Porto Alegre, RS, Brazil
[2] Univ Fed Rio Grande do Sul, Dept Interdisciplinar, Rodovia RS 030,11700,Km 92 Emboaba, BR-95590000 Tramandai, RS, Brazil
关键词
Landslides; Mass movements; South America; Rio Grande do Sul Brazil; Sediment transport; Geomorphology; LOGISTIC-REGRESSION MODEL; RANDOM FOREST; NATURAL SLOPES; ABSENCE DATA; RIVER-BASIN; TURKEY; TREE; VALIDATION; STRATEGIES; PREDICTION;
D O I
10.1016/j.catena.2020.105067
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Landslide susceptibility assessment using Artificial Neural Networks (ANNs) requires occurrence (landslide) and nonoccurrence (not prone to landslide) samples for ANN training. We present empirical evidence that a priori intervention on the nonoccurrence samples can produce models that are improper for generalization. Thirteen nonoccurrence cases based on GIS data from Rolante River basin (828.26 km(2)) in Brazil are studied, divided in three groups. The first group was based on six combinations of buffers with different minimum and maximum distances from the mapped scars (BO). The second group (RO) acquired nonoccurrence only from a rectangle in the lowlands, known for not being susceptible to landslides. For BR, six alternatives respectively to BO were presented, with the inclusion of nonoccurrence samples acquired from the same rectangle used for RO. Accuracy (acc) and the Area Under Receiving Operating Characteristic Curve (AUC) were calculated. RO resulted in perfect discrimination between susceptible and not susceptible to landslides (acc = 1 e AUC = 1). This occurred because the model simply provided susceptible classification to points in which attributes are different from those in the rectangle, harming the classification of nonoccurrence sampling points outside the rectangle. RO map shows large areas classified as susceptible which are known to be non-susceptible. In BR, sampling points from the rectangle, which are easy to classify, were added to the verification sample of BR. Average acc for BO 00 m (minimum buffer distance to scars of 0 m): 89.45%, average acc for BR 00 m: 92.33%, average AUC for BO 00 m: 0.9409, average AUC for BR 00 m: 0.9616. Maps of groups BO and BR were alike. This indicates that metrics can be artificially risen if biased samples are added, although the final map is not visibly affected. To avoid this effect, the employment of easily classifiable samples, generated based on expert knowledge, should be made carefully.
引用
收藏
页数:11
相关论文
共 65 条
[61]   Landslide susceptibility mapping: A comparison of logistic regression and neural networks methods in a medium scale study, Hendek region (Turkey) [J].
Yesilnacar, E ;
Topal, T .
ENGINEERING GEOLOGY, 2005, 79 (3-4) :251-266
[62]   The effect of the sampling strategies on the landslide susceptibility mapping by conditional probability and artificial neural networks [J].
Yilmaz, Isik .
ENVIRONMENTAL EARTH SCIENCES, 2010, 60 (03) :505-519
[63]   Landslide susceptibility mapping at Vaz Watershed (Iran) using an artificial neural network model: a comparison between multilayer perceptron (MLP) and radial basic function (RBF) algorithms [J].
Zare, Mohammad ;
Pourghasemi, Hamid Reza ;
Vafakhah, Mahdi ;
Pradhan, Biswajeet .
ARABIAN JOURNAL OF GEOSCIENCES, 2013, 6 (08) :2873-2888
[64]   A similarity-based approach to sampling absence data for landslide susceptibility mapping using data-driven methods [J].
Zhu, A-Xing ;
Miao, Yamin ;
Liu, Junzhi ;
Bai, Shibiao ;
Zeng, Canying ;
Ma, Tianwu ;
Hong, Haoyuan .
CATENA, 2019, 183
[65]   Comparison of the presence-only method and presence-absence method in landslide susceptibility mapping [J].
Zhu, A-Xing ;
Miao, Yamin ;
Yang, Lin ;
Bai, Shibiao ;
Liu, Junzhi ;
Hong, Haoyuan .
CATENA, 2018, 171 :222-233