A New Integrated Approach for Landslide Data Balancing and Spatial Prediction Based on Generative Adversarial Networks (GAN)

Cited by: 34
Authors
Al-Najjar, Husam A. H. [1]
Pradhan, Biswajeet [1, 2]
Sarkar, Raju [3]
Beydoun, Ghassan [1]
Alamri, Abdullah [4]
Affiliations
[1] Univ Technol Sydney, Fac Engn & IT, Ctr Adv Modelling & Geospatial Informat Syst CAMG, Sydney, NSW 2007, Australia
[2] Univ Kebangsaan Malaysia, UKM, Earth Observat Ctr, Inst Climate Change, Bangi 43600, Selangor, Malaysia
[3] Delhi Technol Univ, Dept Civil Engn, Bawana Rd, Delhi 110042, India
[4] King Saud Univ, Dept Geol & Geophys, Coll Sci, POB 2455, Riyadh 11451, Saudi Arabia
Keywords
landslide susceptibility; imbalanced dataset; machine learning; generative adversarial network; GIS; remote sensing; Bhutan; SUSCEPTIBILITY ASSESSMENT; MOUNTAINS; MODEL; AREA;
DOI
10.3390/rs13194011
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Landslide susceptibility mapping has progressed significantly with improvements in machine learning techniques. However, the inventory/data imbalance (DI) problem remains one of the challenges in this domain. This problem exists because a good-quality landslide inventory map, including a complete record of historical data, is difficult or expensive to collect. This can considerably affect one's ability to obtain a sufficient inventory or representative samples. This research developed a new approach based on generative adversarial networks (GAN) to correct imbalanced landslide datasets. The proposed method was tested at Chukha Dzongkhag, Bhutan, one of the most landslide-prone areas in the Himalayan region. The proposed approach was then compared with standard methods such as the synthetic minority oversampling technique (SMOTE), dense imbalanced sampling, and sparse sampling (i.e., producing as many non-landslide samples as landslide samples). The comparisons were based on five machine learning models: artificial neural networks (ANN), random forests (RF), decision trees (DT), k-nearest neighbours (kNN), and the support vector machine (SVM). Model evaluation was based on overall accuracy (OA), Kappa index, F1-score, and area under the receiver operating characteristic curve (AUROC). The spatial database was established with a total of 269 landslides and 10 conditioning factors: altitude, slope, aspect, total curvature, slope length, lithology, distance from the road, distance from the stream, topographic wetness index (TWI), and sediment transport index (STI). The findings of this study show that both the GAN and SMOTE data balancing approaches helped to improve the accuracy of the machine learning models. According to AUROC, the GAN method boosted the models to their maximum accuracy of ANN (0.918), RF (0.933), DT (0.927), kNN (0.878), and SVM (0.907) when default parameters were used. With the optimum parameters, all models performed best with GAN, at their highest accuracy of ANN (0.927), RF (0.943), DT (0.923), and kNN (0.889), except SVM, which obtained its highest accuracy (0.906) with SMOTE. Our findings suggest that RF balanced with GAN can provide the most reasonable criterion for landslide prediction. This research indicates that landslide data balancing may substantially affect the predictive capabilities of machine learning models. Therefore, the issue of DI in the spatial prediction of landslides should not be ignored. Future studies could explore other generative models for landslide data balancing. By using state-of-the-art GAN, the proposed model can be considered in areas where the data are limited or imbalanced.
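As an illustration of the threshold-based evaluation measures named in the abstract (OA, Kappa index, F1-score), the following is a minimal sketch of how they can be computed from a binary landslide/non-landslide confusion matrix. It is not the authors' code: the function name `evaluation_metrics` and the example counts are hypothetical, and AUROC is omitted because it requires ranked prediction scores rather than counts.

```python
def evaluation_metrics(tp, fp, fn, tn):
    """Overall accuracy, Cohen's kappa, and F1 from binary confusion counts.

    tp/fp/fn/tn are counts with 'landslide' as the positive class.
    Hypothetical helper for illustration only.
    """
    total = tp + fp + fn + tn
    if total == 0:
        raise ValueError("confusion matrix is empty")

    # Overall accuracy: share of samples classified correctly.
    oa = (tp + tn) / total

    # Agreement expected by chance, from the row and column marginals.
    p_pos = ((tp + fp) / total) * ((tp + fn) / total)
    p_neg = ((fn + tn) / total) * ((fp + tn) / total)
    p_chance = p_pos + p_neg

    # Cohen's kappa: accuracy corrected for chance agreement.
    kappa = (oa - p_chance) / (1 - p_chance) if p_chance != 1 else 0.0

    # F1: harmonic mean of precision and recall for the landslide class.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)

    return {"OA": oa, "Kappa": kappa, "F1": f1}


if __name__ == "__main__":
    # Made-up counts, not results from the paper.
    print(evaluation_metrics(tp=40, fp=10, fn=10, tn=40))
```

On a heavily imbalanced test set OA alone can look high even when few landslides are detected, which is one reason to read Kappa and F1 alongside it.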
Pages: 30