VGAN-BL: imbalanced data classification based on generative adversarial network and biased loss

被引:3
作者
Ding, Hongwei [1 ,2 ]
Sun, Yu [1 ,3 ]
Huang, Nana [2 ]
Cui, Xiaohui [1 ,2 ]
机构
[1] Wuhan Univ, Sch Cyber Sci & Engn, Wuhan, Peoples R China
[2] Wuhan Univ, Key Lab Aerosp Informat Secur & Trusted Comp, Minist Educ, Wuhan, Peoples R China
[3] Natl Univ Singapore, Sch Comp, Singapore, Singapore
基金
国家重点研发计划;
关键词
Imbalanced data; Undersampling; Oversampling; VGAN-BL; SAMPLING METHOD; SMOTE;
D O I
10.1007/s00521-023-09180-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The purpose of imbalanced data classification is to solve the problem of unfair learning caused by the large difference in data distribution. Traditional classifiers are designed on the basis of balanced data, but the performance of imbalanced data will decline sharply. Therefore, balancing the majority class and minority class samples before classification is a popular strategy for solving imbalanced learning. Current methods for data balance mainly include oversampling and undersampling. However, the existing undersampling will face the problem of losing important sample information, while oversampling cannot effectively fit the global distribution and generate noise. In recent years, generative adversarial network (GAN) has shown great potential in fitting real sample distributions. Based on this, this paper proposes an improved GAN and biased loss combined model, namely VGAN-BL, to solve the learning problem under imbalanced conditions. In the improvement based on GAN, VAE is used to generate latent vectors with posterior distribution as the input of GAN, and KL similarity measurement loss is introduced into the generator to improve the quality of minority samples generated by GAN. In addition, we propose a biased loss definition method based on the discriminator to improve the performance of classifier. Experiments on 20 real datasets show that the classification performance of the proposed method is significantly improved compared with other advanced methods. The source code can be found here: https://github.com/universuen/VGAN-BL.
引用
收藏
页码:2883 / 2899
页数:17
相关论文
共 50 条
  • [21] A novel generative adversarial network for improving crash severity modeling with imbalanced data
    Chen, Junlan
    Pu, Ziyuan
    Zheng, Nan
    Wen, Xiao
    Ding, Hongliang
    Guo, Xiucheng
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2024, 164
  • [22] Generative adversarial networks for overlapped and imbalanced problems in impact damage classification
    Doan, Quoc Hoan
    Keshtegar, Behrooz
    Kim, Seung-Eock
    Thai, Duc-Kien
    [J]. INFORMATION SCIENCES, 2024, 675
  • [23] Fault Diagnosis of Harmonic Drive With Imbalanced Data Using Generative Adversarial Network
    Yang, Guo
    Zhong, Yong
    Yang, Lie
    Tao, Hui
    Li, Jianying
    Du, Ruxu
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
  • [24] A novel generative adversarial network for improving crash severity modeling with imbalanced data
    Chen, Junlan
    Pu, Ziyuan
    Zheng, Nan
    Wen, Xiao
    Ding, Hongliang
    Guo, Xiucheng
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2024, 164
  • [25] Jujube quality grading using a generative adversarial network with an imbalanced data set
    Cang, Hao
    Yan, Tianying
    Duan, Long
    Yan, Jingkun
    Zhang, Yuan
    Tan, Fei
    Lv, Xin
    Gao, Pan
    [J]. BIOSYSTEMS ENGINEERING, 2023, 236 : 224 - 237
  • [26] An imbalanced data learning method for tool breakage detection based on generative adversarial networks
    Sun, Shixu
    Hu, Xiaofeng
    Liu, Yingchao
    [J]. JOURNAL OF INTELLIGENT MANUFACTURING, 2022, 33 (08) : 2441 - 2455
  • [27] Generative adversarial network based data augmentation to improve cervical cell classification model
    Yu, Suxiang
    Zhang, Shuai
    Wang, Bin
    Dun, Hua
    Xu, Long
    Huang, Xin
    Shi, Ermin
    Feng, Xinxing
    [J]. MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2021, 18 (02) : 1740 - 1752
  • [28] Hyperspectral Image Classification with Imbalanced Data Based on Oversampling and Convolutional Neural Network
    Cai, Lei
    Zhang, Geng
    [J]. AI IN OPTICS AND PHOTONICS (AOPC 2019), 2019, 11342
  • [29] Generative adversarial network and transfer-learning-based fault detection for rotating machinery with imbalanced data condition
    Li, Jun
    Liu, Yongbao
    Li, Qijie
    [J]. MEASUREMENT SCIENCE AND TECHNOLOGY, 2022, 33 (04)
  • [30] The Effectiveness of Generative Adversarial Network-Based Oversampling Methods for Imbalanced Multi-Class Credit Score Classification
    Adiputra, I. Nyoman Mahayasa
    Lin, Pei-Chun
    Wanchai, Paweena
    [J]. ELECTRONICS, 2025, 14 (04):