An ensemble oversampling method for imbalanced classification with prior knowledge via generative adversarial network

被引:7
|
作者
Zhang, Yulin [1 ]
Liu, Yuchen [2 ]
Wang, Yan [2 ]
Yang, Jie [2 ]
机构
[1] Shandong Univ Sci & Technol, Coll Math & Syst Sci, Qingdao 266590, Shandong, Peoples R China
[2] Dalian Univ Technol, Sch Math Sci, Dalian 116024, Peoples R China
基金
中国国家自然科学基金;
关键词
Imbalanced data; Oversampling; Generative adversarial network; Bagging; SMOTE; MODEL; GAN;
D O I
10.1016/j.chemolab.2023.104775
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Currently, an increasing number of real-world applications show characteristics of class-imbalance classification suffering from severe class distribution skewing, thus requiring brand new algorithms to learn from imbalanced datasets. In this paper, a novel oversampling method using GAN framework is proposed for numerical imbalanced data, namely G-GAN. In the method, a Gaussian distribution of minority samples is estimated to get prior knowledge of minority class for the latent space of GAN. In order to increase the randomness of the generated samples, noises are obtained by a mixed strategy, that is, some noises of generator obey Gaussian distribution and others obey random distribution. Then G-GAN is trained to generate dispersive positive samples with the idea of Bagging, which could avoid the occurrence of overfitting. G-GAN is different from other literatures in that GAN does not directly generate minority samples, but adds the distribution information of minority samples to the latent space of GAN, and then generates minority samples. Compared with 11 commonly used oversampling methods, G-GAN obtains promising results in terms of G-mean, AUC, F-measure and ROC utilizing three classifiers on 11 benchmark imbalanced datasets. Furthermore, G-GAN is also validated on AUC metrics of a real Diabetes imbalanced dataset. The results demonstrate that G-GAN can provide great potential for imbalanced classification in the two numerical experiments.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Towards Imbalanced Image Classification: A Generative Adversarial Network Ensemble Learning Method
    Huang, Yangru
    Jin, Yi
    Li, Yidong
    Lin, Zhiping
    IEEE ACCESS, 2020, 8 : 88399 - 88409
  • [2] Oversampling for Imbalanced Data Classification Using Adversarial Network
    Lee, Sang-Kwang
    Hong, Seung-Jin
    Yang, Seong-Il
    2018 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2018, : 1255 - 1257
  • [3] A new imbalanced data oversampling method based on Bootstrap method and Wasserstein Generative Adversarial Network
    Hou, Binjie
    Chen, Gang
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2024, 21 (03) : 4309 - 4327
  • [4] Local Tangent Generative Adversarial Network for Imbalanced Data Classification
    Li, Zhihao
    Yu, Zhiwen
    Yang, Kaixiang
    Shi, Yifan
    Xu, Yuhong
    Chen, C. L. Philip
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [5] Multiview Wasserstein generative adversarial network for imbalanced pearl classification
    Gao, Shuang
    Dai, Yun
    Li, Yingjie
    Liu, Kaixin
    Chen, Kun
    Liu, Yi
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2022, 33 (08)
  • [6] Dual Autoencoders Generative Adversarial Network for Imbalanced Classification Problem
    Wu, Ensen
    Cui, Hongyan
    Welsch, Roy E.
    IEEE ACCESS, 2020, 8 : 91265 - 91275
  • [7] Oversampling method using outlier detectable generative adversarial network
    Oh J.-H.
    Hong J.Y.
    Baek J.-G.
    Expert Systems with Applications, 2019, 133 : 1 - 8
  • [8] The Effectiveness of Generative Adversarial Network-Based Oversampling Methods for Imbalanced Multi-Class Credit Score Classification
    Adiputra, I. Nyoman Mahayasa
    Lin, Pei-Chun
    Wanchai, Paweena
    ELECTRONICS, 2025, 14 (04):
  • [9] Oversampling method using outlier detectable generative adversarial network
    Oh, Joo-Hyuk
    Hong, Jae Yeol
    Baek, Jun-Geol
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 133 : 1 - 8
  • [10] OVERSAMPLING METHOD FOR IMBALANCED CLASSIFICATION
    Zheng, Zhuoyuan
    Cai, Yunpeng
    Li, Ye
    COMPUTING AND INFORMATICS, 2015, 34 (05) : 1017 - 1037