The purpose of imbalanced data classification is to solve the problem of unfair learning caused by the large difference in data distribution. Traditional classifiers are designed on the basis of balanced data, but the performance of imbalanced data will decline sharply. Therefore, balancing the majority class and minority class samples before classification is a popular strategy for solving imbalanced learning. Current methods for data balance mainly include oversampling and undersampling. However, the existing undersampling will face the problem of losing important sample information, while oversampling cannot effectively fit the global distribution and generate noise. In recent years, generative adversarial network (GAN) has shown great potential in fitting real sample distributions. Based on this, this paper proposes an improved GAN and biased loss combined model, namely VGAN-BL, to solve the learning problem under imbalanced conditions. In the improvement based on GAN, VAE is used to generate latent vectors with posterior distribution as the input of GAN, and KL similarity measurement loss is introduced into the generator to improve the quality of minority samples generated by GAN. In addition, we propose a biased loss definition method based on the discriminator to improve the performance of classifier. Experiments on 20 real datasets show that the classification performance of the proposed method is significantly improved compared with other advanced methods. The source code can be found here: https://github.com/universuen/VGAN-BL.
机构:
School of Cyber Science and Engineering, Wuhan University, Wuhan
Key Laboratory of Aerospace Information Security and Trusted Computing, Wuhan University, WuhanSchool of Cyber Science and Engineering, Wuhan University, Wuhan
Ding H.
Cui X.
论文数: 0引用数: 0
h-index: 0
机构:
School of Cyber Science and Engineering, Wuhan University, Wuhan
Key Laboratory of Aerospace Information Security and Trusted Computing, Wuhan University, WuhanSchool of Cyber Science and Engineering, Wuhan University, Wuhan
机构:
Dalian Univ Technol, Sch Math Sci, Dalian 116024, Liaoning, Peoples R China
Key Lab Computat Math & Data Intelligence Liaoning, Dalian 116024, Liaoning, Peoples R ChinaDalian Univ Technol, Sch Math Sci, Dalian 116024, Liaoning, Peoples R China
Chen, Yueqi
Pedrycz, Witold
论文数: 0引用数: 0
h-index: 0
机构:
Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6G 2R3, Canada
Polish Acad Sci, Syst Res Inst, PL-00901 Warsaw, Mazowieckie, Poland
Istinye Univ, Fac Engn & Nat Sci, Dept Comp Engn, TR-34010 Istanbul, TurkiyeDalian Univ Technol, Sch Math Sci, Dalian 116024, Liaoning, Peoples R China
Pedrycz, Witold
Pan, Tingting
论文数: 0引用数: 0
h-index: 0
机构:
Dalian Polytech Univ, Dept Basic Courses Teaching, Dalian 116024, Liaoning, Peoples R ChinaDalian Univ Technol, Sch Math Sci, Dalian 116024, Liaoning, Peoples R China
Pan, Tingting
Wang, Jian
论文数: 0引用数: 0
h-index: 0
机构:
China Univ Petr East China, Coll Sci, Qingdao 266580, Shandong, Peoples R ChinaDalian Univ Technol, Sch Math Sci, Dalian 116024, Liaoning, Peoples R China
Wang, Jian
Yang, Jie
论文数: 0引用数: 0
h-index: 0
机构:
Dalian Univ Technol, Sch Math Sci, Dalian 116024, Liaoning, Peoples R China
Key Lab Computat Math & Data Intelligence Liaoning, Dalian 116024, Liaoning, Peoples R ChinaDalian Univ Technol, Sch Math Sci, Dalian 116024, Liaoning, Peoples R China