AWGAN: An adaptive weighting GAN approach for oversampling imbalanced datasets

被引:21
作者
Guan, Shaopeng [1 ]
Zhao, Xiaoyan [1 ]
Xue, Yuewei [1 ]
Pan, Hao [1 ]
机构
[1] Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai 264005, Peoples R China
关键词
Imbalanced dataset; Oversampling technique; Generative adversarial networks; Overlapping; Intra-class imbalance; SMOTE;
D O I
10.1016/j.ins.2024.120311
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Oversampling is a widely employed technique for addressing imbalanced datasets, facing challenges like class overlaps, intra-class imbalance, and noise. In this paper, we introduce an adaptive weighted oversampling algorithm grounded in generative adversarial networks, which we term AWGAN. To begin, our method computes the local and global densities for each instance, confirming its distribution within its local neighborhood, thereby enabling accurate identification and elimination of noisy instances. Subsequently, we devise a weight calculation strategy based on boundary division. Minority class instances are classified into safe and boundary instances, and weights are calculated based on the density of each instance and its distance from the surrounding instances, assigning different weights to overlapping and non -overlapping regions, and sparse and dense region instances, in order to solve the problems of class overlap and intraclass imbalance. Finally, GAN is used to construct a balanced dataset by adaptively generating minority class instances that match the real data distribution based on the weights. We evaluate AWGAN against six traditional oversampling methods and five GAN-based oversampling methods. The experimental results demonstrate that AWGAN significantly enhances classifier performance, as evident in its F1 -Score, AUC, G -mean, and MCC on 21 diverse datasets.
引用
收藏
页数:24
相关论文
共 50 条
[1]   RN-SMOTE: Reduced Noise SMOTE based on DBSCAN for enhancing imbalanced data classification [J].
Arafa, Ahmed ;
El-Fishawy, Nawal ;
Badawy, Mohammed ;
Radad, Marwa .
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) :5059-5074
[2]   SMOTE-LOF for noise identification in imbalanced data classification [J].
Asniar, Nur Ulfa ;
Maulidevi, Nur Ulfa ;
Surendro, Kridanto .
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (06) :3413-3423
[3]   An Investigation of SMOTE Based Methods for Imbalanced Datasets With Data Complexity Analysis [J].
Azhar, Nur Athirah ;
Pozi, Muhammad Syafiq Mohd ;
Din, Aniza Mohamed ;
Jatowt, Adam .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (07) :6651-6672
[4]   Rule extraction in unsupervised anomaly detection for model explainability: Application to OneClass SVM [J].
Barbado, Alberto ;
Corcho, Oscar ;
Benjamins, Richard .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 189
[5]   MWMOTE-Majority Weighted Minority Oversampling Technique for Imbalanced Data Set Learning [J].
Barua, Sukarna ;
Islam, Md. Monirul ;
Yao, Xin ;
Murase, Kazuyuki .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (02) :405-425
[6]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[7]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[8]   Class-Imbalanced Deep Learning via a Class-Balanced Ensemble [J].
Chen, Zhi ;
Duan, Jiang ;
Kang, Li ;
Qiu, Guoping .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) :5626-5640
[9]   Class-overlap undersampling based on Schur decomposition for Class-imbalance problems [J].
Dai, Qi ;
Liu, Jian-wei ;
Shi, Yong-hui .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 221
[10]   Class-imbalanced positive instances augmentation via three-line hybrid [J].
Dai, Qi ;
Liu, Jian-wei ;
Yang, Jia-peng .
KNOWLEDGE-BASED SYSTEMS, 2022, 257