LEGAN: Addressing Intraclass Imbalance in GAN-Based Medical Image Augmentation for Improved Imbalanced Data Classification

被引：12

作者：

Ding, Hongwei ^{[1
,2
]}

Huang, Nana ^{[3
]}

Wu, Yaoxin ^{[4
]}

Cui, Xiaohui ^{[4
]}

机构：

[1] Northeastern Univ Qinhuangdao, Sch Comp & Commun Engn, Qinhuangdao 066004, Peoples R China

[2] Northeastern Univ Qinhuangdao, Hebei Key Lab Marine Percept Network & Data Proc, Qinhuangdao 066004, Peoples R China

[3] Hangzhou Dianzi Univ, Sch Cyberspace, Hangzhou 310000, Peoples R China

[4] Wuhan Univ, Sch Cyber Sci & Engn, Wuhan 430000, Peoples R China

来源：

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT | 2024年 / 73卷

关键词：

Generative adversarial networks; Training; Biomedical imaging; Generators; Data models; Information entropy; Deep learning; Generative adversarial network (GAN); information entropy; intraclass imbalance; mode collapse;

D O I：

10.1109/TIM.2024.3396853

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Currently, the medical image classification is challenged by performance degradation due to imbalanced data. Balancing the data through sample augmentation proves to be an effective solution. However, traditional data augmentation methods and simple linear interpolation fall short in generating more diverse new samples, thereby limiting the enhancement of results with augmented data. Although generative adversarial networks (GANs) models have the potential to generate more diverse samples, current GAN models struggle to effectively address the issue of intraclass mode collapse. In this article, we propose a GAN model structure named LEGAN, based on local outlier factor (LOF) and information entropy, to address this problem. The LEGAN model focuses on resolving mode collapse caused by intraclass imbalances. First, LOF is used to detect sparse and dense sample points in intraclass imbalance, and affine transformations (ATs) are performed on sparse sample points to enhance the diversity of sample data and features. Then, we train LEGAN jointly using the augmented sparse samples and dense samples to effectively learn the sample distribution in sparse regions, thereby generating more diverse sparse samples. Second, we propose a decentralization constraint based on information entropy. This method measures the diversity of generated samples using information entropy during the training process and provides feedback to the generator, encouraging it to optimize towards better diversity. We conducted extensive experiments on three medical datasets, namely, BloodMNIST, OrgancMNIST, and PathMNIST, demonstrating that LEGAN can achieve more diverse intraclass sample generation. The quality of the generated images and the classification performance are both significantly improved.

引用

页码：1 / 14

页数：14

共 46 条

[1] A Review of Local Outlier Factor Algorithms for Outlier Detection in Big Data Streams [J].

Alghushairy, Omar ;

Alsini, Raed ;

Soule, Terence ;

Ma, Xiaogang .

BIG DATA AND COGNITIVE COMPUTING, 2021, 5 (01) :1-24

[2]

Arjovsky M, 2017, PR MACH LEARN RES, V70

[3] Comparison of deep convolution and least squares GANs for diabetic retinopathy image synthesis [J].

Atas, Isa .

NEURAL COMPUTING & APPLICATIONS, 2023, 35 (19) :14431-14448

[4]

Bali Mayank, 2023, Procedia Computer Science, P283, DOI 10.1016/j.procs.2023.01.010

[5] SMOTE: Synthetic minority over-sampling technique [J].

Chawla, Nitesh V. ;

Bowyer, Kevin W. ;

Hall, Lawrence O. ;

Kegelmeyer, W. Philip .

2002, American Association for Artificial Intelligence (16)

[6] Generative Adversarial Networks in Medical Image augmentation: A review [J].

Chen, Yizhou ;

Yang, Xu-Hua ;

Wei, Zihan ;

Heidari, Ali Asghar ;

Zheng, Nenggan ;

Li, Zhicheng ;

Chen, Huiling ;

Hu, Haigen ;

Zhou, Qianwei ;

Guan, Qiu .

COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 144

[7] TMG-GAN: Generative Adversarial Networks-Based Imbalanced Learning for Network Intrusion Detection [J].

Ding, Hongwei ;

Sun, Yu ;

Huang, Nana ;

Shen, Zhidong ;

Cui, Xiaohui .

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 :1156-1167

[8] RGAN-EL: A GAN and ensemble learning-based hybrid approach for imbalanced data classification [J].

Ding, Hongwei ;

Sun, Yu ;

Wang, Zhenyu ;

Huang, Nana ;

Shen, Zhidong ;

Cui, Xiaohui .

INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (02)

[9] Imbalanced data classification: A KNN and generative adversarial networks-based hybrid approach for intrusion detection [J].

Ding, Hongwei ;

Chen, Leiyang ;

Dong, Liang ;

Fu, Zhongwang ;

Cui, Xiaohui .

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 131 :240-254

[10] CDSMOTE: class decomposition and synthetic minority class oversampling technique for imbalanced-data classification [J].

Elyan, Eyad ;

Moreno-Garcia, Carlos Francisco ;

Jayne, Chrisina .

NEURAL COMPUTING & APPLICATIONS, 2021, 33 (07) :2839-2851

← 1 2 3 4 5 →