A GAN-Based Data Augmentation Method for Imbalanced Multi-Class Skin Lesion Classification

Cited by: 19
Authors
Su, Qichen [1]
Hamed, Haza Nuzly Abdull [1]
Isa, Mohd Adham [1]
Hao, Xue [1]
Dai, Xin [1]
Affiliation
[1] Univ Teknol Malaysia UTM, Fac Comp, Johor Baharu 81310, Malaysia
Keywords
Skin; Training; Lesions; Image color analysis; Data augmentation; Generative adversarial networks; Data models; Data imbalance; Skin lesion classification; Diagnosis
DOI
10.1109/ACCESS.2024.3360215
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Skin cancer is one of the most common types of cancer globally. Despite the remarkable advances of deep learning methods in computer vision, automatic diagnosis of skin diseases still faces challenges such as limited data and class imbalance. Generative Adversarial Networks (GANs), which can synthesize realistic data, offer a way to mitigate these issues. However, for imbalanced data, unconditional GANs either generate an uneven data distribution or neglect universal knowledge of the whole dataset, while state-of-the-art (SOTA) conditional GANs suffer from performance degradation due to mode collapse on minority classes. This paper proposes a two-stage GAN-based method, named Self-Transfer GAN (STGAN), to synthesize fine-grained and diverse 256×256-pixel skin lesion images for an imbalanced dataset. STGAN first learns universal knowledge from all classes, then transfers this shared knowledge to each class and fuses it with class-specific knowledge to synthesize high-quality images. Furthermore, a framework built on STGAN is established to enhance classification performance. Both the data generation and classification tasks are evaluated on the HAM10000 dataset. In terms of Fréchet Inception Distance (FID), Inception Score, Precision, and Recall, STGAN improves by 16%, 16%, 4%, and 33%, respectively, compared with the SOTA conditional StyleGAN2. For classification, the STGAN-based framework achieves an Accuracy of 98.23%, Sensitivity of 88.85%, Precision of 90.23%, F1-score of 89.48%, and Specificity of 98.34%.
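The abstract reports five classification metrics for a multi-class, imbalanced problem but does not state how they are averaged across classes. The following is a minimal sketch of one common convention, one-vs-rest counts per class followed by an unweighted (macro) average; it is an assumption for illustration, not necessarily the evaluation protocol used in the paper. The function name `one_vs_rest_metrics` and the toy confusion matrix are hypothetical.

```python
def one_vs_rest_metrics(confusion):
    """Macro-averaged one-vs-rest metrics from a square confusion matrix.

    confusion[i][j] is the number of samples whose true class is i and
    whose predicted class is j. The averaging scheme is an assumption,
    not taken from the paper.
    """
    n = len(confusion)
    total = sum(sum(row) for row in confusion)
    names = ("accuracy", "sensitivity", "precision", "f1", "specificity")
    sums = dict.fromkeys(names, 0.0)

    for k in range(n):
        tp = confusion[k][k]
        fn = sum(confusion[k]) - tp                          # class k missed
        fp = sum(confusion[i][k] for i in range(n)) - tp     # wrongly called k
        tn = total - tp - fn - fp

        sens = tp / (tp + fn) if tp + fn else 0.0
        prec = tp / (tp + fp) if tp + fp else 0.0
        spec = tn / (tn + fp) if tn + fp else 0.0
        f1 = 2 * prec * sens / (prec + sens) if prec + sens else 0.0

        sums["accuracy"] += (tp + tn) / total                # per-class binary accuracy
        sums["sensitivity"] += sens
        sums["precision"] += prec
        sums["f1"] += f1
        sums["specificity"] += spec

    # Unweighted mean over classes: every class counts equally,
    # regardless of how many samples it has.
    return {name: value / n for name, value in sums.items()}


# Toy 3-class example with one dominant class (made-up numbers).
toy_confusion = [
    [90, 5, 5],
    [4, 6, 0],
    [2, 0, 8],
]
macro = one_vs_rest_metrics(toy_confusion)
overall_accuracy = sum(toy_confusion[k][k] for k in range(3)) / 120
```

Under this convention the averaged per-class accuracy counts true negatives, so on imbalanced data it can sit well above both the overall fraction of correctly classified samples and the macro sensitivity. That is one possible reading of a reported Accuracy that is much higher than the Sensitivity, though the record does not confirm which definition the authors used.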
Pages: 16498-16513
Page count: 16