Data-GAN Augmentation Techniques in Medical Image Analysis: A Deep Survey

被引：0

作者：

Archana Dash ^{[1
]}

Tripti Swarnkar ^{[2
]}

机构：

[1] Department of Computer Science, SOA University, Odisha, Bhubaneswar

[2] Tata Consultancy Services, Uttar Pradesh, Noida

[3] National Institute of Technology (NITRR) Raipur, Chattisgarh, Raipur

来源：

SN Computer Science | / 6卷 / 4期

关键词：

Artificial intelligence; Class imbalance; Data augmentation; Deep learning; GAN; Medical image analysis;

D O I：

10.1007/s42979-025-03867-9

中图分类号：

学科分类号：

摘要：

Generative Adversarial Networks (GANs) have emerged as a powerful tool for data augmentation in medical imaging, enabling the generation of realistic synthetic images to augment small and heterogeneous training datasets. This work emphasizes to introduce the basics of data augmentation and types (both traditional and advanced) then present a detailed review of different GAN-based data augmentation techniques. Here we discuss the advantages and limitations of each technique and summarize the various evaluation metrics used to assess their performance. In recent studies we reviewed recent studies that have used GAN-based data augmentation to improve the performance of deep learning models in various medical imaging applications, including MRI and CT image analysis, retinal image segmentation, and pulmonary nodule detection. Finally, we discuss the current challenges and future research directions in this field, including the need for large-scale evaluation studies and the development of more efficient and effective GAN-based data augmentation methods. This work provides a comprehensive overview of the state-of-the-art in GAN-based data augmentation techniques in medical image analysis and highlights their potential to improve the accuracy and reliability of deep learning models in medical imaging. Based on our observations, this trend will continue, and we therefore conducted a deep review of recent advances in medical imaging using the GAN techniques with a hope of benefiting researchers interested in this technique. Each finding is complimented by a novel table summary and explains how well the GAN models have blended in each of the medical application with increasing use in coming years with correct amalgamation of model, dataset and strategy chosen for the research problem. This work firmly believe that this survey will prove to be a handy summary for all queries researchers ideally look for before choosing the GAN model as their research problem. © The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd. 2025.

引用

共 79 条

[31]

Isola P., Zhu J.-Y., Zhou T., Efros A.A., Research B.A., Image-to-image translation with conditional adversarial networks

[32]

Arjovsky M., Chintala S., Bottou L., Wasserstein Generative adversarial networks, Proceedings of the 34th International Conference on Machine Learning, 70, pp. 214-223, (2017)

[33]

Kim J., Kim M., Kang H.K., U-gat-it: Unsupervised generative attentional networks with adaptive layer-instance normalization for image-to-image translation., (2019)

[34]

Diaz E., Manzano F.J., Villamil J., Rodriguez J.J., Mohedano A.F., A novel approach for weakly-supervised medical image classification and segmentation using attention-DCGANs, Appl Sci, 9, 23, (2019)

[35]

Viazovetskyi Y., Ivashkin V., Kashin E., Stylegan2 distillation for feed-forward image manipulation, Computer Vision–ECCV 2020: 16Th European Conference, pp. 170-186, (2020)

[36]

Gal R., Bermano A., Zhang H., Cohen-Or D., MRGAN: Multi-rooted 3D shape representation learning with unsupervised part disentanglement, In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 2039-2048, (2021)

[37]

Calli E., Sogancioglu E., van Ginneken B., van Leeuwen K.G., Murphy K., Deep learning for chest X-ray analysis: a survey, Med Image Anal, 72, (2021)

[38]

Kosaraju V., Sadeghian A., Martin-Martin R., Reid I., Rezatofighi H., Savarese S., Social-bigat: Multimodal trajectory forecasting using bicycle-gan and graph attention networks, Advances in neural information processing systems, (2019)

[39]

He Z., Zuo W., Kan M., Shan S., Chen X., AttGAN: facial attribute editing by only changing what you want, IEEE Trans Image Process, 28, 11, pp. 5464-5478, (2019)

[40]

Tang H., Liu H., Xu D., Torr P.H., Sebe N., Attentiongan: Unpaired image-to-image translation using attention-guided generative adversarial networks, IEEE transactions on neural networks and learning systems, 34, 4, pp. 1972-1987, (2021)

← 1 2 3 4 5 6 7 8 →