A Review of Generative Adversarial Networks for Computer Vision Tasks

Cited by: 3

Authors:

Simion, Ana-Maria [1]

Radu, Serban [1]

Florea, Adina Magda [1]

Affiliations:

[1] Natl Univ Sci & Technol Politehn Bucharest, Fac Automat Control & Comp Sci, Bucharest 060042, Romania

Source:

ELECTRONICS | 2024 / Volume 13 / Issue 04

Keywords:

generative adversarial networks; annotated dataset; augmentation; medical imaging; computer vision;

DOI:

10.3390/electronics13040713

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

In recent years, computer vision tasks have grown markedly in popularity, accompanied by the development of numerous powerful architectures that consistently deliver outstanding results when applied to well-annotated datasets. However, acquiring a high-quality dataset remains a challenge, particularly in sensitive domains such as medical imaging, where cost and ethical concerns are major obstacles. Generative adversarial networks (GANs) offer a possible way to artificially expand datasets, providing a basic resource for applications that require large and diverse data. This work presents a thorough review and comparative analysis of the most promising GAN architectures. The review is intended to serve as a reference for selecting the most suitable architecture for a given project, reducing the challenges posed by limited and constrained datasets. Furthermore, we conducted practical experiments focused on augmenting a medical dataset derived from a colonoscopy video. We also applied one of the GAN architectures outlined in our work to a dataset of histopathology images. The goal was to illustrate how GANs can enhance and augment datasets, showcasing their potential to improve overall data quality. Through this research, we aim to contribute to the broader understanding and application of GANs in scenarios where dataset scarcity poses a significant obstacle, particularly in medical imaging applications.

References

Pages: 17

36 entries in total

[31] Wang XT, 2018, arXiv, arXiv:1809.00219, DOI 10.48550/arXiv.1809.00219

[32] Recovering Realistic Texture in Image Super-resolution by Deep Spatial Feature Transform [J].
Wang, Xintao; Yu, Ke; Dong, Chao; Loy, Chen Change.
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018: 606-615

[33] AMD-GAN: Attention encoder and multi-branch structure based generative adversarial networks for fundus disease detection from scanning laser ophthalmoscopy images [J].
Xie, Hai; Lei, Haijun; Zeng, Xianlu; He, Yejun; Chen, Guozhen; Elazab, Ahmed; Yue, Guanghui; Wang, Jiantao; Zhang, Guoming; Lei, Baiying.
NEURAL NETWORKS, 2020, 132: 477-490

[34] One-Class Classification Using Generative Adversarial Networks [J].
Yang, Yang; Hou, Chunping; Lang, Yue; Yue, Guanghui; He, Yuan.
IEEE ACCESS, 2019, 7: 37970-37979

[35] MHW-GAN: Multidiscriminator Hierarchical Wavelet Generative Adversarial Network for Multimodal Image Fusion [J].
Zhao, Cheng; Yang, Peng; Zhou, Feng; Yue, Guanghui; Wang, Shuigen; Wu, Huisi; Chen, Guoliang; Wang, Tianfu; Lei, Baiying.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (10): 13713-13727

[36] Zhu JY, 2020, arXiv, arXiv:1703.10593, DOI 10.48550/arXiv.1703.10593
