A Review of Generative Adversarial Networks for Computer Vision Tasks

Cited by: 3

Authors:

Simion, Ana-Maria [1]

Radu, Serban [1]

Florea, Adina Magda [1]

Affiliations:

[1] Natl Univ Sci & Technol Politehn Bucharest, Fac Automat Control & Comp Sci, Bucharest 060042, Romania

Source:

ELECTRONICS | 2024 / Vol. 13 / Issue 04

Keywords:

generative adversarial networks; annotated dataset; augmentation; medical imaging; computer vision;

DOI:

10.3390/electronics13040713

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Subject classification code:

0812 ;

Abstract:

In recent years, computer vision tasks have gained considerable popularity, accompanied by the development of numerous powerful architectures consistently delivering outstanding results when applied to well-annotated datasets. However, acquiring a high-quality dataset remains a challenge, particularly in sensitive domains like medical imaging, where cost and ethical concerns are further obstacles. Generative adversarial networks (GANs) offer a possible solution to artificially expand datasets, providing a basic resource for applications requiring large and diverse data. This work presents a thorough review and comparative analysis of the most promising GAN architectures. This review is intended to serve as a valuable reference for selecting the most suitable architecture for diverse projects, diminishing the challenges posed by limited and constrained datasets. Furthermore, we developed practical experimentation, focusing on the augmentation of a medical dataset derived from a colonoscopy video. We also applied one of the GAN architectures outlined in our work to a dataset consisting of histopathology images. The goal was to illustrate how GANs can enhance and augment datasets, showcasing their potential to improve overall data quality. Through this research, we aim to contribute to the broader understanding and application of GANs in scenarios where dataset scarcity poses a significant obstacle, particularly in medical imaging applications.

References

Pages: 17

36 entries in total

[1] NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study [J]. Agustsson, Eirikur; Timofte, Radu. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1122-1131

[2] [Anonymous], Toloka History of Generative AI

[3] Antipov G, 2017, Arxiv, DOI [arXiv:1702.01983, DOI 10.48550/ARXIV.1702.01983]

[4] Arjovsky M, 2017, Arxiv, DOI [arXiv:1701.07875, 10.48550/arXiv.1701.07875]

[5] Armanious K, 2019, Arxiv, DOI [arXiv:1806.06397, DOI 10.1016/J.COMPMEDIMAG.2019.101684]

[6] Synthesizing electronic health records using improved generative adversarial networks [J]. Baowaly, Mrinal Kanti; Lin, Chia-Ching; Liu, Chao-Lin; Chen, Kuan-Ta. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2019, 26 (03) :228-241

[7] Developer N., Detecting Financial Fraud Using GANs at Swedbank with Hopsworks and NVIDIA GPUs

[8] Fundamental Technologies in Modern Speech Recognition [J]. Furui, Sadaoki; Deng, Li; Gales, Mark; Ney, Hermann; Tokuda, Keiichi. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) :16-17

[9] Goodfellow I, 2014, arXiv

[10] Gulrajani I, 2017, Arxiv, DOI [arXiv:1704.00028, 10.48550/arXiv.1704.00028]
