Unsupervised Image-to-Image Translation: A Review

Cited by: 17

Authors:

Hoyez, Henri ^{[1,2]}

Schockaert, Cedric ^{[1]}

Rambach, Jason ^{[3]}

Mirbach, Bruno ^{[3]}

Stricker, Didier ^{[1,2]}

Affiliations:

[1] Paul Wurth SA, L-1122 Luxembourg, Luxembourg

[2] Tech Univ Kaiserslautern, Dept Comp Sci, D-67663 Kaiserslautern, Germany

[3] German Res Ctr Artificial Intelligence DFKI, D-67663 Kaiserslautern, Germany

Source:

SENSORS | 2022 / Vol. 22 / Issue 21

Keywords:

unsupervised image-to-image translation; machine learning; computer vision; deep learning; generative adversarial networks; review;

DOI:

10.3390/s22218540

Chinese Library Classification (CLC) number:

O65 [Analytical Chemistry];

Discipline classification codes:

070302 ; 081704 ;

Abstract:

Supervised image-to-image translation has been shown to generate realistic images with sharp details and to achieve good quantitative performance. Such methods are trained on a paired dataset, in which each image from the source domain has a corresponding translated image in the target domain. However, this paired-dataset requirement is a major practical constraint: building such a dataset requires domain knowledge, and in certain cases it is impossible to obtain. To address these problems, unsupervised image-to-image translation has been proposed, which does not require domain expertise and can take advantage of large unlabeled datasets. Although such models perform well, they are hard to train because of the strong constraints imposed through their loss functions, which make training unstable. Since the release of CycleGAN, numerous methods have been proposed that address various problems from different perspectives. In this review, we first describe the general image-to-image translation framework and discuss the datasets and metrics involved. We then review the current state of the art and provide a classification of existing works. This is followed by a small quantitative evaluation, for which results were taken from the respective papers.


Pages: 27

50 records in total

[21] Latent Filter Scaling for Multimodal Unsupervised Image-to-Image Translation
Alharbi, Yazeed
Smith, Neil
Wonka, Peter
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1458 - 1466
[22] Towards Unsupervised Deformable-Instances Image-to-Image Translation
Su, Sitong
Song, Jingkuan
Gao, Lianli
Zhu, Junchen
PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1004 - 1010
[23] InvolutionGAN: lightweight GAN with involution for unsupervised image-to-image translation
Deng, Haipeng
Wu, Qiuxia
Huang, Han
Yang, Xiaowei
Wang, Zhiyong
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (22): : 16593 - 16605
[24] TransGaGa: Geometry-Aware Unsupervised Image-to-Image Translation
Wu, Wayne
Cao, Kaidi
Li, Cheng
Qian, Chen
Loy, Chen Change
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8004 - 8013
[25] InvolutionGAN: lightweight GAN with involution for unsupervised image-to-image translation
Haipeng Deng
Qiuxia Wu
Han Huang
Xiaowei Yang
Zhiyong Wang
Neural Computing and Applications, 2023, 35 : 16593 - 16605
[26] DUNIT: Detection-based Unsupervised Image-to-Image Translation
Bhattacharjee, Deblina
Kim, Seungryong
Vizier, Guillaume
Salzmann, Mathieu
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4786 - 4795
[27] Truly Unsupervised Image-to-Image Translation with Contrastive Representation Learning
Hong, Zhiwei
Feng, Jianxing
Jiang, Tao
COMPUTER VISION - ACCV 2022, PT III, 2023, 13843 : 239 - 255
[28] Semantically Consistent Image-to-Image Translation for Unsupervised Domain Adaptation
Brehm, Stephan
Scherer, Sebastian
Lienhart, Rainer
ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2022, : 131 - 141
[29] Underwater dam crack image generation based on unsupervised image-to-image translation
Huang, Ben
Kang, Fei
Li, Xinyu
Zhu, Sisi
AUTOMATION IN CONSTRUCTION, 2024, 163
[30] UNSUPERVISED IMAGE-TO-IMAGE TRANSLATION VIA FAIR REPRESENTATION OF GENDER BIAS
Hwang, Sunhee
Byun, Hyeran
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1953 - 1957
