Unsupervised Image-to-Image Translation: A Review

被引:17
|
作者
Hoyez, Henri [1 ,2 ]
Schockaert, Cedric [1 ]
Rambach, Jason [3 ]
Mirbach, Bruno [3 ]
Stricker, Didier [1 ,2 ]
机构
[1] Paul Wurth SA, L-1122 Luxembourg, Luxembourg
[2] Tech Univ Kaiserslautern, Dept Comp Sci, D-67663 Kaiserslautern, Germany
[3] German Res Ctr Artificial Intelligence DFKI, D-67663 Kaiserslautern, Germany
关键词
unsupervised image-to-image translation; machine learning; computer vision; deep learning; generative adversarial networks; review;
D O I
10.3390/s22218540
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Supervised image-to-image translation has been proven to generate realistic images with sharp details and to have good quantitative performance. Such methods are trained on a paired dataset, where an image from the source domain already has a corresponding translated image in the target domain. However, this paired dataset requirement imposes a huge practical constraint, requires domain knowledge or is even impossible to obtain in certain cases. Due to these problems, unsupervised image-to-image translation has been proposed, which does not require domain expertise and can take advantage of a large unlabeled dataset. Although such models perform well, they are hard to train due to the major constraints induced in their loss functions, which make training unstable. Since CycleGAN has been released, numerous methods have been proposed which try to address various problems from different perspectives. In this review, we firstly describe the general image-to-image translation framework and discuss the datasets and metrics involved in the topic. Furthermore, we revise the current state-of-the-art with a classification of existing works. This part is followed by a small quantitative evaluation, for which results were taken from papers.
引用
收藏
页数:27
相关论文
共 50 条
  • [21] Latent Filter Scaling for Multimodal Unsupervised Image-to-Image Translation
    Alharbi, Yazeed
    Smith, Neil
    Wonka, Peter
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1458 - 1466
  • [22] Towards Unsupervised Deformable-Instances Image-to-Image Translation
    Su, Sitong
    Song, Jingkuan
    Gao, Lianli
    Zhu, Junchen
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1004 - 1010
  • [23] InvolutionGAN: lightweight GAN with involution for unsupervised image-to-image translation
    Deng, Haipeng
    Wu, Qiuxia
    Huang, Han
    Yang, Xiaowei
    Wang, Zhiyong
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (22): : 16593 - 16605
  • [24] TransGaGa: Geometry-Aware Unsupervised Image-to-Image Translation
    Wu, Wayne
    Cao, Kaidi
    Li, Cheng
    Qian, Chen
    Loy, Chen Change
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8004 - 8013
  • [25] InvolutionGAN: lightweight GAN with involution for unsupervised image-to-image translation
    Haipeng Deng
    Qiuxia Wu
    Han Huang
    Xiaowei Yang
    Zhiyong Wang
    Neural Computing and Applications, 2023, 35 : 16593 - 16605
  • [26] DUNIT: Detection-based Unsupervised Image-to-Image Translation
    Bhattacharjee, Deblina
    Kim, Seungryong
    Vizier, Guillaume
    Salzmann, Mathieu
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4786 - 4795
  • [27] Truly Unsupervised Image-to-Image Translation with Contrastive Representation Learning
    Hong, Zhiwei
    Feng, Jianxing
    Jiang, Tao
    COMPUTER VISION - ACCV 2022, PT III, 2023, 13843 : 239 - 255
  • [28] Semantically Consistent Image-to-Image Translation for Unsupervised Domain Adaptation
    Brehm, Stephan
    Scherer, Sebastian
    Lienhart, Rainer
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2022, : 131 - 141
  • [29] Underwater dam crack image generation based on unsupervised image-to-image translation
    Huang, Ben
    Kang, Fei
    Li, Xinyu
    Zhu, Sisi
    AUTOMATION IN CONSTRUCTION, 2024, 163
  • [30] UNSUPERVISED IMAGE-TO-IMAGE TRANSLATION VIA FAIR REPRESENTATI ON OF GENDER BIAS
    Hwang, Sunhee
    Byun, Hyeran
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1953 - 1957