Rethinking the Truly Unsupervised Image-to-Image Translation

Cited by: 55
Authors
Baek, Kyungjune [1]
Choi, Yunjey [2]
Uh, Youngjung [1]
Yoo, Jaejun [3]
Shim, Hyunjung [1]
Affiliations
[1] Yonsei Univ, Seoul, South Korea
[2] NAVER AI Lab, Seongnam, South Korea
[3] UNIST, Ulsan, South Korea
Source
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021
DOI
10.1109/ICCV48922.2021.01389
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Every recent image-to-image translation model inherently requires either image-level (i.e., input-output pairs) or set-level (i.e., domain labels) supervision. However, even set-level supervision can be a severe bottleneck for data collection in practice. In this paper, we tackle image-to-image translation in a fully unsupervised setting, i.e., with neither paired images nor domain labels. To this end, we propose a truly unsupervised image-to-image translation model (TUNIT) that simultaneously learns to separate image domains and to translate input images into the estimated domains. Experimental results show that our model achieves comparable or even better performance than the set-level supervised model trained with full labels, generalizes well on various datasets, and is robust against the choice of hyperparameters (e.g., the preset number of pseudo domains). Furthermore, TUNIT can be easily extended to semi-supervised learning with a few labeled data.
Pages: 14134-14143
Page count: 10