Latent Filter Scaling for Multimodal Unsupervised Image-to-Image Translation

Cited by: 23

Authors:

Alharbi, Yazeed [1]

Smith, Neil [2]

Wonka, Peter [2]

Affiliations:

[1] King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia

[2] KAUST, Thuwal, Saudi Arabia

Source:

2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2019) | 2019

Keywords:

DOI:

10.1109/CVPR.2019.00155

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In multimodal unsupervised image-to-image translation tasks, the goal is to translate an image from the source domain to many images in the target domain. We present a simple method that produces higher-quality images than the current state of the art while maintaining the same amount of multimodal diversity. Previous methods follow the unconditional approach of trying to map the latent code directly to a full-size image. This leads to complicated network architectures with several introduced hyperparameters to tune. By treating the latent code as a modifier of the convolutional filters, we produce multimodal output while maintaining the traditional Generative Adversarial Network (GAN) loss and without additional hyperparameters. The only tuning required by our method controls the tradeoff between variability and quality of generated images. Furthermore, we achieve disentanglement between source domain content and target domain style for free as a by-product of our formulation. We perform qualitative and quantitative experiments showing the advantages of our method compared with the state of the art on multiple benchmark image-to-image translation datasets.
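As a rough illustration of the phrase "treating the latent code as a modifier of the convolutional filters", the sketch below scales each filter of one convolution by a scalar derived from a latent code. It is a minimal NumPy sketch under stated assumptions, not the authors' architecture: the names `conv2d_valid`, `latent_to_scales`, `scaled_conv`, and `proj`, the single linear projection, and the `tanh` squashing are all hypothetical choices made for this example.

```python
import numpy as np


def conv2d_valid(x, filters):
    """Naive 'valid' cross-correlation.

    x: (C_in, H, W); filters: (C_out, C_in, k, k).
    Returns an array of shape (C_out, H - k + 1, W - k + 1).
    """
    c_out, _, k, _ = filters.shape
    h_out, w_out = x.shape[1] - k + 1, x.shape[2] - k + 1
    out = np.zeros((c_out, h_out, w_out))
    for i in range(h_out):
        for j in range(w_out):
            patch = x[:, i:i + k, j:j + k]
            # Contract every filter against the same input patch.
            out[:, i, j] = np.tensordot(filters, patch,
                                        axes=([1, 2, 3], [0, 1, 2]))
    return out


def latent_to_scales(z, proj):
    """Hypothetical mapping: latent code z -> one scalar per filter.

    Assumption for illustration: a single linear map followed by tanh.
    """
    return np.tanh(proj @ z)


def scaled_conv(x, filters, scales):
    """Convolve with filters that were each multiplied by their own scalar."""
    return conv2d_valid(x, filters * scales[:, None, None, None])


rng = np.random.default_rng(0)
x = rng.normal(size=(3, 8, 8))           # one source "image"
filters = rng.normal(size=(4, 3, 3, 3))  # weights shared by all latent codes
proj = rng.normal(size=(4, 5))           # hypothetical latent-to-scale map

# Two latent codes give two different outputs for the same input and weights.
out_a = scaled_conv(x, filters, latent_to_scales(rng.normal(size=5), proj))
out_b = scaled_conv(x, filters, latent_to_scales(rng.normal(size=5), proj))
```

Because convolution is linear in the filter weights, scaling a filter is equivalent to scaling the feature map it produces, so in this sketch the latent code only modulates the network's shared weights and never has to be decoded into a full-size image on its own.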


Pages: 1458-1466

Page count: 9
