DA-GAN: Instance-level Image Translation by Deep Attention Generative Adversarial Networks

被引:101
作者
Ma, Shuang [1 ,2 ]
Fu, Jianlong [2 ]
Chen, Chang Wen [1 ,3 ]
Mei, Tao [2 ]
机构
[1] SUNY Buffalo, Comp Sci & Engn, Buffalo, NY 14260 USA
[2] Microsoft Res, Redmond, WA 98052 USA
[3] Chinese Univ Hong Kong, Sch Sci & Engn, Hong Kong, Peoples R China
来源
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年
关键词
D O I
10.1109/CVPR.2018.00593
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised image translation, which aims in translating two independent sets of images, is challenging in discovering the correct correspondences without paired data. Existing works build upon Generative Adversarial Networks (GANs) such that the distribution of the translated images are indistinguishable from the distribution of the target set. However, such set-level constraints cannot learn the instance-level correspondences (e.g. aligned semantic parts in object transfiguration task). This limitation often results in false positives (e.g. geometric or semantic artifacts), and further leads to mode collapse problem. To address the above issues, we propose a novel framework for instance-level image translation by Deep Attention GAN (DA-GAN). Such a design enables DA-GAN to decompose the task of translating samples from two sets into translating instances in a highly-structured latent space. Specifically, we jointly learn a deep attention encoder, and the instance-level correspondences could be consequently discovered through attending on the learned instances. Therefore, the constraints could be exploited on both set-level and instance-level. Comparisons against several state-of-the-arts demonstrate the superiority of our approach, and the broad application capability, e.g, pose morphing, data augmentation, etc., pushes the margin of domain translation problem.(1)
引用
收藏
页码:5657 / 5666
页数:10
相关论文
共 44 条
[1]  
[Anonymous], 2016, INFOGAN INTERPRETABL
[2]  
[Anonymous], 2016, CORR
[3]  
[Anonymous], 2016, CORR
[4]  
[Anonymous], 2016, P IEEE C COMPUTER VI
[5]  
[Anonymous], 2016, IMPROVED TECHNIQUES
[6]  
[Anonymous], PROC ICLR 2015
[7]  
[Anonymous], 2010, MNIST HANDWRITTEN DI
[8]  
[Anonymous], 2016, CORR
[9]  
[Anonymous], CORR
[10]  
[Anonymous], 2016, Attribute2Image: Conditional Image Generation from Visual Attributes