CVAE-GAN: Fine-Grained Image Generation through Asymmetric Training

被引:417
作者
Bao, Jianmin [1 ]
Chen, Dong [2 ]
Wen, Fang [2 ]
Li, Houqiang [1 ]
Hua, Gang [2 ]
机构
[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
[2] Microsoft Res, Redmond, WA USA
来源
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2017年
关键词
D O I
10.1109/ICCV.2017.299
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present variational generative adversarial networks, a general learning framework that combines a variational auto-encoder with a generative adversarial network, for synthesizing images in fine-grained categories, such as faces of a specific person or objects in a category. Our approach models an image as a composition of label and latent attributes in a probabilistic model. By varying the fine-grained category label fed into the resulting generative model, we can generate images in a specific category with randomly drawn values on a latent attribute vector. Our approach has two novel aspects. First, we adopt a cross entropy loss for the discriminative and classifier network, but a mean discrepancy objective for the generative network. This kind of asymmetric loss function makes the GAN training more stable. Second, we adopt an encoder network to learn the relationship between the latent space and the real image space, and use pairwise feature matching to keep the structure of generated images. We experiment with natural images of faces, flowers, and birds, and demonstrate that the proposed models are capable of generating realistic and diverse samples with fine-grained category labels. We further show that our models can be applied to other tasks, such as image inpainting, super-resolution, and data augmentation for training better face recognition models.
引用
收藏
页码:2764 / 2773
页数:10
相关论文
共 48 条
[1]  
[Anonymous], 2016, P ADV NEURAL INFORM
[2]  
[Anonymous], ARXIV170208398
[3]  
[Anonymous], P IEEE INT C AC SPEE
[4]  
[Anonymous], 2015, ARXIV151101844
[5]  
[Anonymous], 2011, P 14 INT C ARTIFICIA
[6]  
[Anonymous], 2017, NIPS 2016 WORKSH ADV
[7]  
[Anonymous], 2016, Advances in Face Detection and Facial Image Analysis, DOI 10.1007/978-3-319-25958-1
[8]  
[Anonymous], 2016, ARXIV161200005
[9]  
[Anonymous], 2009, Deep boltzmann machines
[10]  
[Anonymous], 2014, CoRR