CVAE-GAN: Fine-Grained Image Generation through Asymmetric Training

被引:381
|
作者
Bao, Jianmin [1 ]
Chen, Dong [2 ]
Wen, Fang [2 ]
Li, Houqiang [1 ]
Hua, Gang [2 ]
机构
[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
[2] Microsoft Res, Redmond, WA USA
来源
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2017年
关键词
D O I
10.1109/ICCV.2017.299
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present variational generative adversarial networks, a general learning framework that combines a variational auto-encoder with a generative adversarial network, for synthesizing images in fine-grained categories, such as faces of a specific person or objects in a category. Our approach models an image as a composition of label and latent attributes in a probabilistic model. By varying the fine-grained category label fed into the resulting generative model, we can generate images in a specific category with randomly drawn values on a latent attribute vector. Our approach has two novel aspects. First, we adopt a cross entropy loss for the discriminative and classifier network, but a mean discrepancy objective for the generative network. This kind of asymmetric loss function makes the GAN training more stable. Second, we adopt an encoder network to learn the relationship between the latent space and the real image space, and use pairwise feature matching to keep the structure of generated images. We experiment with natural images of faces, flowers, and birds, and demonstrate that the proposed models are capable of generating realistic and diverse samples with fine-grained category labels. We further show that our models can be applied to other tasks, such as image inpainting, super-resolution, and data augmentation for training better face recognition models.
引用
收藏
页码:2764 / 2773
页数:10
相关论文
共 50 条
  • [1] Multi-view Image Generation by Cycle CVAE-GAN Networks
    Lai, Zhichen
    Tang, Chenwei
    Lv, Jiancheng
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT I, 2019, 11953 : 43 - 54
  • [2] Variational Conditional GAN for Fine-grained Controllable Image Generation
    Hu, Mingqi
    Zhou, Deyu
    He, Yulan
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 101, 2019, 101 : 109 - 124
  • [3] Hierarchical CVAE for Fine-Grained Hate Speech Classification
    Qian, Jing
    ElSherief, Mai
    Belding, Elizabeth
    Wang, William Yang
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3550 - 3559
  • [4] Fine-grained attention for image caption generation
    Chang, Yan-Shuo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (03) : 2959 - 2971
  • [5] Fine-grained attention for image caption generation
    Yan-Shuo Chang
    Multimedia Tools and Applications, 2018, 77 : 2959 - 2971
  • [6] 4-Class MI-EEG Signal Generation and Recognition with CVAE-GAN
    Yang, Jun
    Yu, Huijuan
    Shen, Tao
    Song, Yaolian
    Chen, Zhuangfei
    APPLIED SCIENCES-BASEL, 2021, 11 (04): : 1 - 14
  • [7] Fine-Grained Image Search
    Xie, Lingxi
    Wang, Jingdong
    Zhang, Bo
    Tian, Qi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (05) : 636 - 647
  • [8] Improving the Conditional Fine-Grained Image Generation With Part Perception
    Han, Xuan
    You, Mingyu
    Lu, Ping
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4792 - 4804
  • [9] Text-to-Image Generation Grounded by Fine-Grained User Attention
    Koh, Jing Yu
    Baldridge, Jason
    Lee, Honglak
    Yang, Yinfei
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 237 - 246
  • [10] A Survey of Fine-Grained Image Categorization
    Zheng, Min
    Li, Qingyong
    Geng, Yangli-ao
    Yu, Haomin
    Wang, Jianzhu
    Gan, Jinrui
    Xue, Wenyuan
    PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 533 - 538