Appearance and shape based image synthesis by conditional variational generative adversarial network

被引:10
作者
Chen, Ying [1 ,2 ]
Xia, Shixiong [1 ,2 ]
Zhao, Jiaqi [1 ,2 ]
Zhou, Yong [1 ,2 ]
Niu, Qiang [1 ,2 ]
Yao, Rui [1 ,2 ]
Zhu, Dongjun [1 ,2 ]
机构
[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Jiangsu, Peoples R China
[2] Minist Educ Peoples Republ China, Engn Res Ctr Mine Digitizat, Xuzhou 221116, Jiangsu, Peoples R China
关键词
Image synthesis; Deep generative models; Variational inference; Generative adversarial network;
D O I
10.1016/j.knosys.2019.105450
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person image synthesis based on shape and appearance using deep generative models opens the door in mickle applications, such as person re-identification (ReID) and movie industry. The methods of image synthesis are driven by producing the image of an object directly, which fail to recover spatial deformations when images are generated. In this paper, we present a conditional variational generative adversarial network (CVGAN) to synthesize desired images guided by target shape by modeling the inherent interplay between shape and appearance. Firstly, the shape and appearance of the given images are disentangled by adopting variational inference, which enables us to generate person images with arbitrary shapes. Secondly, to preserve the details and generate photo-realistic images, the Kullback-Leibler (KL) loss is adopted to reduce the gap between the condition image and generated image. Thirdly, to prevent partly gradient vanishing problem for training our framework stably, we propose combined general learning method, where the discriminative network leverages least squares loss. In addition, we experiment on COCO, DeepFashion and Market-1501 datasets, and results demonstrate that VGAN significantly improves the synthesis of images on discriminability, diversity and quality over the existing methods. (c) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:10
相关论文
共 48 条
  • [1] [Anonymous], INT C MACH LEARN ICM
  • [2] [Anonymous], 2019, INT C MACH LEARN ICM
  • [3] [Anonymous], CCF CHIN C COMP VIS
  • [4] [Anonymous], INT C COMP INF SCI I
  • [5] [Anonymous], 2017, IMPROVED TRAINING WA
  • [6] [Anonymous], 2016, P ADV NEUR INF PROC
  • [7] CVAE-GAN: Fine-Grained Image Generation through Asymmetric Training
    Bao, Jianmin
    Chen, Dong
    Wen, Fang
    Li, Houqiang
    Hua, Gang
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2764 - 2773
  • [8] Deep Unsupervised Similarity Learning using Partially Ordered Sets
    Bautista, Miguel A.
    Sanakoyeu, Artsiom
    Ommer, Bjoern
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1923 - 1932
  • [9] LSTM Self-Supervision for Detailed Behavior Analysis
    Brattoli, Biagio
    Buechler, Uta
    Wahl, Anna-Sophia
    Schwab, Martin E.
    Ommer, Bjoern
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3747 - 3756
  • [10] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
    Cao, Zhe
    Simon, Tomas
    Wei, Shih-En
    Sheikh, Yaser
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1302 - 1310