A Variational U-Net for Conditional Appearance and Shape Generation

被引:290
|
作者
Esser, Patrick [1 ]
Sutter, Ekaterina [1 ]
Ommer, Bjoern [1 ]
机构
[1] Heidelberg Univ, IWR, Heidelberg Collaboratory Image Proc, Heidelberg, Germany
来源
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年
关键词
D O I
10.1109/CVPR.2018.00923
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep generative models have demonstrated great performance in image synthesis. However, results deteriorate in case of spatial deformations, since they generate images of objects directly, rather than modeling the intricate interplay of their inherent shape and appearance. We present a conditional U-Net [30] for shape-guided image generation, conditioned on the output of a variational autoencoder for appearance. The approach is trained end-to-end on images, without requiring samples of the same object with varying pose or appearance. Experiments show that the model enables conditional image generation and transfer. Therefore, either shape or appearance can be retained from a query image, while freely altering the other. Moreover, appearance can be sampled due to its stochastic latent representation, while preserving shape. In quantitative and qualitative experiments on COCO [20], DeepFashion [21, 23], shoes [43], Market-1501 [47] and handbags [49] the approach demonstrates significant improvements over the state-of-the-art.
引用
收藏
页码:8857 / 8866
页数:10
相关论文
共 50 条
  • [1] A variational U-Net for motion retargeting
    Uk Kim, Seong
    Jang, Hanyoung
    Kim, Jongmin
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2020, 31 (4-5)
  • [2] A Variational U-Net for Motion Retargeting
    Jang, Hanyoung
    Kwon, Byungjun
    Yu, Moonwon
    Kim, Seong Uk
    Kim, Jongmin
    SA'18: SIGGRAPH ASIA 2018 POSTERS, 2018,
  • [3] Enhanced Variational U-Net for Weather Forecasting
    Kwok, Pak Hay
    Qi, Qi
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5758 - 5763
  • [4] Towards speech enhancement using a variational U-Net architecture
    Nustede, Eike J.
    Anemueller, Joern
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 481 - 485
  • [5] Non-Intrusive Load Monitoring Based on Residual U-Net and Conditional Generation Adversarial Networks
    Wang, Jinlong
    Pang, Chengxin
    Zeng, Xinhua
    Chen, Yongbo
    IEEE ACCESS, 2023, 11 : 77441 - 77451
  • [6] Facial Expression Synthesis by U-Net Conditional Generative Adversarial Networks
    Wang, Xueping
    Li, Weixin
    Mu, Guodong
    Huang, Di
    Wang, Yunhong
    ICMR '18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2018, : 283 - 290
  • [7] Improved Breast Mass Segmentation in Mammograms with Conditional Residual U-Net
    Li, Heyi
    Chen, Dongdong
    Nailon, William H.
    Davies, Mike E.
    Laurenson, David
    IMAGE ANALYSIS FOR MOVING ORGAN, BREAST, AND THORACIC IMAGES, 2018, 11040 : 81 - 89
  • [8] Chimeric U-Net - Modifying the standard U-Net towards explainability
    Schulze, Kenrick
    Peppert, Felix
    Schuette, Christof
    Sunkara, Vikram
    ARTIFICIAL INTELLIGENCE, 2025, 338
  • [9] UIU-Net: U-Net in U-Net for Infrared Small Object Detection
    Wu, Xin
    Hong, Danfeng
    Chanussot, Jocelyn
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 364 - 376
  • [10] U-Net vs Transformer: Is U-Net Outdated in Medical Image Registration?
    Jia, Xi
    Bartlett, Joseph
    Zhang, Tianyang
    Lu, Wenqi
    Qiu, Zhaowen
    Duan, Jinming
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2022, 2022, 13583 : 151 - 160