A Variational U-Net for Conditional Appearance and Shape Generation

被引：290

作者：

Esser, Patrick ^{[1
]}

Sutter, Ekaterina ^{[1
]}

Ommer, Bjoern ^{[1
]}

机构：

[1] Heidelberg Univ, IWR, Heidelberg Collaboratory Image Proc, Heidelberg, Germany

来源：

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年

关键词：

D O I：

10.1109/CVPR.2018.00923

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep generative models have demonstrated great performance in image synthesis. However, results deteriorate in case of spatial deformations, since they generate images of objects directly, rather than modeling the intricate interplay of their inherent shape and appearance. We present a conditional U-Net [30] for shape-guided image generation, conditioned on the output of a variational autoencoder for appearance. The approach is trained end-to-end on images, without requiring samples of the same object with varying pose or appearance. Experiments show that the model enables conditional image generation and transfer. Therefore, either shape or appearance can be retained from a query image, while freely altering the other. Moreover, appearance can be sampled due to its stochastic latent representation, while preserving shape. In quantitative and qualitative experiments on COCO [20], DeepFashion [21, 23], shoes [43], Market-1501 [47] and handbags [49] the approach demonstrates significant improvements over the state-of-the-art.

引用

页码：8857 / 8866

页数：10

共 50 条

[1] A variational U-Net for motion retargeting
Uk Kim, Seong
Jang, Hanyoung
Kim, Jongmin
COMPUTER ANIMATION AND VIRTUAL WORLDS, 2020, 31 (4-5)
[2] A Variational U-Net for Motion Retargeting
Jang, Hanyoung
Kwon, Byungjun
Yu, Moonwon
Kim, Seong Uk
Kim, Jongmin
SA'18: SIGGRAPH ASIA 2018 POSTERS, 2018,
[3] Enhanced Variational U-Net for Weather Forecasting
Kwok, Pak Hay
Qi, Qi
2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5758 - 5763
[4] Towards speech enhancement using a variational U-Net architecture
Nustede, Eike J.
Anemueller, Joern
29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 481 - 485
[5] Non-Intrusive Load Monitoring Based on Residual U-Net and Conditional Generation Adversarial Networks
Wang, Jinlong
Pang, Chengxin
Zeng, Xinhua
Chen, Yongbo
IEEE ACCESS, 2023, 11 : 77441 - 77451
[6] Facial Expression Synthesis by U-Net Conditional Generative Adversarial Networks
Wang, Xueping
Li, Weixin
Mu, Guodong
Huang, Di
Wang, Yunhong
ICMR '18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2018, : 283 - 290
[7] Improved Breast Mass Segmentation in Mammograms with Conditional Residual U-Net
Li, Heyi
Chen, Dongdong
Nailon, William H.
Davies, Mike E.
Laurenson, David
IMAGE ANALYSIS FOR MOVING ORGAN, BREAST, AND THORACIC IMAGES, 2018, 11040 : 81 - 89
[8] Chimeric U-Net - Modifying the standard U-Net towards explainability
Schulze, Kenrick
Peppert, Felix
Schuette, Christof
Sunkara, Vikram
ARTIFICIAL INTELLIGENCE, 2025, 338
[9] UIU-Net: U-Net in U-Net for Infrared Small Object Detection
Wu, Xin
Hong, Danfeng
Chanussot, Jocelyn
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 364 - 376
[10] U-Net vs Transformer: Is U-Net Outdated in Medical Image Registration?
Jia, Xi
Bartlett, Joseph
Zhang, Tianyang
Lu, Wenqi
Qiu, Zhaowen
Duan, Jinming
MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2022, 2022, 13583 : 151 - 160

← 1 2 3 4 5 →