A Variational U-Net for Conditional Appearance and Shape Generation

被引:290
作者
Esser, Patrick [1 ]
Sutter, Ekaterina [1 ]
Ommer, Bjoern [1 ]
机构
[1] Heidelberg Univ, IWR, Heidelberg Collaboratory Image Proc, Heidelberg, Germany
来源
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年
关键词
D O I
10.1109/CVPR.2018.00923
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep generative models have demonstrated great performance in image synthesis. However, results deteriorate in case of spatial deformations, since they generate images of objects directly, rather than modeling the intricate interplay of their inherent shape and appearance. We present a conditional U-Net [30] for shape-guided image generation, conditioned on the output of a variational autoencoder for appearance. The approach is trained end-to-end on images, without requiring samples of the same object with varying pose or appearance. Experiments show that the model enables conditional image generation and transfer. Therefore, either shape or appearance can be retained from a query image, while freely altering the other. Moreover, appearance can be sampled due to its stochastic latent representation, while preserving shape. In quantitative and qualitative experiments on COCO [20], DeepFashion [21, 23], shoes [43], Market-1501 [47] and handbags [49] the approach demonstrates significant improvements over the state-of-the-art.
引用
收藏
页码:8857 / 8866
页数:10
相关论文
共 50 条
[41]   Slim U-Net: Efficient Anatomical Feature Preserving U-net Architecture for Ultrasound Image Segmentation [J].
Raina, Deepak ;
Verma, Kashish ;
Chandrashekhara, Sheragaru Hanumanthappa ;
Saha, Subir Kumar .
2022 9TH INTERNATIONAL CONFERENCE ON BIOMEDICAL AND BIOINFORMATICS ENGINEERING, ICBBE 2022, 2022, :41-48
[42]   Robust U-Net: Development of robust image enhancement model using modified U-Net architecture [J].
Bhavani, Murapaka Dhanalakshmi ;
Murugan, Raman ;
Goel, Tripti .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (28)
[43]   Stacked Generative Adversarial Networks for Image Generation based on U-Net discriminator [J].
Feng, Wanyan ;
Meng, Zuqiang ;
Wang, Lin .
2022 ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING (CACML 2022), 2022, :762-768
[44]   tagnant zone segmentation with U-net [J].
Waktola, Selam ;
Grudzien, Krzysztof ;
Babout, Laurent .
2019 IEEE SECOND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING (AIKE), 2019, :277-280
[45]   Segmentation of roots in soil with U-Net [J].
Abraham George Smith ;
Jens Petersen ;
Raghavendra Selvan ;
Camilla Ruø Rasmussen .
Plant Methods, 16
[46]   Segmentation of roots in soil with U-Net [J].
Smith, Abraham George ;
Petersen, Jens ;
Selvan, Raghavendra ;
Rasmussen, Camilla Ruo .
PLANT METHODS, 2020, 16 (01)
[47]   Attention U-Net for Kidney and Masses [J].
Lee, Duho ;
Choi, Heeyeon .
KIDNEY AND KIDNEY TUMOR SEGMENTATION, KITS 2023, 2024, 14540 :139-142
[48]   U-Net for SPECT Image Denoising [J].
Reymann, Maximilian P. ;
Wuerfl, Tobias ;
Ritt, Philipp ;
Stimpel, Bernhard ;
Cachovan, Michal ;
Vija, A. Hans ;
Maier, Andreas .
2019 IEEE NUCLEAR SCIENCE SYMPOSIUM AND MEDICAL IMAGING CONFERENCE (NSS/MIC), 2019,
[49]   Integration of U-Net into Windows/NT [J].
von Eicken, T .
PARALLEL COMPUTER ROUTING AND COMMUNICATION, 1998, 1417 :185-185
[50]   Improved Segmentation by Adversarial U-Net [J].
Sriker, David ;
Cohen, Dana ;
Cahan, Noa ;
Greenspan, Hayit .
MEDICAL IMAGING 2021: COMPUTER-AIDED DIAGNOSIS, 2021, 11597