3D-Aware Generative Model for Improved Side-View Image Synthesis

被引:0
作者
Jo, Kyungmin [1 ]
Jin, Wonjoon [2 ]
Choo, Jaegul [1 ]
Lee, Hyunjoon [3 ]
Cho, Sunghyun [2 ]
机构
[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea
[2] POSTECH, Pohang, Gyeongbuk, South Korea
[3] Kakao Brain, Seongnam Si, Gyeonggi Do, South Korea
来源
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While recent 3D-aware generative models have shown photo-realistic image synthesis with multi-view consistency, the synthesized image quality degrades depending on the camera pose (e.g., a face with a blurry and noisy boundary at a side viewpoint). Such degradation is mainly caused by the difficulty of learning both pose consistency and photorealism simultaneously from a dataset with heavily imbalanced poses. In this paper, we propose SideGAN, a novel 3D GAN training method to generate photo-realistic images irrespective of the camera pose, especially for faces of side-view angles. To ease the challenging problem of learning photo-realistic and pose-consistent image synthesis, we split the problem into two subproblems, each of which can be solved more easily. Specifically, we formulate the problem as a combination of two simple discrimination problems, one of which learns to discriminate whether a synthesized image looks real or not, and the other learns to discriminate whether a synthesized image agrees with the camera pose. Based on this, we propose a dual-branched discriminator with two discrimination branches. We also propose a pose-matching loss to learn the pose consistency of 3D GANs. In addition, we present a pose sampling strategy to increase learning opportunities for steep angles in a pose-imbalanced dataset. With extensive validation, we demonstrate that our approach enables 3D GANs to generate high-quality geometries and photo-realistic images irrespective of the camera pose.
引用
收藏
页码:22805 / 22815
页数:11
相关论文
共 50 条
[21]   GIRAFFE HD: A High-Resolution 3D-aware Generative Model [J].
Xue, Yang ;
Li, Yuheng ;
Singh, Krishna Kumar ;
Lee, Yong Jae .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :18419-18428
[22]   Quantitative Manipulation of Custom Attributes on 3D-Aware Image Synthesis [J].
Do, Hoseok ;
Yoo, EunKyung ;
Kim, Taehyeong ;
Lee, Chul ;
Choi, Tin Young .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :8529-8538
[23]   VoxGRAF: Fast 3D-Aware Image Synthesis with Sparse Voxel Grids [J].
Schwarz, Katja ;
Sauer, Axel ;
Niemeyer, Michael ;
Liao, Yiyi ;
Geiger, Andreas .
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[24]   3D-aware Image Synthesis via Learning Structural and Textural Representations [J].
Xu, Yinghao ;
Peng, Sida ;
Yang, Ceyuan ;
Shen, Yujun ;
Zhou, Bolei .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :18409-18418
[25]   The 3D-aware image synthesis of prohibited items in the X-ray security inspection by stylized generative radiance fields [J].
Liu, Jian ;
Yu, Zhen ;
Guo, Wenyu .
ELECTRONIC RESEARCH ARCHIVE, 2024, 32 (03) :1801-1821
[26]   Gaussian Splatting Decoder for 3D-aware Generative Adversarial Networks [J].
Barthel, Florian ;
Beckmann, Arian ;
Morgenstern, Wieland ;
Hilsmann, Anna ;
Eisert, Peter .
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2024, :7963-7972
[27]   DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-aware Scene Synthesis [J].
Xu, Yinghao ;
Chai, Menglei ;
Shi, Zifan ;
Peng, Sida ;
Skorokhodov, Ivan ;
Siarohin, Aliaksandr ;
Yang, Ceyuan ;
Shen, Yujun ;
Lee, Hsin-Ying ;
Zhou, Bolei ;
Tulyakov, Sergey .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :4402-4412
[28]   NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion [J].
Gu, Jiatao ;
Trevithick, Alex ;
Lin, Kai-En ;
Susskind, Josh ;
Theohalt, Christian ;
Liu, Lingjie ;
Ramamoorthi, Ravi .
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
[29]   3D Congealing: 3D-Aware Image Alignment in the Wild [J].
Zhang, Yunzhi ;
Li, Zizhang ;
Raj, Amit ;
Engelhardt, Andreas ;
Li, Yuanzhen ;
Hou, Tingbo ;
Wu, Jiajun ;
Jampani, Varun .
COMPUTER VISION-ECCV 2024, PT I, 2025, 15059 :387-404
[30]   Generative Multiplane Images: Making a 2D GAN 3D-Aware [J].
Zhao, Xiaoming ;
Ma, Fangchang ;
Guera, David ;
Ren, Zhile ;
Schwing, Alexander G. ;
Colburn, Alex .
COMPUTER VISION - ECCV 2022, PT V, 2022, 13665 :18-35