3D-Aware Generative Model for Improved Side-View Image Synthesis

Cited by: 0
Authors
Jo, Kyungmin [1]
Jin, Wonjoon [2]
Choo, Jaegul [1]
Lee, Hyunjoon [3]
Cho, Sunghyun [2]
Affiliations
[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea
[2] POSTECH, Pohang, Gyeongbuk, South Korea
[3] Kakao Brain, Seongnam Si, Gyeonggi Do, South Korea
Source
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023
Funding
National Research Foundation, Singapore;
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
While recent 3D-aware generative models have shown photo-realistic image synthesis with multi-view consistency, the synthesized image quality degrades depending on the camera pose (e.g., a face with a blurry and noisy boundary at a side viewpoint). Such degradation is mainly caused by the difficulty of learning both pose consistency and photorealism simultaneously from a dataset with heavily imbalanced poses. In this paper, we propose SideGAN, a novel 3D GAN training method to generate photo-realistic images irrespective of the camera pose, especially for faces of side-view angles. To ease the challenging problem of learning photo-realistic and pose-consistent image synthesis, we split the problem into two subproblems, each of which can be solved more easily. Specifically, we formulate the problem as a combination of two simple discrimination problems, one of which learns to discriminate whether a synthesized image looks real or not, and the other learns to discriminate whether a synthesized image agrees with the camera pose. Based on this, we propose a dual-branched discriminator with two discrimination branches. We also propose a pose-matching loss to learn the pose consistency of 3D GANs. In addition, we present a pose sampling strategy to increase learning opportunities for steep angles in a pose-imbalanced dataset. With extensive validation, we demonstrate that our approach enables 3D GANs to generate high-quality geometries and photo-realistic images irrespective of the camera pose.
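The abstract mentions a pose sampling strategy that increases learning opportunities for steep angles in a pose-imbalanced dataset, but this record does not give its details. The following is only a minimal sketch of one plausible way to do this, assuming that yaw angles are mixed between the dataset's empirical poses and a uniform range. The names `sample_yaw`, `steep_fraction`, `uniform_prob`, and `max_abs_yaw` are hypothetical and are not taken from the paper.

```python
import random


def sample_yaw(dataset_yaws, uniform_prob=0.5, max_abs_yaw=90.0, rng=random):
    """Draw one yaw angle in degrees for rendering a synthesized image.

    Assumption (not from the paper): with probability `uniform_prob` the yaw
    is drawn uniformly from [-max_abs_yaw, max_abs_yaw]; otherwise it is
    drawn from the imbalanced empirical dataset poses.
    """
    if rng.random() < uniform_prob:
        return rng.uniform(-max_abs_yaw, max_abs_yaw)
    return rng.choice(dataset_yaws)


def steep_fraction(yaws, threshold=60.0):
    """Fraction of yaw angles whose magnitude is at least `threshold` degrees."""
    return sum(abs(y) >= threshold for y in yaws) / len(yaws)


# Toy pose-imbalanced dataset: yaws concentrated around the frontal view.
rng = random.Random(0)
dataset_yaws = [rng.gauss(0.0, 15.0) for _ in range(10_000)]

# Baseline: sample rendering poses from the dataset distribution only.
baseline = [rng.choice(dataset_yaws) for _ in range(10_000)]

# Mixed: half of the rendering poses come from the uniform range.
mixed = [sample_yaw(dataset_yaws, 0.5, 90.0, rng) for _ in range(10_000)]
```

In this toy setup, comparing `steep_fraction(mixed)` with `steep_fraction(baseline)` shows how much more often side views are visited during training under the mixed sampler. The actual SideGAN strategy may differ.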
Pages: 22805-22815
Page count: 11
Related papers
50 in total
[41]   3D-aware Image Generation using 2D Diffusion Models [J].
Xiang, Jianfeng ;
Yang, Jiaolong ;
Huang, Binbin ;
Tong, Xin .
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, :2383-2393
[42]   Discovering Interpretable Latent Space Directions for 3D-Aware Image Generation [J].
Yang, Zhiyuan ;
Zhang, Qingfu .
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (03) :2570-2580
[43]   AniFaceGAN: Animatable 3D-Aware Face Image Generation for Video Avatars [J].
Wu, Yue ;
Deng, Yu ;
Yang, Jiaolong ;
Wei, Fangyun ;
Chen, Qifeng ;
Tong, Xin .
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[44]   MM-Hand: 3D-Aware Multi-Modal Guided Hand Generative Network for 3D Hand Pose Synthesis [J].
Wu, Zhenyu ;
Hoang, Duc ;
Lin, Shih-Yao ;
Xie, Yusheng ;
Chen, Liangjian ;
Lin, Yen-Yu ;
Wang, Zhangyang ;
Fan, Wei .
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, :2508-2516
[45]   Generative Occupancy Fields for 3D Surface-Aware Image Synthesis [J].
Xu, Xudong ;
Pan, Xingang ;
Lin, Dahua ;
Dai, Bo .
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[46]   3DHumanGAN: 3D-Aware Human Image Generation with 3D Pose Mapping [J].
Yang, Zhuoqian ;
Li, Shikai ;
Wu, Wayne ;
Dai, Bo .
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, :22951-22962
[47]   ProteusNeRF: Fast Lightweight NeRF Editing using 3D-Aware Image Context [J].
Wang, Binglun ;
Dutt, Niladri Shekhar ;
Mitra, Niloy J. .
PROCEEDINGS OF THE ACM ON COMPUTER GRAPHICS AND INTERACTIVE TECHNIQUES, 2024, 7 (01)
[48]   OBJECT 3DIT: Language-guided 3D-aware Image Editing [J].
Michel, Oscar ;
Bhattad, Anand ;
VanderBilt, Eli ;
Krishna, Ranjay ;
Kembhavi, Aniruddha ;
Gupta, Tanmay .
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[49]   3D Face Reconstruction from One Side-View Face Images [J].
Jo, Jaeik ;
Jung, Yu Jin ;
Kim, Jaihie .
2014 International Conference on Electronics, Information and Communications (ICEIC), 2014,
[50]   Real-Time 3D-Aware Portrait Editing from a Single Image [J].
Bai, Qingyan ;
Shi, Zifan ;
Xu, Yinghao ;
Ouyang, Hao ;
Wang, Qiuyu ;
Yang, Ceyuan ;
Wang, Xuan ;
Wetzstein, Gordon ;
Shen, Yujun ;
Chen, Qifeng .
COMPUTER VISION - ECCV 2024, PT LI, 2025, 15109 :344-362