3D-Aware Generative Model for Improved Side-View Image Synthesis

被引：0

作者：

Jo, Kyungmin ^{[1
]}

Jin, Wonjoon ^{[2
]}

Choo, Jaegul ^{[1
]}

Lee, Hyunjoon ^{[3
]}

Cho, Sunghyun ^{[2
]}

机构：

[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea

[2] POSTECH, Pohang, Gyeongbuk, South Korea

[3] Kakao Brain, Seongnam Si, Gyeonggi Do, South Korea

来源：

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年

基金：

新加坡国家研究基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

While recent 3D-aware generative models have shown photo-realistic image synthesis with multi-view consistency, the synthesized image quality degrades depending on the camera pose (e.g., a face with a blurry and noisy boundary at a side viewpoint). Such degradation is mainly caused by the difficulty of learning both pose consistency and photorealism simultaneously from a dataset with heavily imbalanced poses. In this paper, we propose SideGAN, a novel 3D GAN training method to generate photo-realistic images irrespective of the camera pose, especially for faces of side-view angles. To ease the challenging problem of learning photo-realistic and pose-consistent image synthesis, we split the problem into two subproblems, each of which can be solved more easily. Specifically, we formulate the problem as a combination of two simple discrimination problems, one of which learns to discriminate whether a synthesized image looks real or not, and the other learns to discriminate whether a synthesized image agrees with the camera pose. Based on this, we propose a dual-branched discriminator with two discrimination branches. We also propose a pose-matching loss to learn the pose consistency of 3D GANs. In addition, we present a pose sampling strategy to increase learning opportunities for steep angles in a pose-imbalanced dataset. With extensive validation, we demonstrate that our approach enables 3D GANs to generate high-quality geometries and photo-realistic images irrespective of the camera pose.

引用

页码：22805 / 22815

页数：11

共 50 条

[41] 3D-aware Image Generation using 2D Diffusion Models [J].

Xiang, Jianfeng ;

Yang, Jiaolong ;

Huang, Binbin ;

Tong, Xin .

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, :2383-2393

[42] Discovering Interpretable Latent Space Directions for 3D-Aware Image Generation [J].

Yang, Zhiyuan ;

Zhang, Qingfu .

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (03) :2570-2580

[43] AniFaceGAN: Animatable 3D-Aware Face Image Generation for Video Avatars [J].

Wu, Yue ;

Deng, Yu ;

Yang, Jiaolong ;

Wei, Fangyun ;

Chen, Qifeng ;

Tong, Xin .

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,

[44] MM-Hand: 3D-Aware Multi-Modal Guided Hand Generative Network for 3D Hand Pose Synthesis [J].

Wu, Zhenyu ;

Hoang, Duc ;

Lin, Shih-Yao ;

Xie, Yusheng ;

Chen, Liangjian ;

Lin, Yen-Yu ;

Wang, Zhangyang ;

Fan, Wei .

MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, :2508-2516

[45] Generative Occupancy Fields for 3D Surface-Aware Image Synthesis [J].

Xu, Xudong ;

Pan, Xingang ;

Lin, Dahua ;

Dai, Bo .

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34

[46] 3DHumanGAN: 3D-Aware Human Image Generation with 3D Pose Mapping [J].

Yang, Zhuoqian ;

Li, Shikai ;

Wu, Wayne ;

Dai, Bo .

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, :22951-22962

[47] ProteusNeRF: Fast Lightweight NeRF Editing using 3D-Aware Image Context [J].

Wang, Binglun ;

Dutt, Niladri Shekhar ;

Mitra, Niloy J. .

PROCEEDINGS OF THE ACM ON COMPUTER GRAPHICS AND INTERACTIVE TECHNIQUES, 2024, 7 (01)

[48] OBJECT 3DIT: Language-guided 3D-aware Image Editing [J].

Michel, Oscar ;

Bhattad, Anand ;

VanderBilt, Eli ;

Krishna, Ranjay ;

Kembhavi, Aniruddha ;

Gupta, Tanmay .

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,

[49] 3D Face Reconstruction from One Side-View Face Images [J].

Jo, Jaeik ;

Jung, Yu Jin ;

Kim, Jaihie .

2014 International Conference on Electronics, Information and Communications (ICEIC), 2014,

[50] Real-Time 3D-Aware Portrait Editing from a Single Image [J].

Bai, Qingyan ;

Shi, Zifan ;

Xu, Yinghao ;

Ouyang, Hao ;

Wang, Qiuyu ;

Yang, Ceyuan ;

Wang, Xuan ;

Wetzstein, Gordon ;

Shen, Yujun ;

Chen, Qifeng .

COMPUTER VISION - ECCV 2024, PT LI, 2025, 15109 :344-362

← 1 2 3 4 5 →