Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator

Cited by: 0

Authors:

Shi, Zifan [1,3]

Xu, Yinghao [2,3]

Shen, Yujun [3]

Zhao, Deli [3]

Chen, Qifeng [1]

Yeung, Dit-Yan [1]

Affiliations:

[1] HKUST, Hong Kong, People's Republic of China

[2] CUHK, Hong Kong, People's Republic of China

[3] Ant Group, Hangzhou, People's Republic of China

Source:

Advances in Neural Information Processing Systems 35 (NeurIPS 2022) | 2022

Keywords:

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

3D-aware image synthesis aims to learn a generative model that renders photo-realistic 2D images while capturing plausible underlying 3D shapes. A popular solution is to adopt the generative adversarial network (GAN) and replace the generator with a 3D renderer, commonly volume rendering with a neural radiance field (NeRF). Despite advances in synthesis quality, existing methods fail to obtain reasonable 3D shapes. We argue that, given the two-player game in the formulation of GANs, making only the generator 3D-aware is not enough. In other words, replacing the generative mechanism offers only the capability, not the guarantee, of producing 3D-aware images, because the supervision of the generator comes primarily from the discriminator. To address this issue, we propose GeoD, which learns a geometry-aware discriminator to improve 3D-aware GANs. Concretely, besides differentiating real and fake samples in the 2D image space, the discriminator is additionally asked to derive geometry information from its inputs, which is then used to guide the generator. This simple yet effective design facilitates learning substantially more accurate 3D shapes. Extensive experiments on various generator architectures and training datasets verify the superiority of GeoD over state-of-the-art alternatives. Moreover, our approach is formulated as a general framework, so that a more capable discriminator (i.e., one with a third task of novel view synthesis beyond domain classification and geometry extraction) can further help the generator achieve better multi-view consistency. A project page accompanies the paper.
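
The abstract describes a generator objective with two parts: the usual adversarial signal, plus guidance from geometry that the discriminator derives from the image. The following is a minimal sketch of that idea only, not the authors' implementation. It assumes the geometry is a depth map stored as a flat list, that the guidance is a mean absolute difference, and that the adversarial term is the non-saturating GAN loss; the names `geometry_gap`, `generator_loss`, and `geo_weight` are hypothetical and do not come from the paper.

```python
import math


def softplus(x):
    # Numerically stable log(1 + exp(x)).
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def geometry_gap(rendered_depth, predicted_depth):
    # ASSUMPTION: geometry is a depth map flattened to a list of floats.
    # Mean absolute difference between the depth rendered by the generator
    # and the depth the discriminator estimates from the generated image.
    if len(rendered_depth) != len(predicted_depth):
        raise ValueError("depth maps must have the same number of pixels")
    total = sum(abs(r - p) for r, p in zip(rendered_depth, predicted_depth))
    return total / len(rendered_depth)


def generator_loss(fake_logit, rendered_depth, predicted_depth, geo_weight=1.0):
    # ASSUMPTION: non-saturating adversarial loss, -log(sigmoid(logit)).
    adversarial = softplus(-fake_logit)
    # ASSUMPTION: geometry guidance enters as a weighted additive penalty.
    guidance = geometry_gap(rendered_depth, predicted_depth)
    return adversarial + geo_weight * guidance


if __name__ == "__main__":
    # Toy values: a 2-pixel "depth map" and an undecided discriminator logit.
    print(generator_loss(0.0, [1.0, 2.0], [2.0, 4.0], geo_weight=0.5))
```

In this sketch the generator is penalized whenever its rendered geometry disagrees with what the discriminator extracts from the same image, which is one plausible reading of "applied as the guidance of the generator"; the actual geometry representation and loss used in the paper may differ.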


Pages: 12
