Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator

被引:0
|
作者
Shi, Zifan [1 ,3 ]
Xu, Yinghao [2 ,3 ]
Shen, Yujun [3 ]
Zhao, Deli [3 ]
Chen, Qifeng [1 ]
Yeung, Dit-Yan [1 ]
机构
[1] HKUST, Hong Kong, Peoples R China
[2] CUHK, Hong Kong, Peoples R China
[3] Ant Grp, Hangzhou, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D-aware image synthesis aims at learning a generative model that can render photo-realistic 2D images while capturing decent underlying 3D shapes. A popular solution is to adopt the generative adversarial network (GAN) and replace the generator with a 3D renderer, where volume rendering with neural radiance field (NeRF) is commonly used. Despite the advancement of synthesis quality, existing methods fail to obtain moderate 3D shapes. We argue that, considering the two-player game in the formulation of GANs, only making the generator 3D-aware is not enough. In other words, displacing the generative mechanism only offers the capability, but not the guarantee, of producing 3D-aware images, because the supervision of the generator primarily comes from the discriminator. To address this issue, we propose GeoD through learning a geometry-aware discriminator to improve 3D-aware GANs. Concretely, besides differentiating real and fake samples from the 2D image space, the discriminator is additionally asked to derive the geometry information from the inputs, which is then applied as the guidance of the generator. Such a simple yet effective design facilitates learning substantially more accurate 3D shapes. Extensive experiments on various generator architectures and training datasets verify the superiority of GeoD over state-of-the-art alternatives. Moreover, our approach is registered as a general framework such that a more capable discriminator (i.e., with a third task of novel view synthesis beyond domain classification and geometry extraction) can further assist the generator with a better multi-view consistency. Project page can be found here.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] 3D-Aware Face Swapping
    Li, Yixuan
    Ma, Chao
    Yan, Yichao
    Zhu, Wenhan
    Yang, Xiaokang
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12705 - 12714
  • [22] Geometry-Aware Neural Rendering
    Tobin, Josh
    Abbeel, Pieter
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [23] A GEOMETRY-AWARE FRAMEWORK FOR COMPRESSING 3D MESH TEXTURES
    Nasiri, Fatemeh
    Bidgoli, Navid Mahmoudian
    Payan, Frederic
    Maugey, Thomas
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 4015 - 4019
  • [24] 3D-Aware Indoor Scene Synthesis with Depth Priors
    Shi, Zifan
    Shen, Yujun
    Zhu, Jiapeng
    Yeung, Dit-Yan
    Chen, Qifeng
    COMPUTER VISION - ECCV 2022, PT XVI, 2022, 13676 : 406 - 422
  • [25] Supervised, Geometry-aware segmentation of 3D mesh Models
    Bamba, Keisuke
    Ohbuchi, Ryutarou
    2012 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2012, : 49 - 54
  • [26] Efficient Geometry-aware 3D Generative Adversarial Networks
    Chan, Eric R.
    Lin, Connor Z.
    Chan, Matthew A.
    Nagano, Koki
    Pan, Boxiao
    de Mello, Shalini
    Gallo, Orazio
    Guibas, Leonidas
    Tremblay, Jonathan
    Khamis, Sameh
    Karras, Tero
    Wetzstein, Gordon
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 16102 - 16112
  • [27] Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis
    Zhang, Xuanmeng
    Zheng, Zhedong
    Gao, Daiheng
    Zhang, Bang
    Pan, Pan
    Yang, Yi
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18429 - 18438
  • [28] Geometry-aware Deep Transform
    Huang, Jiaji
    Qiu, Qiang
    Calderbank, Robert
    Sapiro, Guillermo
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4139 - 4147
  • [29] Improving Robustness of Language Models from a Geometry-aware Perspective
    Zhu, Bin
    Gu, Zhaoquan
    Wang, Le
    Chen, Jinyin
    Xuan, Qi
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 3115 - 3125
  • [30] Generative Multiplane Neural Radiance for 3D-Aware Image Generation
    Kumar, Amandeep
    Bhunia, Ankan Kumar
    Narayan, Sanath
    Cholakkal, Hisham
    Anwer, Rao Muhammad
    Khan, Salman
    Yang, Ming-Hsuan
    Khan, Fahad Shahbaz
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7354 - 7364