Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator

Authors
Shi, Zifan [1, 3]
Xu, Yinghao [2, 3]
Shen, Yujun [3 ]
Zhao, Deli [3 ]
Chen, Qifeng [1 ]
Yeung, Dit-Yan [1 ]
Affiliations
[1] HKUST, Hong Kong, People's Republic of China
[2] CUHK, Hong Kong, People's Republic of China
[3] Ant Group, Hangzhou, People's Republic of China
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
3D-aware image synthesis aims to learn a generative model that renders photo-realistic 2D images while capturing plausible underlying 3D shapes. A popular solution is to adopt a generative adversarial network (GAN) and replace the generator with a 3D renderer, commonly volume rendering with a neural radiance field (NeRF). Although synthesis quality has advanced, existing methods still fail to recover reasonable 3D shapes. We argue that, given the two-player game in the formulation of GANs, making only the generator 3D-aware is not enough. In other words, replacing the generative mechanism offers the capability, but not the guarantee, of producing 3D-aware images, because the supervision of the generator comes primarily from the discriminator. To address this issue, we propose GeoD, which learns a geometry-aware discriminator to improve 3D-aware GANs. Concretely, besides differentiating real from fake samples in the 2D image space, the discriminator is additionally asked to derive geometry information from its inputs, which is then used to guide the generator. This simple yet effective design facilitates learning substantially more accurate 3D shapes. Extensive experiments on various generator architectures and training datasets verify the superiority of GeoD over state-of-the-art alternatives. Moreover, our approach serves as a general framework, so that a more capable discriminator (i.e., one with a third task of novel view synthesis, beyond domain classification and geometry extraction) can further help the generator achieve better multi-view consistency. A project page is available for this work.
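The guidance idea in the abstract can be illustrated with a minimal sketch of a generator objective: an adversarial term plus a term that penalizes disagreement between the geometry the generator renders and the geometry the discriminator extracts from the same fake image. This is not the authors' implementation. The abstract does not specify the geometry representation, the distance, or the weighting, so the flat per-pixel geometry values, the L1 distance, the non-saturating adversarial loss, and the names `geometry_loss`, `generator_loss`, and `geo_weight` are all assumptions made for illustration.

```python
import math
from typing import Sequence


def softplus(x: float) -> float:
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def adversarial_loss(fake_logits: Sequence[float]) -> float:
    """Non-saturating generator loss: mean of softplus(-D(G(z))).

    Assumption: the abstract does not say which GAN loss is used.
    """
    return sum(softplus(-s) for s in fake_logits) / len(fake_logits)


def geometry_loss(rendered: Sequence[float], extracted: Sequence[float]) -> float:
    """Mean absolute gap between two flattened geometry maps.

    `rendered` stands for geometry produced by the 3D-aware generator;
    `extracted` stands for geometry the discriminator derives from the
    same fake image. Both representations are hypothetical here.
    """
    if len(rendered) != len(extracted):
        raise ValueError("geometry maps must have the same number of pixels")
    return sum(abs(r - e) for r, e in zip(rendered, extracted)) / len(rendered)


def generator_loss(
    fake_logits: Sequence[float],
    rendered: Sequence[float],
    extracted: Sequence[float],
    geo_weight: float = 1.0,  # hypothetical balancing weight
) -> float:
    """Adversarial term plus weighted geometry-guidance term."""
    return adversarial_loss(fake_logits) + geo_weight * geometry_loss(rendered, extracted)


if __name__ == "__main__":
    # Toy values: one discriminator score and a two-pixel geometry map.
    print(generator_loss([0.0], [1.0, 2.0], [1.5, 2.5], geo_weight=2.0))
```

In this sketch the second term is zero when the generator's geometry already matches what the discriminator extracts, so the discriminator's geometry branch only influences the generator when the two disagree.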
Pages: 12