Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator

被引:0
|
作者
Shi, Zifan [1 ,3 ]
Xu, Yinghao [2 ,3 ]
Shen, Yujun [3 ]
Zhao, Deli [3 ]
Chen, Qifeng [1 ]
Yeung, Dit-Yan [1 ]
机构
[1] HKUST, Hong Kong, Peoples R China
[2] CUHK, Hong Kong, Peoples R China
[3] Ant Grp, Hangzhou, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D-aware image synthesis aims at learning a generative model that can render photo-realistic 2D images while capturing decent underlying 3D shapes. A popular solution is to adopt the generative adversarial network (GAN) and replace the generator with a 3D renderer, where volume rendering with neural radiance field (NeRF) is commonly used. Despite the advancement of synthesis quality, existing methods fail to obtain moderate 3D shapes. We argue that, considering the two-player game in the formulation of GANs, only making the generator 3D-aware is not enough. In other words, displacing the generative mechanism only offers the capability, but not the guarantee, of producing 3D-aware images, because the supervision of the generator primarily comes from the discriminator. To address this issue, we propose GeoD through learning a geometry-aware discriminator to improve 3D-aware GANs. Concretely, besides differentiating real and fake samples from the 2D image space, the discriminator is additionally asked to derive the geometry information from the inputs, which is then applied as the guidance of the generator. Such a simple yet effective design facilitates learning substantially more accurate 3D shapes. Extensive experiments on various generator architectures and training datasets verify the superiority of GeoD over state-of-the-art alternatives. Moreover, our approach is registered as a general framework such that a more capable discriminator (i.e., with a third task of novel view synthesis beyond domain classification and geometry extraction) can further assist the generator with a better multi-view consistency. Project page can be found here.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation
    Deng, Yu
    Yang, Jiaolong
    Xiang, Jianfeng
    Tong, Xin
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10663 - 10673
  • [32] Geometry-Aware Discriminative Dictionary Learning for PolSAR Image Classification
    Zhang, Yachao
    Lai, Xuan
    Xie, Yuan
    Qu, Yanyun
    Li, Cuihua
    REMOTE SENSING, 2021, 13 (06)
  • [33] Geometry-aware 3D pose transfer using transformer autoencoder
    Liu, Shanghuan
    Gai, Shaoyan
    Da, Feipeng
    Waris, Fazal
    COMPUTATIONAL VISUAL MEDIA, 2024, 10 (6) : 1063 - 1078
  • [34] 3D-Aware Multi-Class Image-to-Image Translation with NeRFs
    Li, Senmao
    van de Weijer, Joost
    Wang, Yaxing
    Khan, Fahad Shahbaz
    Liu, Meiqin
    Yang, Jian
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12652 - 12662
  • [35] Geometry-Aware Reference Synthesis for Multi-View Image Super-Resolution
    Cheng, Ri
    Sun, Yuqi
    Yan, Bo
    Tan, Weimin
    Ma, Chenxi
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 6083 - 6093
  • [36] 3D Geometry-Aware Semantic Labeling of Outdoor Street Scenes
    Zhong, Yiran
    Dai, Yuchao
    Li, Hongdong
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2343 - 2349
  • [37] Geometry-aware Tracking of Manipulability Ellipsoids
    Jaquier, Noemie
    Rozo, Leonel
    Caldwell, Darwin G.
    Calinon, Sylvain
    ROBOTICS: SCIENCE AND SYSTEMS XIV, 2018,
  • [38] Geometry-Aware Face Completion and Editing
    Song, Linsen
    Cao, Jie
    Song, Lingxiao
    Hu, Yibo
    He, Ran
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 2506 - 2513
  • [39] Geometry-aware Dynamic Movement Primitives
    Abu-Dakka, Fares J.
    Kyrki, Ville
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 4421 - 4426
  • [40] 3D-aware Blending with Generative NeRFs
    Kim, Hyunsu
    Lee, Gayoung
    Choi, Yunjey
    Kim, Jin-Hwa
    Zhu, Jun-Yan
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22849 - 22861