3D-Aware Multi-Class Image-to-Image Translation with NeRFs

被引:2
|
作者
Li, Senmao [1 ]
van de Weijer, Joost [2 ]
Wang, Yaxing [1 ]
Khan, Fahad Shahbaz [3 ,4 ]
Liu, Meiqin [5 ]
Yang, Jian [1 ]
机构
[1] Nankai Univ, CS, VCIP, Tianjin, Peoples R China
[2] Univ Autonoma Barcelona, Barcelona, Spain
[3] Mohamed bin Zayed Univ AI, Abu Dhabi, U Arab Emirates
[4] Linkoping Univ, Linkoping, Sweden
[5] Beijing Jiaotong Univ, Beijing, Peoples R China
关键词
D O I
10.1109/CVPR52729.2023.01217
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in 3D-aware generative models (3D-aware GANs) combined with Neural Radiance Fields (NeRF) have achieved impressive results. However no prior works investigate 3D-aware GANs for 3D consistent multi-class image-to-image (3D-aware I2I) translation. Naively using 2D-I2I translation methods suffers from unrealistic shape/identity change. To perform 3D-aware multi-class I2I translation, we decouple this learning process into a multi-class 3D-aware GAN step and a 3D-aware I2I translation step. In the first step, we propose two novel techniques: a new conditional architecture and an effective training strategy. In the second step, based on the well-trained multi-class 3D-aware GAN architecture, that preserves view-consistency, we construct a 3D-aware I2I translation system. To further reduce the view-consistency problems, we propose several new techniques, including a U-net-like adaptor network design, a hierarchical representation constrain and a relative regularization loss. In extensive experiments on two datasets, quantitative and qualitative results demonstrate that we successfully perform 3D-aware I2I translation with multi-view consistency. Code is available in 3DI2I.
引用
收藏
页码:12652 / 12662
页数:11
相关论文
共 50 条
  • [1] 3D-aware Blending with Generative NeRFs
    Kim, Hyunsu
    Lee, Gayoung
    Choi, Yunjey
    Kim, Jin-Hwa
    Zhu, Jun-Yan
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22849 - 22861
  • [2] Multi3D: 3D-aware multimodal image synthesis
    Zhou, Wenyang
    Yuan, Lu
    Mu, Taijiang
    COMPUTATIONAL VISUAL MEDIA, 2024, 10 (06) : 1205 - 1217
  • [3] Panoptic-aware Image-to-Image Translation
    Zhang, Liyun
    Ratsamee, Photchara
    Wang, Bowen
    Luo, Zhaojie
    Uranishi, Yuki
    Higashida, Manabu
    Takemura, Haruo
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 259 - 268
  • [4] 3D-aware Conditional Image Synthesis
    Deng, Kangle
    Yang, Gengshan
    Ramanan, Deva
    Zhu, Jun-Yan
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 4434 - 4445
  • [5] Quality-Aware Unpaired Image-to-Image Translation
    Chen, Lei
    Wu, Le
    Hu, Zhenzhen
    Wang, Meng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (10) : 2664 - 2674
  • [6] Geometry-Aware Eye Image-To-Image Translation
    Lu, Conny
    Zhang, Qian
    Krishnakumar, Kapil
    Chen, Jixu
    Fuchs, Henry
    Talathi, Sachin
    Liu, Kun
    2022 ACM SYMPOSIUM ON EYE TRACKING RESEARCH AND APPLICATIONS, ETRA 2022, 2022,
  • [7] InstaFormer: Instance-Aware Image-to-Image Translation with Transformer
    Kim, Soohyun
    Baek, Jongbeom
    Park, Jihye
    Kim, Gyeongnyeon
    Kim, Seungryong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18300 - 18310
  • [8] Unsupervised Exemplar-Domain Aware Image-to-Image Translation
    Fu, Yuanbin
    Ma, Jiayi
    Guo, Xiaojie
    ENTROPY, 2021, 23 (05)
  • [9] TransGaGa: Geometry-Aware Unsupervised Image-to-Image Translation
    Wu, Wayne
    Cao, Kaidi
    Li, Cheng
    Qian, Chen
    Loy, Chen Change
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8004 - 8013
  • [10] Semi-supervised Task Aware Image-to-Image Translation
    Muetze, Annika
    Rottmann, Matthias
    Gottschalk, Hanno
    COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VISIGRAPP 2023, 2024, 2103 : 98 - 122