Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation

被引:82
作者
Chen, Rui [1 ]
Chen, Yongwei [1 ]
Jiao, Ningxin [1 ]
Jia, Kui [1 ]
机构
[1] South China Univ Technol, Guangzhou, Guangdong, Peoples R China
来源
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年
关键词
D O I
10.1109/ICCV51070.2023.02033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic 3D content creation has achieved rapid progress recently due to the availability of pre-trained, large language models and image diffusion models, forming the emerging topic of text-to-3D content creation. Existing text-to-3D methods commonly use implicit scene representations, which couple the geometry and appearance via volume rendering and are suboptimal in terms of recovering finer geometries and achieving photorealistic rendering; consequently, they are less effective for generating highquality 3D assets. In this work, we propose a new method of Fantasia3D for high- quality text-to-3D content creation. Key to Fantasia3D is the disentangled modeling and learning of geometry and appearance. For geometry learning, we rely on a hybrid scene representation, and propose to encode surface normal extracted from the representation as the input of the image diffusion model. For appearance modeling, we introduce the spatially varying bidirectional reflectance distribution function (BRDF) into the text-to-3D task, and learn the surface material for photorealistic rendering of the generated surface. Our disentangled framework is more compatible with popular graphics engines, supporting relighting, editing, and physical simulation of the generated 3D assets. We conduct thorough experiments that show the advantages of our method over existing ones under different text-to-3D task settings. Project page and source codes: https://fantasia3d.github.io/.
引用
收藏
页码:22189 / 22199
页数:11
相关论文
共 48 条
  • [1] Reflectance Modeling by Neural Texture Synthesis
    Aittala, Miika
    Aila, Timo
    Lehtinen, Jaakko
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (04):
  • [2] Practical SVBRDF Capture In The Frequency Domain
    Aittala, Miika
    Weyrich, Tim
    Lehtinen, Jaakko
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2013, 32 (04):
  • [3] [Anonymous], About Us
  • [4] B. O. Community, 2018, BLENDER A 3D MODELLI
  • [5] Balaji Yogesh, 2022, arXiv preprint arXiv:2211.01324
  • [6] Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields
    Barron, Jonathan T.
    Mildenhall, Ben
    Tancik, Matthew
    Hedman, Peter
    Martin-Brualla, Ricardo
    Srinivasan, Pratul P.
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5835 - 5844
  • [7] Bi Sai, 2020, ARXIV200803824
  • [8] Chen WZ, 2019, ADV NEUR IN, V32
  • [9] Chen Wenzheng, 2021, Adv. Neural Inf. Process. Syst., V34, P22834, DOI DOI 10.48550/ARXIV.2111.00140
  • [10] DDG-Based Optimization Metrics for Defect Prediction
    Chen, Yong
    Xu, Chao
    He, Jing Selena
    Xiao, Sheng
    Shen, Fanfan
    [J]. ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT I, 2022, 13338 : 3 - 16