SceneDreamer: Unbounded 3D Scene Generation From 2D Image Collections

被引:1
|
作者
Chen, Zhaoxi [1 ]
Wang, Guangcong [1 ]
Liu, Ziwei [1 ]
机构
[1] Nanyang Technol Univ, S Lab, Singapore 639798, Singapore
基金
新加坡国家研究基金会;
关键词
Three-dimensional displays; Solid modeling; Semantics; Cameras; Training; Rendering (computer graphics); Geometry; 3D generative model; GAN; neural rendering; unbounded scene generation;
D O I
10.1109/TPAMI.2023.3321857
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we present SceneDreamer, an unconditional generative model for unbounded 3D scenes, which synthesizes large-scale 3D landscapes from random noise. Our framework is learned from in-the-wild 2D image collections only, without any 3D annotations. At the core of SceneDreamer is a principled learning paradigm comprising: 1) an efficient yet expressive 3D scene representation, 2) a generative scene parameterization, and 3) an effective renderer that can leverage the knowledge from 2D images. Our approach begins with an efficient bird's-eye-view (BEV) representation generated from simplex noise, which includes a height field for surface elevation and a semantic field for detailed scene semantics. This BEV scene representation enables: 1) representing a 3D scene with quadratic complexity, 2) disentangled geometry and semantics, and 3) efficient training. Moreover, we propose a novel generative neural hash grid to parameterize the latent space based on 3D positions and scene semantics, aiming to encode generalizable features across various scenes. Lastly, a neural volumetric renderer, learned from 2D image collections through adversarial training, is employed to produce photorealistic images. Extensive experiments demonstrate the effectiveness of SceneDreamer and superiority over state-of-the-art methods in generating vivid yet diverse unbounded 3D worlds.
引用
收藏
页码:15562 / 15576
页数:15
相关论文
共 50 条
  • [41] Cartesian Partitioning Models for 2D and 3D Parallel SpGEMM Algorithms
    Demirci, Gunduz Vehbi
    Aykanat, Cevdet
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (12) : 2763 - 2775
  • [42] Transfer Learning for Nonrigid 2D/3D Cardiovascular Images Registration
    Guan, Shaoya
    Wang, Tianmiao
    Sun, Kai
    Meng, Cai
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (09) : 3300 - 3309
  • [43] Deformable Linear Objects 3D Shape Estimation and Tracking From Multiple 2D Views
    Caporali, Alessio
    Galassi, Kevin
    Palli, Gianluca
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (06) : 3851 - 3858
  • [44] Self-Supervised Auxiliary Domain Alignment for Unsupervised 2D Image-Based 3D Shape Retrieval
    Liu, An-An
    Zhang, Chenyu
    Li, Wenhui
    Gao, Xingyu
    Sun, Zhengya
    Li, Xuanya
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) : 8809 - 8821
  • [45] Path Tracing in 2D, 3D, and Physicalized Networks
    McGuffin, Michael J.
    Servera, Ryan
    Forest, Marie
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (07) : 3564 - 3577
  • [46] 3D Shape Estimation from 2D Landmarks: A Convex Relaxation Approach
    Zhou, Xiaowei
    Leonardos, Spyridon
    Hu, Xiaoyan
    Daniilidis, Kostas
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 4447 - 4455
  • [47] Extracting 3D Parametric Curves from 2D Images of Helical Objects
    Willcocks, Chris G.
    Jackson, Philip T. G.
    Nelson, Carl J.
    Obara, Boguslaw
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (09) : 1757 - 1769
  • [48] Learning Typical 3D Representation from a Single 2D Correspondence Using 2D-3D Transformation Network
    Ul Islam, Naeem
    Lee, Sukhan
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM) 2019, 2019, 935 : 440 - 455
  • [49] PL-GM:RGB-D SLAM With a Novel 2D and 3D Geometric Constraint Model of Point and Line Features
    Zhang, Chenyang
    IEEE ACCESS, 2021, 9 : 9958 - 9971
  • [50] Swin3D: A pretrained transformer backbone for 3D indoor scene understanding
    Yang, Yu-Qi
    Guo, Yu-Xiao
    Xiong, Jian-Yu
    Liu, Yang
    Pan, Hao
    Wang, Peng-Shuai
    Tong, Xin
    Guo, Baining
    COMPUTATIONAL VISUAL MEDIA, 2025, 11 (01): : 83 - 101