SceneDreamer: Unbounded 3D Scene Generation From 2D Image Collections

Cited by: 1
Authors
Chen, Zhaoxi [1]
Wang, Guangcong [1]
Liu, Ziwei [1]
Affiliation
[1] Nanyang Technological University, S-Lab, Singapore 639798, Singapore
Funding
National Research Foundation, Singapore;
Keywords
Three-dimensional displays; Solid modeling; Semantics; Cameras; Training; Rendering (computer graphics); Geometry; 3D generative model; GAN; neural rendering; unbounded scene generation;
DOI
10.1109/TPAMI.2023.3321857
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this work, we present SceneDreamer, an unconditional generative model for unbounded 3D scenes, which synthesizes large-scale 3D landscapes from random noise. Our framework is learned from in-the-wild 2D image collections only, without any 3D annotations. At the core of SceneDreamer is a principled learning paradigm comprising: 1) an efficient yet expressive 3D scene representation, 2) a generative scene parameterization, and 3) an effective renderer that can leverage the knowledge from 2D images. Our approach begins with an efficient bird's-eye-view (BEV) representation generated from simplex noise, which includes a height field for surface elevation and a semantic field for detailed scene semantics. This BEV scene representation enables: 1) representing a 3D scene with quadratic complexity, 2) disentangled geometry and semantics, and 3) efficient training. Moreover, we propose a novel generative neural hash grid to parameterize the latent space based on 3D positions and scene semantics, aiming to encode generalizable features across various scenes. Lastly, a neural volumetric renderer, learned from 2D image collections through adversarial training, is employed to produce photorealistic images. Extensive experiments demonstrate the effectiveness of SceneDreamer and superiority over state-of-the-art methods in generating vivid yet diverse unbounded 3D worlds.
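The bird's-eye-view (BEV) representation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a bilinearly interpolated value noise as a stand-in for simplex noise, and it derives the semantic field by thresholding height, which is a simplification chosen for this example. The names `value_noise`, `semantic_label`, `make_bev`, and `storage_cells`, the class names, and the thresholds are all hypothetical. The last helper only counts stored cells to show why two 2D maps grow quadratically with scene size while a dense voxel grid grows cubically.

```python
import random


def value_noise(size, cells, seed):
    """Smooth lattice noise in [0, 1]; an assumed stand-in for simplex noise."""
    rng = random.Random(seed)
    lattice = [[rng.random() for _ in range(cells + 1)] for _ in range(cells + 1)]
    grid = []
    for y in range(size):
        fy = y * cells / size
        y0 = int(fy)
        ty = fy - y0
        row = []
        for x in range(size):
            fx = x * cells / size
            x0 = int(fx)
            tx = fx - x0
            # Bilinear interpolation between the four surrounding lattice values.
            top = lattice[y0][x0] * (1 - tx) + lattice[y0][x0 + 1] * tx
            bottom = lattice[y0 + 1][x0] * (1 - tx) + lattice[y0 + 1][x0 + 1] * tx
            row.append(top * (1 - ty) + bottom * ty)
        grid.append(row)
    return grid


def semantic_label(height):
    """Map a height value to a class; thresholds and classes are hypothetical."""
    if height < 0.35:
        return "water"
    if height < 0.50:
        return "sand"
    if height < 0.75:
        return "grass"
    return "rock"


def make_bev(size=64, cells=4, seed=0):
    """Return (height_field, semantic_field), two size x size 2D maps."""
    height = value_noise(size, cells, seed)
    semantic = [[semantic_label(h) for h in row] for row in height]
    return height, semantic


def storage_cells(size):
    """Count stored cells: two BEV maps versus one dense voxel grid."""
    return {"bev": 2 * size * size, "dense_voxels": size ** 3}


if __name__ == "__main__":
    height, semantic = make_bev(size=64, cells=4, seed=0)
    print(len(height), len(height[0]), semantic[0][0])
    print(storage_cells(64))
```

Keeping height and semantics in separate maps mirrors the disentanglement the abstract mentions: either map can be edited or resampled without touching the other.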
Pages: 15562 - 15576
Number of pages: 15
Related papers
50 in total
  • [21] Learning Transferable and Discriminative Representations for 2D Image-Based 3D Model Retrieval
    Zhou, Yaqian
    Liu, Yu
    Zhou, Heyu
    Cheng, Zhiyong
    Li, Xuanya
    Liu, An-An
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 7147 - 7159
  • [22] Lost & Found: Tracking Changes From Egocentric Observations in 3D Dynamic Scene Graphs
    Behrens, Tjark
    Zurbrugg, Rene
    Pollefeys, Marc
    Bauer, Zuria
    Blum, Hermann
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (04) : 3739 - 3746
  • [23] Joint Intermediate Domain Generation and Distribution Alignment for 2D Image-Based 3D Objects Retrieval
    Su, Yuting
    Li, Yuqian
    Song, Dan
    Liu, Anan
    Nie, Jie
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2127 - 2138
  • [24] Robust Shape Fitting for 3D Scene Abstraction
    Kluger, Florian
    Brachmann, Eric
    Yang, Michael Ying
    Rosenhahn, Bodo
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 6306 - 6325
  • [25] Pixel2Mesh: 3D Mesh Model Generation via Image Guided Deformation
    Wang, Nanyang
    Zhang, Yinda
    Li, Zhuwen
    Fu, Yanwei
    Yu, Hang
    Liu, Wei
    Xue, Xiangyang
    Jiang, Yu-Gang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (10) : 3600 - 3613
  • [26] Single View 3D Reconstruction Based on Improved RGB-D Image
    Cao, Mingwei
    Zheng, Liping
    Liu, Xiaoping
    IEEE SENSORS JOURNAL, 2020, 20 (20) : 12049 - 12056
  • [27] Fusion of 4D Point Clouds From a 2D Profilometer and a 3D Lidar on an Excavator
    Immonen, Matti
    Niskanen, Ilpo
    Hallman, Lauri
    Keranen, Pekka
    Hiltunen, Mikko
    Kostamovaara, Juha
    Heikkila, Rauno
    IEEE SENSORS JOURNAL, 2021, 21 (15) : 17200 - 17206
  • [28] 3D Image Generation From Single Image Using Color Filtered Aperture and 2.1D Sketch-A Computational 3D Imaging System and Qualitative Analysis
    Deshpande, Rashmi R.
    Madhavi, Ch Renu
    Bhatt, Mahabaleswara Ram
    IEEE ACCESS, 2021, 9 : 93580 - 93592
  • [29] Pixel2Mesh++: 3D Mesh Generation and Refinement From Multi-View Images
    Wen, Chao
    Zhang, Yinda
    Cao, Chenjie
    Li, Zhuwen
    Xue, Xiangyang
    Fu, Yanwei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 2166 - 2180
  • [30] Explore Contextual Information for 3D Scene Graph Generation
    Liu, Yuanyuan
    Long, Chengjiang
    Zhang, Zhaoxuan
    Liu, Bokai
    Zhang, Qiang
    Yin, Baocai
    Yang, Xin
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2023, 29 (12) : 5556 - 5568