SceneDreamer: Unbounded 3D Scene Generation From 2D Image Collections

被引:1
|
作者
Chen, Zhaoxi [1 ]
Wang, Guangcong [1 ]
Liu, Ziwei [1 ]
机构
[1] Nanyang Technol Univ, S Lab, Singapore 639798, Singapore
基金
新加坡国家研究基金会;
关键词
Three-dimensional displays; Solid modeling; Semantics; Cameras; Training; Rendering (computer graphics); Geometry; 3D generative model; GAN; neural rendering; unbounded scene generation;
D O I
10.1109/TPAMI.2023.3321857
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we present SceneDreamer, an unconditional generative model for unbounded 3D scenes, which synthesizes large-scale 3D landscapes from random noise. Our framework is learned from in-the-wild 2D image collections only, without any 3D annotations. At the core of SceneDreamer is a principled learning paradigm comprising: 1) an efficient yet expressive 3D scene representation, 2) a generative scene parameterization, and 3) an effective renderer that can leverage the knowledge from 2D images. Our approach begins with an efficient bird's-eye-view (BEV) representation generated from simplex noise, which includes a height field for surface elevation and a semantic field for detailed scene semantics. This BEV scene representation enables: 1) representing a 3D scene with quadratic complexity, 2) disentangled geometry and semantics, and 3) efficient training. Moreover, we propose a novel generative neural hash grid to parameterize the latent space based on 3D positions and scene semantics, aiming to encode generalizable features across various scenes. Lastly, a neural volumetric renderer, learned from 2D image collections through adversarial training, is employed to produce photorealistic images. Extensive experiments demonstrate the effectiveness of SceneDreamer and superiority over state-of-the-art methods in generating vivid yet diverse unbounded 3D worlds.
引用
收藏
页码:15562 / 15576
页数:15
相关论文
共 50 条
  • [1] SSR-2D: Semantic 3D Scene Reconstruction From 2D Images
    Huang, Junwen
    Artemov, Alexey
    Chen, Yujin
    Zhi, Shuaifeng
    Xu, Kai
    Niessner, Matthias
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 8486 - 8501
  • [2] Semantic Scene Completion With 2D and 3D Feature Fusion
    Park, Sang-Min
    Ha, Jong-Eun
    IEEE ACCESS, 2024, 12 : 141594 - 141603
  • [3] Understanding Pixel-Level 2D Image Semantics With 3D Keypoint Knowledge Engine
    You, Yang
    Li, Chengkun
    Lou, Yujing
    Cheng, Zhoujun
    Li, Liangwei
    Ma, Lizhuang
    Wang, Weiming
    Lu, Cewu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5780 - 5795
  • [4] KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D
    Liao, Yiyi
    Xie, Jun
    Geiger, Andreas
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3292 - 3310
  • [5] 3D Scene Graph Generation From Point Clouds
    Wei, Wenwen
    Wei, Ping
    Qin, Jialu
    Liao, Zhimin
    Wang, Shuaijie
    Cheng, Xiang
    Liu, Meiqin
    Zheng, Nanning
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5358 - 5368
  • [6] Towards Accurate Reconstruction of 3D Scene Shape From A Single Monocular Image
    Yin, Wei
    Zhang, Jianming
    Wang, Oliver
    Niklaus, Simon
    Chen, Simon
    Liu, Yifan
    Shen, Chunhua
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (05) : 6480 - 6494
  • [7] Text2NeRF: Text-Driven 3D Scene Generation With Neural Radiance Fields
    Zhang, Jingbo
    Li, Xiaoyu
    Wan, Ziyu
    Wang, Can
    Liao, Jing
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (12) : 7749 - 7762
  • [8] 3D Face Reconstruction From A Single Image Assisted by 2D Face Images in the Wild
    Tu, Xiaoguang
    Zhao, Jian
    Xie, Mei
    Jiang, Zihang
    Balamurugan, Akshaya
    Luo, Yao
    Zhao, Yang
    He, Lingxiao
    Ma, Zheng
    Feng, Jiashi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 (23) : 1160 - 1172
  • [9] 3D Layout Estimation via Weakly Supervised Learning of Plane Parameters From 2D Segmentation
    Zhang, Weidong
    Zhang, Youmei
    Song, Ran
    Liu, Ying
    Zhang, Wei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 868 - 879
  • [10] Recurrent Diffusion for 3D Point Cloud Generation From a Single Image
    Zhou, Yan
    Ye, Dewang
    Zhang, Huaidong
    Xu, Xuemiao
    Sun, Huajie
    Xu, Yewen
    Liu, Xiangyu
    Zhou, Yuexia
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 1753 - 1765