SceneDreamer: Unbounded 3D Scene Generation From 2D Image Collections

被引:1
|
作者
Chen, Zhaoxi [1 ]
Wang, Guangcong [1 ]
Liu, Ziwei [1 ]
机构
[1] Nanyang Technol Univ, S Lab, Singapore 639798, Singapore
基金
新加坡国家研究基金会;
关键词
Three-dimensional displays; Solid modeling; Semantics; Cameras; Training; Rendering (computer graphics); Geometry; 3D generative model; GAN; neural rendering; unbounded scene generation;
D O I
10.1109/TPAMI.2023.3321857
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we present SceneDreamer, an unconditional generative model for unbounded 3D scenes, which synthesizes large-scale 3D landscapes from random noise. Our framework is learned from in-the-wild 2D image collections only, without any 3D annotations. At the core of SceneDreamer is a principled learning paradigm comprising: 1) an efficient yet expressive 3D scene representation, 2) a generative scene parameterization, and 3) an effective renderer that can leverage the knowledge from 2D images. Our approach begins with an efficient bird's-eye-view (BEV) representation generated from simplex noise, which includes a height field for surface elevation and a semantic field for detailed scene semantics. This BEV scene representation enables: 1) representing a 3D scene with quadratic complexity, 2) disentangled geometry and semantics, and 3) efficient training. Moreover, we propose a novel generative neural hash grid to parameterize the latent space based on 3D positions and scene semantics, aiming to encode generalizable features across various scenes. Lastly, a neural volumetric renderer, learned from 2D image collections through adversarial training, is employed to produce photorealistic images. Extensive experiments demonstrate the effectiveness of SceneDreamer and superiority over state-of-the-art methods in generating vivid yet diverse unbounded 3D worlds.
引用
收藏
页码:15562 / 15576
页数:15
相关论文
共 50 条
  • [31] Scene Graph Generation Using Depth, Spatial, and Visual Cues in 2D Images
    Kumar, Aiswarya S.
    Nair, Jyothisha J.
    IEEE ACCESS, 2022, 10 : 1968 - 1978
  • [32] Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding
    Ding, Runyu
    Yang, Jihan
    Xue, Chuhui
    Zhang, Wenqing
    Bai, Song
    Qi, Xiaojuan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 8517 - 8533
  • [33] A Comparative Study of 2D and 3D Digital Image Correlation Approaches for the Characterization and Numerical Analysis of Composite Materials
    Pisonero, Javier
    Lopez-Rebollo, Jorge
    Garcia-Martin, Roberto
    Rodriguez-Martin, Manuel
    Javier Sanchez-Aparicio, Luis
    Munoz-Nieto, A.
    Gonzalez-Aguilera, Diego
    IEEE ACCESS, 2021, 9 : 160675 - 160687
  • [34] 2D to 3D Evolutionary Deep Convolutional Neural Networks for Medical Image Segmentation
    Hassanzadeh, Tahereh
    Essam, Daryl
    Sarker, Ruhul
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (02) : 712 - 721
  • [35] DiffTF++: 3D-Aware Diffusion Transformer for Large-Vocabulary 3D Generation
    Cao, Ziang
    Hong, Fangzhou
    Wu, Tong
    Pan, Liang
    Liu, Ziwei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (04) : 3018 - 3030
  • [36] PIXGAN-Drone: 3D Avatar of Human Body Reconstruction From Multi-View 2D Images
    Rasheed, Ali Salim
    Jabberi, Marwa
    Hamdani, Tarek M.
    Alimi, Adel M.
    IEEE ACCESS, 2024, 12 : 74762 - 74776
  • [37] Joint Heterogeneous Feature Learning and Distribution Alignment for 2D Image-Based 3D Object Retrieval
    Su, Yuting
    Li, Yuqian
    Nie, Weizhi
    Song, Dan
    Liu, An-An
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (10) : 3765 - 3776
  • [38] Flattening-Net: Deep Regular 2D Representation for 3D Point Cloud Analysis
    Zhang, Qijian
    Hou, Junhui
    Qian, Yue
    Zeng, Yiming
    Zhang, Juyong
    He, Ying
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9726 - 9742
  • [39] From 2D to 3D geodesic-based garment matching
    Avots, Egils
    Madadi, Meysam
    Escalera, Sergio
    Gonzalez, Jordi
    Baro, Xavier
    Pallin, Paul
    Anbarjafari, Gholamreza
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (18) : 25829 - 25853
  • [40] No Reference 3D Mesh Quality Assessment Learned From Quality Scores on 2D Projections
    Ibork, Zaineb
    Nouri, Anass
    Lezoray, Olivier
    Charrier, Christophe
    Touahni, Raja
    IEEE ACCESS, 2024, 12 : 106924 - 106936