SceneDreamer: Unbounded 3D Scene Generation From 2D Image Collections

被引：1

作者：

Chen, Zhaoxi ^{[1
]}

Wang, Guangcong ^{[1
]}

Liu, Ziwei ^{[1
]}

机构：

[1] Nanyang Technol Univ, S Lab, Singapore 639798, Singapore

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2023年 / 45卷 / 12期

基金：

新加坡国家研究基金会;

关键词：

Three-dimensional displays; Solid modeling; Semantics; Cameras; Training; Rendering (computer graphics); Geometry; 3D generative model; GAN; neural rendering; unbounded scene generation;

D O I：

10.1109/TPAMI.2023.3321857

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this work, we present SceneDreamer, an unconditional generative model for unbounded 3D scenes, which synthesizes large-scale 3D landscapes from random noise. Our framework is learned from in-the-wild 2D image collections only, without any 3D annotations. At the core of SceneDreamer is a principled learning paradigm comprising: 1) an efficient yet expressive 3D scene representation, 2) a generative scene parameterization, and 3) an effective renderer that can leverage the knowledge from 2D images. Our approach begins with an efficient bird's-eye-view (BEV) representation generated from simplex noise, which includes a height field for surface elevation and a semantic field for detailed scene semantics. This BEV scene representation enables: 1) representing a 3D scene with quadratic complexity, 2) disentangled geometry and semantics, and 3) efficient training. Moreover, we propose a novel generative neural hash grid to parameterize the latent space based on 3D positions and scene semantics, aiming to encode generalizable features across various scenes. Lastly, a neural volumetric renderer, learned from 2D image collections through adversarial training, is employed to produce photorealistic images. Extensive experiments demonstrate the effectiveness of SceneDreamer and superiority over state-of-the-art methods in generating vivid yet diverse unbounded 3D worlds.

引用

页码：15562 / 15576

页数：15

共 50 条

[31] Scene Graph Generation Using Depth, Spatial, and Visual Cues in 2D Images
Kumar, Aiswarya S.
Nair, Jyothisha J.
IEEE ACCESS, 2022, 10 : 1968 - 1978
[32] Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding
Ding, Runyu
Yang, Jihan
Xue, Chuhui
Zhang, Wenqing
Bai, Song
Qi, Xiaojuan
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 8517 - 8533
[33] A Comparative Study of 2D and 3D Digital Image Correlation Approaches for the Characterization and Numerical Analysis of Composite Materials
Pisonero, Javier
Lopez-Rebollo, Jorge
Garcia-Martin, Roberto
Rodriguez-Martin, Manuel
Javier Sanchez-Aparicio, Luis
Munoz-Nieto, A.
Gonzalez-Aguilera, Diego
IEEE ACCESS, 2021, 9 : 160675 - 160687
[34] 2D to 3D Evolutionary Deep Convolutional Neural Networks for Medical Image Segmentation
Hassanzadeh, Tahereh
Essam, Daryl
Sarker, Ruhul
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (02) : 712 - 721
[35] DiffTF++: 3D-Aware Diffusion Transformer for Large-Vocabulary 3D Generation
Cao, Ziang
Hong, Fangzhou
Wu, Tong
Pan, Liang
Liu, Ziwei
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (04) : 3018 - 3030
[36] PIXGAN-Drone: 3D Avatar of Human Body Reconstruction From Multi-View 2D Images
Rasheed, Ali Salim
Jabberi, Marwa
Hamdani, Tarek M.
Alimi, Adel M.
IEEE ACCESS, 2024, 12 : 74762 - 74776
[37] Joint Heterogeneous Feature Learning and Distribution Alignment for 2D Image-Based 3D Object Retrieval
Su, Yuting
Li, Yuqian
Nie, Weizhi
Song, Dan
Liu, An-An
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (10) : 3765 - 3776
[38] Flattening-Net: Deep Regular 2D Representation for 3D Point Cloud Analysis
Zhang, Qijian
Hou, Junhui
Qian, Yue
Zeng, Yiming
Zhang, Juyong
He, Ying
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9726 - 9742
[39] From 2D to 3D geodesic-based garment matching
Avots, Egils
Madadi, Meysam
Escalera, Sergio
Gonzalez, Jordi
Baro, Xavier
Pallin, Paul
Anbarjafari, Gholamreza
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (18) : 25829 - 25853
[40] No Reference 3D Mesh Quality Assessment Learned From Quality Scores on 2D Projections
Ibork, Zaineb
Nouri, Anass
Lezoray, Olivier
Charrier, Christophe
Touahni, Raja
IEEE ACCESS, 2024, 12 : 106924 - 106936

← 1 2 3 4 5 →