SceneDreamer: Unbounded 3D Scene Generation From 2D Image Collections

被引：1

作者：

Chen, Zhaoxi ^{[1
]}

Wang, Guangcong ^{[1
]}

Liu, Ziwei ^{[1
]}

机构：

[1] Nanyang Technol Univ, S Lab, Singapore 639798, Singapore

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2023年 / 45卷 / 12期

基金：

新加坡国家研究基金会;

关键词：

Three-dimensional displays; Solid modeling; Semantics; Cameras; Training; Rendering (computer graphics); Geometry; 3D generative model; GAN; neural rendering; unbounded scene generation;

D O I：

10.1109/TPAMI.2023.3321857

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this work, we present SceneDreamer, an unconditional generative model for unbounded 3D scenes, which synthesizes large-scale 3D landscapes from random noise. Our framework is learned from in-the-wild 2D image collections only, without any 3D annotations. At the core of SceneDreamer is a principled learning paradigm comprising: 1) an efficient yet expressive 3D scene representation, 2) a generative scene parameterization, and 3) an effective renderer that can leverage the knowledge from 2D images. Our approach begins with an efficient bird's-eye-view (BEV) representation generated from simplex noise, which includes a height field for surface elevation and a semantic field for detailed scene semantics. This BEV scene representation enables: 1) representing a 3D scene with quadratic complexity, 2) disentangled geometry and semantics, and 3) efficient training. Moreover, we propose a novel generative neural hash grid to parameterize the latent space based on 3D positions and scene semantics, aiming to encode generalizable features across various scenes. Lastly, a neural volumetric renderer, learned from 2D image collections through adversarial training, is employed to produce photorealistic images. Extensive experiments demonstrate the effectiveness of SceneDreamer and superiority over state-of-the-art methods in generating vivid yet diverse unbounded 3D worlds.

引用

页码：15562 / 15576

页数：15

共 50 条

[21] Learning Transferable and Discriminative Representations for 2D Image-Based 3D Model Retrieval
Zhou, Yaqian
Liu, Yu
Zhou, Heyu
Cheng, Zhiyong
Li, Xuanya
Liu, An-An
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 7147 - 7159
[22] Lost & Found: Tracking Changes From Egocentric Observations in 3D Dynamic Scene Graphs
Behrens, Tjark
Zurbrugg, Rene
Pollefeys, Marc
Bauer, Zuria
Blum, Hermann
IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (04): : 3739 - 3746
[23] Joint Intermediate Domain Generation and Distribution Alignment for 2D Image-Based 3D Objects Retrieval
Su, Yuting
Li, Yuqian
Song, Dan
Liu, Anan
Nie, Jie
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2127 - 2138
[24] Robust Shape Fitting for 3D Scene Abstraction
Kluger, Florian
Brachmann, Eric
Yang, Michael Ying
Rosenhahn, Bodo
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 6306 - 6325
[25] Pixel2Mesh: 3D Mesh Model Generation via Image Guided Deformation
Wang, Nanyang
Zhang, Yinda
Li, Zhuwen
Fu, Yanwei
Yu, Hang
Liu, Wei
Xue, Xiangyang
Jiang, Yu-Gang
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (10) : 3600 - 3613
[26] Single View 3D Reconstruction Based on Improved RGB-D Image
Cao, Mingwei
Zheng, Liping
Liu, Xiaoping
IEEE SENSORS JOURNAL, 2020, 20 (20) : 12049 - 12056
[27] Fusion of 4D Point Clouds From a 2D Profilometer and a 3D Lidar on an Excavator
Immonen, Matti
Niskanen, Ilpo
Hallman, Lauri
Keranen, Pekka
Hiltunen, Mikko
Kostamovaara, Juha
Heikkila, Rauno
IEEE SENSORS JOURNAL, 2021, 21 (15) : 17200 - 17206
[28] 3D Image Generation From Single Image Using Color Filtered Aperture and 2.1D Sketch-A Computational 3D Imaging System and Qualitative Analysis
Deshpande, Rashmi R.
Madhavi, Ch Renu
Bhatt, Mahabaleswara Ram
IEEE ACCESS, 2021, 9 : 93580 - 93592
[29] Pixel2Mesh++: 3D Mesh Generation and Refinement From Multi-View Images
Wen, Chao
Zhang, Yinda
Cao, Chenjie
Li, Zhuwen
Xue, Xiangyang
Fu, Yanwei
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 2166 - 2180
[30] Explore Contextual Information for 3D Scene Graph Generation
Liu, Yuanyuan
Long, Chengjiang
Zhang, Zhaoxuan
Liu, Bokai
Zhang, Qiang
Yin, Baocai
Yang, Xin
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2023, 29 (12) : 5556 - 5568

← 1 2 3 4 5 →