SceneDreamer: Unbounded 3D Scene Generation From 2D Image Collections
Cited by: 1
Authors: Chen, Zhaoxi [1]; Wang, Guangcong [1]; Liu, Ziwei [1]
Affiliations:
[1] Nanyang Technol Univ, S Lab, Singapore 639798, Singapore
Funding:
National Research Foundation, Singapore;
Keywords:
Three-dimensional displays;
Solid modeling;
Semantics;
Cameras;
Training;
Rendering (computer graphics);
Geometry;
3D generative model;
GAN;
neural rendering;
unbounded scene generation;
DOI:
10.1109/TPAMI.2023.3321857
CLC number:
TP18 [Artificial Intelligence Theory];
Discipline codes:
081104; 0812; 0835; 1405
Abstract:
In this work, we present SceneDreamer, an unconditional generative model for unbounded 3D scenes, which synthesizes large-scale 3D landscapes from random noise. Our framework is learned from in-the-wild 2D image collections only, without any 3D annotations. At the core of SceneDreamer is a principled learning paradigm comprising: 1) an efficient yet expressive 3D scene representation, 2) a generative scene parameterization, and 3) an effective renderer that can leverage the knowledge from 2D images. Our approach begins with an efficient bird's-eye-view (BEV) representation generated from simplex noise, which includes a height field for surface elevation and a semantic field for detailed scene semantics. This BEV scene representation enables: 1) representing a 3D scene with quadratic complexity, 2) disentangled geometry and semantics, and 3) efficient training. Moreover, we propose a novel generative neural hash grid to parameterize the latent space based on 3D positions and scene semantics, aiming to encode generalizable features across various scenes. Lastly, a neural volumetric renderer, learned from 2D image collections through adversarial training, is employed to produce photorealistic images. Extensive experiments demonstrate the effectiveness of SceneDreamer and its superiority over state-of-the-art methods in generating vivid yet diverse unbounded 3D worlds.
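The abstract's BEV scene representation (a noise-derived height field plus a semantic field) can be illustrated with a minimal sketch. This is not the authors' implementation: ordinary multi-octave value noise stands in for the paper's simplex noise, and the elevation-thresholded labels (water/grass/rock/snow) are a hypothetical labeling scheme chosen for illustration only.

```python
import numpy as np

def value_noise(size, res, rng):
    """2D value noise: a coarse random lattice, bilinearly upsampled."""
    lattice = rng.random((res + 1, res + 1))
    t = np.linspace(0.0, res, size, endpoint=False)
    i = t.astype(int)          # lattice cell index per output row/column
    f = t - i                  # fractional position within the cell
    # corner values via outer ("grid") indexing
    v00 = lattice[np.ix_(i, i)]
    v01 = lattice[np.ix_(i, i + 1)]
    v10 = lattice[np.ix_(i + 1, i)]
    v11 = lattice[np.ix_(i + 1, i + 1)]
    fy, fx = f[:, None], f[None, :]
    return (v00 * (1 - fy) * (1 - fx) + v01 * (1 - fy) * fx
            + v10 * fy * (1 - fx) + v11 * fy * fx)

def bev_scene(size=256, seed=0):
    """Toy BEV representation: height field + semantic field from noise."""
    rng = np.random.default_rng(seed)
    # multi-octave noise as a stand-in for the paper's simplex noise
    height = sum(value_noise(size, 2 ** o, rng) / 2 ** o for o in range(1, 5))
    height /= height.max()     # normalize elevation to [0, 1]
    # hypothetical semantics thresholded from elevation:
    # 0 = water, 1 = grass, 2 = rock, 3 = snow
    semantics = np.digitize(height, [0.35, 0.6, 0.8])
    return height, semantics
```

Note how both fields are 2D arrays of side `size`, which is what gives the representation its quadratic (rather than cubic) complexity in the scene extent.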
Pages: 15562-15576
Page count: 15