ScanNet plus plus : A High-Fidelity Dataset of 3D Indoor Scenes

被引：28

作者：

Yeshwanth, Chandan ^{[1
]}

Liu, Yueh-Cheng ^{[1
]}

Niessner, Matthias ^{[1
]}

Dai, Angela ^{[1
]}

机构：

[1] Tech Univ Munich, Munich, Germany

来源：

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV | 2023年

关键词：

D O I：

10.1109/ICCV51070.2023.00008

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present ScanNet++, a large-scale dataset that couples together capture of high-quality and commodity-level geometry and color of indoor scenes. Each scene is captured with a high-end laser scanner at sub-millimeter resolution, along with registered 33-megapixel images from a DSLR camera, and RGB-D streams from an iPhone. Scene reconstructions are further annotated with an open vocabulary of semantics, with label-ambiguous scenarios explicitly annotated for comprehensive semantic understanding. ScanNet++ enables a new real-world benchmark for novel view synthesis, both from high-quality RGB capture, and importantly also from commodity-level images, in addition to a new benchmark for 3D semantic scene understanding that comprehensively encapsulates diverse and ambiguous semantic labeling scenarios. Currently, ScanNet++ contains 460 scenes, 280,000 captured DSLR images, and over 3.7M iPhone RGBD frames.

引用

页码：12 / 22

页数：11

共 53 条

[31]

Qi Charles Ruizhongtai, 2017, NeurIPS, V30, P8

[32] Language-Grounded Indoor 3D Semantic Segmentation in the Wild [J].

Rozenberszki, David ;

Litany, Or ;

Dai, Angela .

COMPUTER VISION - ECCV 2022, PT XXXIII, 2022, 13693 :125-141

[33] Structure-from-Motion Revisited [J].

Schonberger, Johannes L. ;

Frahm, Jan -Michael .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4104-4113

[34] Pixelwise View Selection for Unstructured Multi-View Stereo [J].

Schonberger, Johannes L. ;

Zheng, Enliang ;

Frahm, Jan-Michael ;

Pollefeys, Marc .

COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 :501-518

[35] A Multi-View Stereo Benchmark with High-Resolution Images and Multi-Camera Videos [J].

Schops, Thomas ;

Schonberger, Johannes L. ;

Galliani, Silvano ;

Sattler, Torsten ;

Schindler, Konrad ;

Pollefeys, Marc ;

Geiger, Andreas .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2538-2547

[36]

Siddiqui Y., 2022, ARXIV

[37]

Silberman N., 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), P601, DOI 10.1109/ICCVW.2011.6130298

[38]

Song SR, 2015, PROC CVPR IEEE, P567, DOI 10.1109/CVPR.2015.7298655

[39] Generalizable Patch-Based Neural Rendering [J].

Suhail, Mohammed ;

Esteves, Carlos ;

Sigal, Leonid ;

Makadia, Ameesh .

COMPUTER VISION - ECCV 2022, PT XXXII, 2022, 13692 :156-174

[40] Direct Voxel Grid Optimization: Super-fast Convergence for Radiance Fields Reconstruction [J].

Sun, Cheng ;

Sun, Min ;

Chen, Hwann-Tzong .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :5449-5459

← 1 2 3 4 5 6 →