ScanNet plus plus : A High-Fidelity Dataset of 3D Indoor Scenes

被引:28
作者
Yeshwanth, Chandan [1 ]
Liu, Yueh-Cheng [1 ]
Niessner, Matthias [1 ]
Dai, Angela [1 ]
机构
[1] Tech Univ Munich, Munich, Germany
来源
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV | 2023年
关键词
D O I
10.1109/ICCV51070.2023.00008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present ScanNet++, a large-scale dataset that couples together capture of high-quality and commodity-level geometry and color of indoor scenes. Each scene is captured with a high-end laser scanner at sub-millimeter resolution, along with registered 33-megapixel images from a DSLR camera, and RGB-D streams from an iPhone. Scene reconstructions are further annotated with an open vocabulary of semantics, with label-ambiguous scenarios explicitly annotated for comprehensive semantic understanding. ScanNet++ enables a new real-world benchmark for novel view synthesis, both from high-quality RGB capture, and importantly also from commodity-level images, in addition to a new benchmark for 3D semantic scene understanding that comprehensively encapsulates diverse and ambiguous semantic labeling scenarios. Currently, ScanNet++ contains 460 scenes, 280,000 captured DSLR images, and over 3.7M iPhone RGBD frames.
引用
收藏
页码:12 / 22
页数:11
相关论文
共 53 条
[31]  
Qi Charles Ruizhongtai, 2017, NeurIPS, V30, P8
[32]   Language-Grounded Indoor 3D Semantic Segmentation in the Wild [J].
Rozenberszki, David ;
Litany, Or ;
Dai, Angela .
COMPUTER VISION - ECCV 2022, PT XXXIII, 2022, 13693 :125-141
[33]   Structure-from-Motion Revisited [J].
Schonberger, Johannes L. ;
Frahm, Jan -Michael .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4104-4113
[34]   Pixelwise View Selection for Unstructured Multi-View Stereo [J].
Schonberger, Johannes L. ;
Zheng, Enliang ;
Frahm, Jan-Michael ;
Pollefeys, Marc .
COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 :501-518
[35]   A Multi-View Stereo Benchmark with High-Resolution Images and Multi-Camera Videos [J].
Schops, Thomas ;
Schonberger, Johannes L. ;
Galliani, Silvano ;
Sattler, Torsten ;
Schindler, Konrad ;
Pollefeys, Marc ;
Geiger, Andreas .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2538-2547
[36]  
Siddiqui Y., 2022, ARXIV
[37]  
Silberman N., 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), P601, DOI 10.1109/ICCVW.2011.6130298
[38]  
Song SR, 2015, PROC CVPR IEEE, P567, DOI 10.1109/CVPR.2015.7298655
[39]   Generalizable Patch-Based Neural Rendering [J].
Suhail, Mohammed ;
Esteves, Carlos ;
Sigal, Leonid ;
Makadia, Ameesh .
COMPUTER VISION - ECCV 2022, PT XXXII, 2022, 13692 :156-174
[40]   Direct Voxel Grid Optimization: Super-fast Convergence for Radiance Fields Reconstruction [J].
Sun, Cheng ;
Sun, Min ;
Chen, Hwann-Tzong .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :5449-5459