Semantic Scene Completion from a Single Depth Image

被引:756
作者
Song, Shuran [1 ]
Yu, Fisher [1 ]
Zeng, Andy [1 ]
Chang, Angel X. [1 ]
Savva, Manolis [1 ]
Funkhouser, Thomas [1 ]
机构
[1] Princeton Univ, Princeton, NJ 08544 USA
来源
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) | 2017年
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR.2017.28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper focuses on semantic scene completion, a task for producing a complete 3D voxel representation of volumetric occupancy and semantic labels for a scene from a single-view depth map observation. Previous work has considered scene completion and semantic labeling of depth maps separately. However, we observe that these two problems are tightly intertwined. To leverage the coupled nature of these two tasks, we introduce the semantic scene completion network (SSCNet), an end-to-end 3D convolutional network that takes a single depth image as input and simultaneously outputs occupancy and semantic labels for all voxels in the camera view frustum. Our network uses a dilation-based 3D context module to efficiently expand the receptive field and enable 3D context learning. To train our network, we construct SUNCG - a manually created large-scale dataset of synthetic 3D scenes with dense volumetric annotations. Our experiments demonstrate that the joint model outperforms methods addressing each task in isolation and outperforms alternative approaches on the semantic scene completion task. The dataset and code is available at http://sscnet.cs.princeton.edu.
引用
收藏
页码:190 / 198
页数:9
相关论文
共 35 条
[1]  
[Anonymous], 3DMATCH LEARNING MAT
[2]  
[Anonymous], 2014, COMPUTER GRAPHICS FO
[3]  
[Anonymous], 2015, P 37 GERM C PATT REC
[4]  
Blaha Maros., Large-scale semantic 3d reconstruction: an adaptive multi-resolution model for multi-class volumetric labeling
[5]  
Chang A. X., 2015, ARXIV
[6]   Structured Prediction of Unobserved Voxels From a Single Depth Image [J].
Firman, Michael ;
Mac Aodha, Oisin ;
Julier, Simon ;
Brostow, Gabriel J. .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :5431-5440
[7]  
Guo Ruiqi., 2015, ICCV
[8]   Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images [J].
Gupta, Saurabh ;
Arbelaez, Pablo ;
Malik, Jitendra .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :564-571
[9]  
Gupta Saurabh., 2015, Computer Vision and Pattern Recognition CVPR
[10]   Joint 3D Scene Reconstruction and Class Segmentation [J].
Haene, Christian ;
Zach, Christopher ;
Cohen, Andrea ;
Angst, Roland ;
Pollefeys, Marc .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :97-104