Im2Pano3D: Extrapolating 360° Structure and Semantics Beyond the Field of View

被引:36
作者
Song, Shuran [1 ]
Zeng, Andy [1 ]
Chang, Angel X. [1 ]
Savva, Manolis [1 ]
Savarese, Silvio [2 ]
Funkhouser, Thomas [1 ]
机构
[1] Princeton Univ, Princeton, NJ 08544 USA
[2] Stanford Univ, Stanford, CA 94305 USA
来源
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年
关键词
D O I
10.1109/CVPR.2018.00405
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present Im2Pano3D, a convolutional neural network that generates a dense prediction of 3D structure and a probability distribution of semantic labels for a full 360 panoramic view of an indoor scene when given only a partial observation (<= 50%) in the form of an RGB-D image. To make this possible, Im2Pano3D leverages strong contextual priors learned from large-scale synthetic and real world indoor scenes. To ease the prediction of 3D structure, we propose to parameterize 3D surfaces with their plane equations and train the model to predict these parameters directly. To provide meaningful training supervision, we use multiple loss functions that consider both pixel level accuracy and global context consistency. Experiments demonstrate that Im2Pano3D is able to predict the semantics and 3D structure of the unobserved scene with more than 56% pixel accuracy and less than 0.52m average distance error, which is significantly better than alternative approaches.
引用
收藏
页码:3847 / 3856
页数:10
相关论文
共 40 条
[1]  
[Anonymous], 2017, ARXIV170403489
[2]  
[Anonymous], FULLY CONVOLUTIONAL
[3]  
[Anonymous], PROC CVPR IEEE
[4]  
[Anonymous], 2015, ROCK 3D MOD
[5]  
[Anonymous], 2017, P IEEE C COMP VIS PA
[6]  
[Anonymous], 2015, 3D SHAPENETS DEEP RE
[7]  
[Anonymous], PROC CVPR IEEE
[8]  
[Anonymous], COMP VIS 1998 6 INT
[9]  
[Anonymous], 2017, Matterport3d: Learning from rgb-d data in indoor environments
[10]  
[Anonymous], ICCV 2017