ForkNet: Multi-branch Volumetric Semantic Completion from a Single Depth Image

被引:49
作者
Wang, Yida [1 ]
Tan, David Joseph [2 ]
Navab, Nassir [1 ]
Tombari, Federico [1 ,2 ]
机构
[1] Tech Univ Munich, Munich, Germany
[2] Google Inc, Mountain View, CA USA
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年
关键词
D O I
10.1109/ICCV.2019.00870
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel model for 3D semantic completion from a single depth image, based on a single encoder and three separate generators used to reconstruct different geometric and semantic representations of the original and completed scene, all sharing the same latent space. To transfer information between the geometric and semantic branches of the network, we introduce paths between them concatenating features at corresponding network layers. Motivated by the limited amount of training samples from real scenes, an interesting attribute of our architecture is the capacity to supplement the existing dataset by generating a new training dataset with high quality, realistic scenes that even includes occlusion and real noise. We build the new dataset by sampling the features directly from latent space which generates a pair of partial volumetric surface and completed volumetric semantic surface. Moreover, we utilize multiple discriminators to increase the accuracy and realism of the reconstructions. We demonstrate the benefits of our approach on standard benchmarks for the two most common completion tasks: semantic 3D scene completion and 3D object completion.
引用
收藏
页码:8607 / 8616
页数:10
相关论文
共 40 条
[1]  
[Anonymous], 2015, P 37 GERM C PATT REC
[2]  
Chang Angel X., 2015, arXiv
[3]   3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction [J].
Choy, Christopher B. ;
Xu, Danfei ;
Gwak, Jun Young ;
Chen, Kevin ;
Savarese, Silvio .
COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 :628-644
[4]   ScanComplete: Large-Scale Scene Completion and Semantic Segmentation for 3D Scans [J].
Dai, Angela ;
Ritchie, Daniel ;
Bokeloh, Martin ;
Reed, Scott ;
Sturm, Juergen ;
Niessner, Matthias .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4578-4587
[5]   Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].
Dai, Angela ;
Qi, Charles Ruizhongtai ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554
[6]  
Demir U., 2018, ABS180307422 CORR
[7]  
Fan Haoqiang, 2017, P IEEE C COMP VIS PA, V2
[8]   Structured Prediction of Unobserved Voxels From a Single Depth Image [J].
Firman, Michael ;
Mac Aodha, Oisin ;
Julier, Simon ;
Brostow, Gabriel J. .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :5431-5440
[9]   3D Shape Induction from 2D Views of Multiple Objects [J].
Gadelha, Matheus ;
Maji, Subhransu ;
Wang, Rui .
PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, :402-411
[10]  
Ghahramani Zoubin, 2000, ADV NEURAL INFORM PR