Deep Reinforcement Learning of Volume-guided Progressive View Inpainting for 3D Point Scene Completion from a Single Depth Image

被引:34
作者
Han, Xiaoguang [1 ,3 ]
Zhang, Zhaoxuan [2 ,3 ]
Du, Dong [3 ,4 ]
Yang, Mingdai [1 ,3 ]
Yu, Jingming [5 ]
Pan, Pan [5 ]
Yang, Xin [2 ]
Liu, Ligang [4 ]
Xiong, Zixiang [6 ]
Cui, Shuguang [1 ,3 ]
机构
[1] CUHK Shenzhen, Shenzhen, Peoples R China
[2] DUT, Dalian, Peoples R China
[3] SRIBD, Shenzhen, Peoples R China
[4] USTC, Hefei, Anhui, Peoples R China
[5] Alibaba Grp, Hangzhou, Zhejiang, Peoples R China
[6] TAMU, College Stn, TX USA
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
基金
中国国家自然科学基金;
关键词
APPROXIMATE SYMMETRY DETECTION; ALGORITHM;
D O I
10.1109/CVPR.2019.00032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a deep reinforcement learning method of progressive view inpainting for 3D point scene completion under volume guidance, achieving high-quality scene reconstruction from only a single depth image with severe occlusion. Our approach is end-to-end, consisting of three modules: 3D scene volume reconstruction, 2D depth map inpainting, and multi-view selection for completion. Given a single depth image, our method first goes through the 3D volume branch to obtain a volumetric scene reconstruction as a guide to the next view inpainting step, which attempts to make up the missing information; the third step involves projecting the volume under the same view of the input, concatenating them to complete the current view depth, and integrating all depth into the point cloud. Since the occluded areas are unavailable, we resort to a deep Q-Network to glance around and pick the next best view for large hole completion progressively until a scene is adequately reconstructed while guaranteeing validity. All steps are learned jointly to achieve robust and consistent results. We perform qualitative and quantitative evaluations with extensive experiments on the SUNCG data, obtaining better results than the state of the art.
引用
收藏
页码:234 / 243
页数:10
相关论文
共 60 条
[1]  
[Anonymous], 2015, ARXIV150402437
[2]  
[Anonymous], 2012, ACM T GRAPHIC, DOI DOI 10.1145/2366145.2366155
[3]  
[Anonymous], ARXIV180608482
[4]  
[Anonymous], 2018, ARXIV180708772
[5]   Rare association of acromegaly with left atrial myxoma in Carney's complex due to novel PRKAR1A mutation [J].
Birla, Shweta ;
Aggarwal, Sameer ;
Sharma, Arundhati ;
Tandon, Nikhil .
ENDOCRINOLOGY DIABETES AND METABOLISM CASE REPORTS, 2014,
[6]  
Blaer PS, 2007, 2007 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-9, P423
[7]   Structure Recovery by Part Assembly [J].
Shen, Chao-Hui ;
Fu, Hongbo ;
Chen, Kang ;
Hu, Shi-Min .
ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (06)
[8]   Automatic Semantic Modeling of Indoor Scenes from Low-quality RGB-D Data using Contextual Information [J].
Chen, Kang ;
Lai, Yu-Kun ;
Wu, Yu-Xin ;
Martin, Ralph ;
Hu, Shi-Min .
ACM TRANSACTIONS ON GRAPHICS, 2014, 33 (06)
[9]   An improved edge detection algorithm for depth map inpainting [J].
Chen, Weihai ;
Yue, Haosong ;
Wang, Jianhua ;
Wu, Xingming .
OPTICS AND LASERS IN ENGINEERING, 2014, 55 :69-77
[10]   3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction [J].
Choy, Christopher B. ;
Xu, Danfei ;
Gwak, Jun Young ;
Chen, Kevin ;
Savarese, Silvio .
COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 :628-644