Object-RPE: Dense 3D reconstruction and pose estimation with convolutional neural networks

被引:23
作者
Hoang, Dinh-Cuong [1 ]
Lilienthal, Achim J. [1 ]
Stoyanov, Todor [1 ]
机构
[1] Orebro Univ, Ctr Appl Autonomous Sensor Syst AASS, Orebro, Sweden
关键词
Object pose estimation; 3D reconstruction; Semantic mapping; 3D registration;
D O I
10.1016/j.robot.2020.103632
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present an approach for recognizing objects present in a scene and estimating their full pose by means of an accurate 3D instance-aware semantic reconstruction. Our framework couples convolutional neural networks (CNNs) and a state-of-the-art dense Simultaneous Localization and Mapping (SLAM) system, ElasticFusion (Whelan et al., 2016), to achieve both high-quality semantic reconstruction as well as robust 6D pose estimation for relevant objects. We leverage the pipeline of ElasticFusion as a backbone, and propose a joint geometric and photometric error function with per-pixel adaptive weights. While the main trend in CNN-based 6D pose estimation has been to infer object's position and orientation from single views of the scene, our approach explores performing pose estimation from multiple viewpoints, under the conjecture that combining multiple predictions can improve the robustness of an object detection system. The resulting system is capable of producing high-quality instance-aware semantic reconstructions of room-sized environments, as well as accurately detecting objects and their 6D poses. The developed method has been verified through extensive experiments on different datasets. Experimental results confirmed that the proposed system achieves improvements over state-of-the-art methods in terms of surface reconstruction and object pose prediction. Our code and video are available at https://sites.google.com/view/object-rpe. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:10
相关论文
共 32 条
[1]  
[Anonymous], 2015, ACS SYM SER
[2]  
Antonello M, 2018, IEEE INT CONF ROBOT, P1855
[3]  
Canelhas DR, 2013, IEEE INT C INT ROBOT, P3671, DOI 10.1109/IROS.2013.6696880
[4]   Coarse filters for shape matching [J].
Corney, J ;
Rea, H ;
Clark, D ;
Pritchard, J ;
Breaks, M ;
MacLeod, R .
IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2002, 22 (03) :65-74
[5]   BundleFusion: Real-Time Globally Consistent 3D Reconstruction Using On-the-Fly Surface Reintegration [J].
Dai, Angela ;
Niessner, Matthias ;
Zollhofer, Michael ;
Izadi, Shahram ;
Theobalt, Christian .
ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (03)
[6]  
Fuchs S., 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2010), P4862, DOI 10.1109/IROS.2010.5651046
[7]  
Germann M, 2007, 3DIM 2007: SIXTH INTERNATIONAL CONFERENCE ON 3-D DIGITAL IMAGING AND MODELING, PROCEEDINGS, P81
[8]   Volumetric Instance-Aware Semantic Mapping and 3D Object Discovery [J].
Grinvald, Margarita ;
Furrer, Fadri ;
Novkovic, Tonci ;
Chung, Jen Jen ;
Cadena, Cesar ;
Siegwart, Roland ;
Nieto, Juan .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (03) :3037-3044
[9]  
Hermans A, 2014, IEEE INT CONF ROBOT, P2631, DOI 10.1109/ICRA.2014.6907236
[10]  
Hoang D.-C., 2019, EUR C MOB ROB ECMR, P1