Real-Time Seamless Single Shot 6D Object Pose Prediction

被引:677
作者
Tekin, Bugra [1 ]
Sinha, Sudipta N. [2 ]
Fua, Pascal [1 ]
机构
[1] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
[2] Microsoft Res, Redmond, WA USA
来源
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年
基金
瑞士国家科学基金会;
关键词
RECOGNITION;
D O I
10.1109/CVPR.2018.00038
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a single-shot approach for simultaneously detecting an object in an RGB image and predicting its 6D pose without requiring multiple stages or having to examine multiple hypotheses. Unlike a recently proposed single-shot technique for this task [10] that only predicts an approximate 6D pose that must then be refined, ours is accurate enough not to require additional post-processing. As a result, it is much faster 50 fps on a Titan X (Pascal) GPU and more suitable for real-time processing. The key component of our method is a new CNN architecture inspired by, [27, 28]that directly predicts the 2D image locations of the projected vertices of the object's 3D bounding box. The object's 6D pose is then estimated using a PnP algorithm. For single object and multiple object pose estimation on the LINEMOD and OCCLUSION datasets, our approach substantially outperforms other recent CNN-based approaches [10, 25] when they are all used without post processing. During post-processing, a pose refinement step can be used to boost the accuracy of these two methods, but at 10 fps or less, they are much slower than our method.
引用
收藏
页码:292 / 301
页数:10
相关论文
共 40 条
[1]  
[Anonymous], 2016, 3DV
[2]  
[Anonymous], 2012, ACCV
[3]  
[Anonymous], 2015, CVPR
[4]  
[Anonymous], 2016, P IEEE C COMP VIS PA
[5]  
[Anonymous], 2014, ECCV
[6]  
[Anonymous], CVPR
[7]  
[Anonymous], CVPR
[8]  
[Anonymous], 2015, P IEEE C COMP VIS PA
[9]  
[Anonymous], 2016, CVPR
[10]  
[Anonymous], 2017, ICCV