A Segmentation-Driven Approach for 6D Object Pose Estimation in the Crowd

被引:0
作者
Bi, Sheng [1 ]
Chai, Ziqi [1 ]
Liu, Chao [1 ]
Xiong, Zhenhua [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Mech Engn, Shanghai, Peoples R China
来源
2019 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM) | 2019年
基金
中国国家自然科学基金;
关键词
D O I
10.1109/aim.2019.8868606
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The task of estimating the 6D pose of object instances in the crowd (scenes with multiple object instances, severe foreground occlusions and background distractors), has been a research hotspot in recent years since it is very common in industrial applications. In this work, we present a segmentation-driven approach to recover the 6D object pose in the crowd. Firstly, a convolution neural network framework Mask R-CNN is applied to segment masks and bounding boxes of target object instances from the scene image in this stage. Then, the bounding boxes are segmented into smaller patches with slide windows. After that, a Sparse Auto Encoder is employed to extract invariant features of these patches, and we can obtain several candidate rough poses by Hough Voting. Finally, Iterative Closest Point (ICP) method is used to refine the 6D object pose for a better result. We tested our approach on the commonly used LINEMOD dataset [1]. Experimental results show that our approach achieves high accuracy and robustness under foreground occlusions and background distractors.
引用
收藏
页码:19 / 24
页数:6
相关论文
共 28 条
[1]  
Aldoma A., 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), P585, DOI 10.1109/ICCVW.2011.6130296
[2]  
[Anonymous], 2008, Conference on Computer Vision and Pattern Recognition
[3]   Speeded-Up Robust Features (SURF) [J].
Bay, Herbert ;
Ess, Andreas ;
Tuytelaars, Tinne ;
Van Gool, Luc .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2008, 110 (03) :346-359
[4]   HIERARCHICAL CHAMFER MATCHING - A PARAMETRIC EDGE MATCHING ALGORITHM [J].
BORGEFORS, G .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1988, 10 (06) :849-865
[5]   Uncertainty-Driven 6D Pose Estimation of Objects and Scenes from a Single RGB Image [J].
Brachmann, Eric ;
Michel, Frank ;
Krull, Alexander ;
Yang, Michael Ying ;
Gumhold, Stefan ;
Rother, Carsten .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3364-3372
[6]  
Brachmann E, 2014, LECT NOTES COMPUT SC, V8690, P536, DOI 10.1007/978-3-319-10605-2_35
[7]   Rotational Subgroup Voting and Pose Clustering for Robust 3D Object Recognition [J].
Buch, Anders Glent ;
Kiforenko, Lilita ;
Kraft, Dirk .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4137-4145
[8]  
Chai Z., 2018, 2018 25 INT C MECH M, P1
[9]   Recovering 6D Object Pose and Predicting Next-Best-View in the Crowd [J].
Doumanoglou, Andreas ;
Kouskouridas, Rigas ;
Malassiotis, Sotiris ;
Kim, Tae-Kyun .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3583-3592
[10]  
Drost B., 2010, 2010 IEEE COMP SOC C, DOI DOI 10.1109/CVPR.2010.5540108