Faster and Finer Pose Estimation for Object Pool in a Single RGB Image

Cited by: 1
Authors
Aing, Lee [1]
Lie, Wen-Nung [1, 2, 3]
Chiang, Jui-Chiu [1, 2]
Affiliations
[1] Natl Chung Cheng Univ CCU, Dept Elect Engn, Chiayi, Taiwan
[2] Natl Chung Cheng Univ CCU, Ctr Innovat Res Aging Soc CIRAS, Chiayi, Taiwan
[3] Natl Chung Cheng Univ CCU, Adv Inst Mfg High Tech Innovat AIM HI, Chiayi, Taiwan
Source
2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP) | 2021
Keywords
6DoF; object pose estimation; bottom-up approach
DOI
10.1109/VCIP53242.2021.9675316
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Estimating the 6DoF pose parameters of multiple object instances accurately and quickly is an important problem in robotics and computer vision. Although some bottom-up methods have been proposed to estimate the poses of multiple instances simultaneously, their accuracy still falls short of state-of-the-art top-down methods, and their processing speed remains too slow for practical applications. In this paper, we present a faster and finer bottom-up approach, based on a deep convolutional neural network, that estimates the poses of an object pool even when multiple instances of the same object category are heavily occluded or overlapping. Several techniques are proposed and combined for this purpose: prediction of a semantic segmentation map, a multiple keypoint vector field, and a 3D coordinate map, together with diagonal graph clustering. Experimental results and ablation studies on the well-known Occlusion LINEMOD dataset show that the proposed system achieves comparable accuracy at a speed of 24.7 frames per second for up to 7 objects.
Pages: 5