Focal segmentation for robust 6D object pose estimation

被引:0
作者
Yuning Ye
Hanhoon Park
机构
[1] Pukyong National University,Department of Artificial Intelligence Convergence, Graduate School
[2] Pukyong National University,Division of Electronics and Communications Engineering
来源
Multimedia Tools and Applications | 2024年 / 83卷
关键词
Object pose estimation; Focal segmentation; Keypoint detection; Severe occlusion; Deep learning;
D O I
暂无
中图分类号
学科分类号
摘要
In the field of augmented reality, 6D pose estimation of rigid objects poses limitations and challenges. Most of the previous 6D pose estimation methods have trained deep neural networks to directly regress poses from input images or predict the 2D locations of 3D keypoints for pose estimation; thus, they are vulnerable to large occlusion. This study addresses the challenge of 6D pose estimation from a single RGB image under severe occlusion. A novel method is proposed that is based on PVNet but improves its performance. Similar to PVNet, our method regresses target object segments and pixel-wise direction vectors from an RGB image. Subsequently, the 2D locations of 3D keypoints are computed using the direction vectors of object pixels, and the 6D object pose is obtained using a PnP algorithm. However, accurate segmentation of object pixels is difficult, particularly under severe occlusion. To this end, a focal segmentation mechanism is proposed that ensures accurate complete segmentation of occluded objects. Extensive experiments on LINEMOD, LINEMOD-Occlusion datasets validate the effectiveness and superiority of our method. Our method improves the accuracy of PVNet by 1.09 and 5.14 on average in terms of the 2D reprojection error and ADD metric, respectively, without increasing the computational time.
引用
收藏
页码:47563 / 47585
页数:22
相关论文
共 25 条
[1]  
Drummond T(2002)Real-time visual tracking of complex structures IEEE Trans Pattern Anal Mach Intell 24 932-946
[2]  
Cipolla R(2020)Hybrid camera pose estimation with online partitioning for SLAM IEEE Robot Autom Lett 5 1453-1460
[3]  
Li X(2022)PVNet: Pixel-wise voting network for 6DoF object pose estimation IEEE Trans Pattern Anal Mach Intell 44 3212-3223
[4]  
Ling H(2020)Focal loss for dense object detection IEEE Trans Pattern Anal Mach Intell 42 318-327
[5]  
Peng S(2009)EPnP: An accurate O(n) solution to the PnP problem Int J Comput Vis 81 155-166
[6]  
Zhou X(2012)Gradient response maps for real-time detection of textureless objects IEEE Trans Pattern Anal Mach Intell 34 876-888
[7]  
Liu Y(undefined)undefined undefined undefined undefined-undefined
[8]  
Lin H(undefined)undefined undefined undefined undefined-undefined
[9]  
Huang Q(undefined)undefined undefined undefined undefined-undefined
[10]  
Bao H(undefined)undefined undefined undefined undefined-undefined