Focal segmentation for robust 6D object pose estimation

Cited by: 0
Authors
Yuning Ye
Hanhoon Park
Affiliations
[1] Pukyong National University, Department of Artificial Intelligence Convergence, Graduate School
[2] Pukyong National University, Division of Electronics and Communications Engineering
Source
Multimedia Tools and Applications | 2024 / Vol. 83
Keywords
Object pose estimation; Focal segmentation; Keypoint detection; Severe occlusion; Deep learning
DOI
Not available
Abstract
In augmented reality, 6D pose estimation of rigid objects remains challenging. Most previous 6D pose estimation methods train deep neural networks either to regress poses directly from input images or to predict the 2D locations of 3D keypoints; as a result, they are vulnerable to large occlusions. This study addresses 6D pose estimation from a single RGB image under severe occlusion. We propose a method that builds on PVNet and improves its performance. Like PVNet, our method regresses target object segments and pixel-wise direction vectors from an RGB image. The 2D locations of 3D keypoints are then computed from the direction vectors of the object pixels, and the 6D object pose is obtained using a PnP algorithm. However, accurately segmenting object pixels is difficult, particularly under severe occlusion. To address this, we propose a focal segmentation mechanism that ensures accurate and complete segmentation of occluded objects. Extensive experiments on the LINEMOD and LINEMOD-Occlusion datasets validate the effectiveness and superiority of our method. On average, our method improves the accuracy of PVNet by 1.09 and 5.14 in terms of the 2D reprojection error and ADD metric, respectively, without increasing the computational time.
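The keypoint step mentioned in the abstract (computing a 2D keypoint location from per-pixel direction vectors) can be illustrated with a small sketch. This is not the authors' implementation: PVNet is generally described as using RANSAC-based voting, whereas the code below swaps in a plain least-squares intersection of the pixel rays for brevity. The function name `vote_keypoint` and its arguments are hypothetical.

```python
import math

def vote_keypoint(pixels, directions):
    """Estimate one 2D keypoint as the least-squares intersection of lines.

    Each line passes through pixels[i] along directions[i]. This is a
    simplified stand-in for RANSAC-based voting, not the paper's method.
    """
    a11 = a12 = a22 = b1 = b2 = 0.0
    for (px, py), (dx, dy) in zip(pixels, directions):
        norm = math.hypot(dx, dy)
        dx, dy = dx / norm, dy / norm
        # Projector onto the line's normal direction: I - d d^T
        p11, p12, p22 = 1.0 - dx * dx, -dx * dy, 1.0 - dy * dy
        a11 += p11
        a12 += p12
        a22 += p22
        b1 += p11 * px + p12 * py
        b2 += p12 * px + p22 * py
    det = a11 * a22 - a12 * a12
    if abs(det) < 1e-12:
        raise ValueError("direction vectors are (nearly) parallel")
    # Solve the symmetric 2x2 system A x = b in closed form.
    return ((a22 * b1 - a12 * b2) / det, (a11 * b2 - a12 * b1) / det)

# Toy example: three object pixels whose vectors all point at (5, 3).
pixels = [(0.0, 0.0), (10.0, 0.0), (5.0, 10.0)]
directions = [(5.0, 3.0), (-5.0, 3.0), (0.0, -7.0)]
print(vote_keypoint(pixels, directions))  # approximately (5.0, 3.0)
```

With several such 2D keypoints and their known 3D model coordinates, a PnP solver (for example OpenCV's `cv2.solvePnP`) would then recover the 6D pose. Only pixels inside the predicted object mask contribute rays, which is why segmentation quality under occlusion matters for the final pose.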
Pages: 47563-47585
Number of pages: 22
Related papers
50 in total
  • [1] Focal segmentation for robust 6D object pose estimation
    Ye, Yuning
    Park, Hanhoon
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (16) : 47563 - 47585
  • [2] Segmentation-driven 6D Object Pose Estimation
    Hu, Yinlin
    Hugonot, Joachim
    Fua, Pascal
    Salzmann, Mathieu
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019 : 3380 - 3389
  • [3] PointPoseNet: Point Pose Network for Robust 6D Object Pose Estimation
    Chen, Wei
    Duan, Jinming
    Basevi, Hector
    Chang, Hyung Jin
    Leonardis, Ales
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020 : 2813 - 2822
  • [4] Robust 6D Object Pose Estimation in Cluttered Scenes using Semantic Segmentation and Pose Regression Networks
    Periyasamy, Arul Selvam
    Schwarz, Max
    Behnke, Sven
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018 : 6660 - 6666
  • [5] DoPose-6D dataset for object segmentation and 6D pose estimation
    Gouda, Anas
    Ghanem, Abraham
    Reining, Christopher
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2022 : 477 - 483
  • [6] On Evaluation of 6D Object Pose Estimation
    Hodan, Tomas
    Matas, Jiri
    Obdrzalek, Stephan
    COMPUTER VISION - ECCV 2016 WORKSHOPS, PT III, 2016, 9915 : 606 - 619
  • [7] A Segmentation-Driven Approach for 6D Object Pose Estimation in the Crowd
    Bi, Sheng
    Chai, Ziqi
    Liu, Chao
    Xiong, Zhenhua
    2019 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2019 : 19 - 24
  • [8] Robust 6D Object Pose Estimation by Learning RGB-D Features
    Tian, Meng
    Pan, Liang
    Ang, Marcelo H., Jr.
    Lee, Gim Hee
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020 : 6218 - 6224
  • [9] Single Shot 6D Object Pose Estimation
    Kleeberger, Kilian
    Huber, Marco F.
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020 : 6239 - 6245
  • [10] BOP: Benchmark for 6D Object Pose Estimation
    Hodan, Tomas
    Michel, Frank
    Brachmann, Eric
    Kehl, Wadim
    Buch, Anders Glent
    Kraft, Dirk
    Drost, Bertram
    Vidal, Joel
    Ihrke, Stephan
    Zabulis, Xenophon
    Sahin, Caner
    Manhardt, Fabian
    Tombari, Federico
    Kim, Tae-Kyun
    Matas, Jiri
    Rother, Carsten
    COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 19 - 35