Optimizing RGB-D Fusion for Accurate 6DoF Pose Estimation

被引:13
|
作者
Saadi, Lounes [1 ,2 ]
Besbes, Bassem [3 ]
Kramm, Sebastien [1 ]
Bensrhair, Abdelaziz [1 ]
机构
[1] INSA Rouen, Lab Informat Traitement Informat & Syst LITIS, F-76000 Rouen, France
[2] DIOTA, Vis Res Dept, F-91300 Massy, France
[3] DIOTA, Vis Res Dept, Rouen, France
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2021年 / 6卷 / 02期
关键词
2D & 3D accuracy; keypoint detector; object's localization; refinement; RGB-D; TRACKING;
D O I
10.1109/LRA.2021.3061347
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Today's standard object localization systems often do not meet the industry's demands regarding 2D and 3D accuracy for digital manufacturing applications. Two targets are considered: digital-based assistance and robotic inspection. 2D precision is necessary to provide accurate assistance whilst 3D precision is crucial to get an inspection as much close to the object's true state. In this letter, we propose a new pose estimation system which ensures highest both 2D and 3D precision. While most RGB-based solutions focus on obtaining best 2D accuracy, RGBD-based systems mainly use depth information to maximize 3D accuracy. Very few solutions propose a way to jointly optimize both constraints. Nonetheless, pose estimation should produce high accuracy as a slight 2D error can result in a large 3D error (and inversely). To address this problem, we present a new system which uses RGB-D to fully take advantage of the depth information. A new 3D primitive is proposed in order to minimize the effect of RGB-D noise on 3D coordinates accuracy. CNN Keypoint Detector (KPD) method is used to localize this new primitive in order to achieve pose estimation task. Finally, we propose a novel refinement method which ensures optimal precision as both RGB and depth information are fused. We show the results of our experimentation on widely-used and challenging Linemod and Occlusion datasets. We demonstrate that our solution outperforms state-of-the-art methods when taking into account both 3D and 2D accuracy.
引用
收藏
页码:2413 / 2420
页数:8
相关论文
共 50 条
  • [1] 6DoF Pose Estimation of Transparent Object from a Single RGB-D Image
    Xu, Chi
    Chen, Jiale
    Yao, Mengyang
    Zhou, Jun
    Zhang, Lijun
    Liu, Yi
    SENSORS, 2020, 20 (23) : 1 - 19
  • [2] Semantic Segmentation and 6DoF Pose Estimation using RGB-D Images and Deep Neural Networks
    Van Luan Tran
    Lin, Huei-Yung
    PROCEEDINGS OF 2021 IEEE 30TH INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2021,
  • [3] Using a RGB-D camera for 6DoF SLAM
    Munoz, Jose
    Pastor, Daniel
    Gil, Pablo
    Puente, Santiago
    Cazorla, Miguel
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2012, 248 : 143 - +
  • [4] Multimodal Cue Integration through Hypotheses Verification for RGB-D Object Recognition and 6DOF Pose Estimation
    Aldoma, A.
    Tombari, F.
    Prankl, J.
    Richtsfeld, A.
    Di Stefano, L.
    Vincze, M.
    2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2013, : 2104 - 2111
  • [5] YCB-M: A Multi-Camera RGB-D Dataset for Object Recognition and 6DoF Pose Estimation
    Grenzdoerffer, Till
    Guenther, Martin
    Hertzberg, Joachim
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 3650 - 3656
  • [6] An RGB-D Refinement Solution for Accurate Object Pose Estimation
    Saadi, Lounes
    Besbes, Bassem
    Kramm, Sebastien
    Bensrhair, Abdelaziz
    2021 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY ADJUNCT PROCEEDINGS (ISMAR-ADJUNCT 2021), 2021, : 189 - 194
  • [7] Online 3D Reconstruction and 6-DoF Pose Estimation for RGB-D Sensors
    Lim, Hyon
    Lim, Jongwoo
    Kim, H. Jin
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 238 - 254
  • [8] A novel 6DoF pose estimation method using transformer fusion
    Wang, Huafeng
    Zhang, Haodu
    Liu, Wanquan
    Hu, Zhimin
    Gao, Haoqi
    Lv, Weifeng
    Gu, Xianfeng
    PATTERN RECOGNITION, 2025, 162
  • [9] A RGB-D feature fusion network for occluded object 6D pose estimation
    Song, Yiwei
    Tang, Chunhui
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (8-9) : 6309 - 6319
  • [10] EFN6D: an efficient RGB-D fusion network for 6D pose estimation
    Wang Y.
    Jiang X.
    Fujita H.
    Fang Z.
    Qiu X.
    Chen J.
    Journal of Ambient Intelligence and Humanized Computing, 2024, 15 (01) : 75 - 88