Robotic Grasp Detection With 6-D Pose Estimation Based on Graph Convolution and Refinement

被引:4
|
作者
Yu, Sheng [1 ]
Zhai, Di-Hua [1 ,2 ]
Xia, Yuanqing [1 ,3 ]
Wang, Wei [4 ]
Zhang, Chengyu [4 ]
Zhao, Shiqi [4 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Yangtze Delta Reg Acad, Jiaxing 314001, Peoples R China
[3] Zhongyuan Univ Technol, Zhengzhou 450007, Henan, Peoples R China
[4] China United Network Commun Corp Ltd, Res Inst, Beijing 100176, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolution network; grasp detection; pose estimation; robot; transformer;
D O I
10.1109/TSMC.2024.3371580
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Six-dimensional (6-D) object pose estimation plays a critical role in robotic grasp, which performs extensive usage in manufacturing. The current state-of-the-art pose estimation techniques primarily depend on matching keypoints. Typically, these methods establish a correspondence between 2-D keypoints in an image and the corresponding ones in a 3-D object model. And then they use the PnP-RANSAC algorithm to determine the 6-D pose of the object. However, this approach is not end-to-end trainable and may encounter difficulties when applied to scenarios necessitating differentiable poses. When employing a direct end-to-end regression method, the outcomes are often inferior. To tackle the mentioned problems, we present GR6D, which is a keypoint-and graph-convolution-based neural network for differentiable pose estimation based on RGB-D data. First, we propose a multiscale fusion method that utilizes convolution and graph convolution to exploit information contained in RGB and depth images. Additionally, we propose a transformer-based pose refinement module to further adjust features from RGB images and point clouds. We evaluate GR6D on three datasets: 1) LINEMOD; 2) occlusion LINEMOD; and 3) YCB-Video dataset, and it outperforms most state-of-the-art methods. Finally, we apply GR6D to pose estimation and the robotic grasping task in the real world, manifesting superior performance.
引用
收藏
页码:3783 / 3795
页数:13
相关论文
共 50 条
  • [11] A Depth Adaptive Feature Extraction and Dense Prediction Network for 6-D Pose Estimation in Robotic Grasping
    Liu, Xuebing
    Yuan, Xiaofang
    Zhu, Qing
    Wang, Yaonan
    Feng, Mingtao
    Zhou, Jiaming
    Zhou, Zhen
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (02) : 2727 - 2737
  • [12] Efficient MSPSO Sampling for Object Detection and 6-D Pose Estimation in 3-D Scenes
    Xing, Xuejun
    Guo, Jianwei
    Nan, Liangliang
    Gu, Qingyi
    Zhang, Xiaopeng
    Yan, Dong-Ming
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (10) : 10281 - 10291
  • [13] Robotic Grasp Pose Detection Method Based on Multiscale Features
    Wang, Zheng
    Leng, Longlong
    Zhou, Xianming
    Zhao, Yanwei
    INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2023, 20 (05)
  • [14] Sparse Convolution-Based 6D Pose Estimation for Robotic Bin-Picking With Point Clouds
    Zhuang, Chungang
    Niu, Wanhao
    Wang, Hesheng
    JOURNAL OF MECHANISMS AND ROBOTICS-TRANSACTIONS OF THE ASME, 2025, 17 (03):
  • [15] 6-D Object Pose Estimation Using Multiscale Point Cloud Transformer
    Zhou, Guangliang
    Wang, Deming
    Yan, Yi
    Liu, Chengju
    Chen, Qijun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [16] Sca-pose: category-level 6D pose estimation with adaptive shape prior based on CNN and graph convolution
    Zuo, Guoyu
    Yu, Shan
    Yu, Shuangyue
    Liu, Hong
    Zhao, Min
    INTELLIGENT SERVICE ROBOTICS, 2025, 18 (02) : 351 - 361
  • [17] WireframePose: Monocular 6-D Pose Estimation of Metal Parts Based on Wireframe Extraction and Matching
    Liu, Ze'An
    Wang, Xuanyin
    Pu, Bin
    Tang, Jixiang
    Sun, Jiaqi
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [18] 6-D Object Pose Estimation Using Multiscale Point Cloud Transformer
    Zhou, Guangliang
    Wang, Deming
    Yan, Yi
    Liu, Chengju
    Chen, Qijun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [19] 3D hand pose estimation algorithm based on cascaded features and graph convolution
    Lin, Yi-lin
    Lin, Shan-ling
    Lin, Zhi-xian
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2022, 37 (06) : 736 - 745
  • [20] Sparse Template-Based 6-D Pose Estimation of Metal Parts Using a Monocular Camera
    He, Zaixing
    Jiang, Zhiwei
    Zhao, Xinyue
    Zhang, Shuyou
    Wu, Chenrui
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2020, 67 (01) : 390 - 401