Robotic Grasp Detection With 6-D Pose Estimation Based on Graph Convolution and Refinement

被引：4

作者：

Yu, Sheng ^{[1
]}

Zhai, Di-Hua ^{[1
,2
]}

Xia, Yuanqing ^{[1
,3
]}

Wang, Wei ^{[4
]}

Zhang, Chengyu ^{[4
]}

Zhao, Shiqi ^{[4
]}

机构：

[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China

[2] Beijing Inst Technol, Yangtze Delta Reg Acad, Jiaxing 314001, Peoples R China

[3] Zhongyuan Univ Technol, Zhengzhou 450007, Henan, Peoples R China

[4] China United Network Commun Corp Ltd, Res Inst, Beijing 100176, Peoples R China

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2024年 / 54卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Convolution network; grasp detection; pose estimation; robot; transformer;

D O I：

10.1109/TSMC.2024.3371580

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Six-dimensional (6-D) object pose estimation plays a critical role in robotic grasp, which performs extensive usage in manufacturing. The current state-of-the-art pose estimation techniques primarily depend on matching keypoints. Typically, these methods establish a correspondence between 2-D keypoints in an image and the corresponding ones in a 3-D object model. And then they use the PnP-RANSAC algorithm to determine the 6-D pose of the object. However, this approach is not end-to-end trainable and may encounter difficulties when applied to scenarios necessitating differentiable poses. When employing a direct end-to-end regression method, the outcomes are often inferior. To tackle the mentioned problems, we present GR6D, which is a keypoint-and graph-convolution-based neural network for differentiable pose estimation based on RGB-D data. First, we propose a multiscale fusion method that utilizes convolution and graph convolution to exploit information contained in RGB and depth images. Additionally, we propose a transformer-based pose refinement module to further adjust features from RGB images and point clouds. We evaluate GR6D on three datasets: 1) LINEMOD; 2) occlusion LINEMOD; and 3) YCB-Video dataset, and it outperforms most state-of-the-art methods. Finally, we apply GR6D to pose estimation and the robotic grasping task in the real world, manifesting superior performance.

引用

页码：3783 / 3795

页数：13

共 50 条

[11] A Depth Adaptive Feature Extraction and Dense Prediction Network for 6-D Pose Estimation in Robotic Grasping
Liu, Xuebing
Yuan, Xiaofang
Zhu, Qing
Wang, Yaonan
Feng, Mingtao
Zhou, Jiaming
Zhou, Zhen
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (02) : 2727 - 2737
[12] Efficient MSPSO Sampling for Object Detection and 6-D Pose Estimation in 3-D Scenes
Xing, Xuejun
Guo, Jianwei
Nan, Liangliang
Gu, Qingyi
Zhang, Xiaopeng
Yan, Dong-Ming
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (10) : 10281 - 10291
[13] Robotic Grasp Pose Detection Method Based on Multiscale Features
Wang, Zheng
Leng, Longlong
Zhou, Xianming
Zhao, Yanwei
INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2023, 20 (05)
[14] Sparse Convolution-Based 6D Pose Estimation for Robotic Bin-Picking With Point Clouds
Zhuang, Chungang
Niu, Wanhao
Wang, Hesheng
JOURNAL OF MECHANISMS AND ROBOTICS-TRANSACTIONS OF THE ASME, 2025, 17 (03):
[15] 6-D Object Pose Estimation Using Multiscale Point Cloud Transformer
Zhou, Guangliang
Wang, Deming
Yan, Yi
Liu, Chengju
Chen, Qijun
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
[16] Sca-pose: category-level 6D pose estimation with adaptive shape prior based on CNN and graph convolution
Zuo, Guoyu
Yu, Shan
Yu, Shuangyue
Liu, Hong
Zhao, Min
INTELLIGENT SERVICE ROBOTICS, 2025, 18 (02) : 351 - 361
[17] WireframePose: Monocular 6-D Pose Estimation of Metal Parts Based on Wireframe Extraction and Matching
Liu, Ze'An
Wang, Xuanyin
Pu, Bin
Tang, Jixiang
Sun, Jiaqi
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
[18] 6-D Object Pose Estimation Using Multiscale Point Cloud Transformer
Zhou, Guangliang
Wang, Deming
Yan, Yi
Liu, Chengju
Chen, Qijun
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
[19] 3D hand pose estimation algorithm based on cascaded features and graph convolution
Lin, Yi-lin
Lin, Shan-ling
Lin, Zhi-xian
CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2022, 37 (06) : 736 - 745
[20] Sparse Template-Based 6-D Pose Estimation of Metal Parts Using a Monocular Camera
He, Zaixing
Jiang, Zhiwei
Zhao, Xinyue
Zhang, Shuyou
Wu, Chenrui
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2020, 67 (01) : 390 - 401

← 1 2 3 4 5 →