FAGD-Net: Feature-Augmented Grasp Detection Network Based on Efficient Multi-Scale Attention and Fusion Mechanisms

被引：3

作者：

Zhong, Xungao ^{[1
,2
]}

Liu, Xianghui ^{[1
]}

Gong, Tao ^{[1
]}

Sun, Yuan ^{[1
,2
]}

Hu, Huosheng ^{[3
]}

Liu, Qiang ^{[4
]}

机构：

[1] Xiamen Univ Technol, Sch Elect Engn & Automat, Xiamen 361024, Peoples R China

[2] Xiamen Key Lab Frontier Elect Power Equipment & In, Xiamen 361024, Peoples R China

[3] Univ Essex, Sch Comp Sci & Elect Engn, Colchester CO4 3SQ, England

[4] Univ Bristol, Fac Engn, Sch Engn Math & Technol, Beacon House,Queens Rd, Bristol BS8 1QU, England

来源：

APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 12期

基金：

中国国家自然科学基金;

关键词：

robotic grasping; deep network model; attention mechanism; feature fusion;

D O I：

10.3390/app14125097

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

Grasping robots always confront challenges such as uncertainties in object size, orientation, and type, necessitating effective feature augmentation to improve grasping detection performance. However, many prior studies inadequately emphasize grasp-related features, resulting in suboptimal grasping performance. To address this limitation, this paper proposes a new grasping approach termed the Feature-Augmented Grasp Detection Network (FAGD-Net). The proposed network incorporates two modules designed to enhance spatial information features and multi-scale features. Firstly, we introduce the Residual Efficient Multi-Scale Attention (Res-EMA) module, which effectively adjusts the importance of feature channels while preserving precise spatial information within those channels. Additionally, we present a Feature Fusion Pyramidal Module (FFPM) that serves as an intermediary between the encoder and decoder, effectively addressing potential oversights or losses of grasp-related features as the encoder network deepens. As a result, FAGD-Net achieved advanced levels of grasping accuracy, with 98.9% and 96.5% on the Cornell and Jacquard datasets, respectively. The grasp detection model was deployed on a physical robot for real-world grasping experiments, where we conducted a series of trials in diverse scenarios. In these experiments, we randomly selected various unknown household items and adversarial objects. Remarkably, we achieved high success rates, with a 95.0% success rate for single-object household items, 93.3% for multi-object scenarios, and 91.0% for cluttered scenes.

引用

页数：26

共 42 条

[1] End-to-end Trainable Deep Neural Network for Robotic Grasp Detection and Semantic Segmentation from RGB
Ainetter, Stefan
Fraundorfer, Friedrich
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13452 - 13458
[2] Asif U, 2019, AAAI CONF ARTIF INTE, P8085
[3] Efficient Grasp Detection Network With Gaussian-Based Grasp Representation for Robotic Manipulation
Cao, Hu
Chen, Guang
Li, Zhijun
Feng, Qian
Lin, Jianjie
Knoll, Alois
[J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2023, 28 (03) : 1384 - 1394
[4] Residual Squeeze-and-Excitation Network with Multi-scale Spatial Pyramid Module for Fast Robotic Grasping Detection
Cao, Hu
Chen, Guang
Li, Zhijun
Lin, Jianjie
Knoll, Alois
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13445 - 13451
[5] Edge-Dependent Efficient Grasp Rectangle Search in Robotic Grasp Detection
Chen, Lu
Huang, Panfeng
Li, Yuanhao
Meng, Zhongjie
[J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2021, 26 (06) : 2922 - 2931
[6] Real-World Multiobject, Multigrasp Detection
Chu, Fu-Jen
Xu, Ruinian
Vela, Patricio A.
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04): : 3355 - 3362
[7] Depierre A, 2018, IEEE INT C INT ROBOT, P3511, DOI 10.1109/IROS.2018.8593950
[8] JAUNet: A U-Shape Network with Jump Attention for Semantic Segmentation of Road Scenes
Fan, Zhiyong
Liu, Kailai
Hou, Jianmin
Yan, Fei
Zang, Qiang
[J]. APPLIED SCIENCES-BASEL, 2023, 13 (03):
[9] Light-Weight Convolutional Neural Networks for Generative Robotic Grasping
Fu, Kui
Dang, Xuanju
[J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (04) : 6696 - 6707
[10] Identity Mappings in Deep Residual Networks
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 630 - 645

← 1 2 3 4 5 →