FAGD-Net: Feature-Augmented Grasp Detection Network Based on Efficient Multi-Scale Attention and Fusion Mechanisms

被引:3
作者
Zhong, Xungao [1 ,2 ]
Liu, Xianghui [1 ]
Gong, Tao [1 ]
Sun, Yuan [1 ,2 ]
Hu, Huosheng [3 ]
Liu, Qiang [4 ]
机构
[1] Xiamen Univ Technol, Sch Elect Engn & Automat, Xiamen 361024, Peoples R China
[2] Xiamen Key Lab Frontier Elect Power Equipment & In, Xiamen 361024, Peoples R China
[3] Univ Essex, Sch Comp Sci & Elect Engn, Colchester CO4 3SQ, England
[4] Univ Bristol, Fac Engn, Sch Engn Math & Technol, Beacon House,Queens Rd, Bristol BS8 1QU, England
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 12期
基金
中国国家自然科学基金;
关键词
robotic grasping; deep network model; attention mechanism; feature fusion;
D O I
10.3390/app14125097
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Grasping robots always confront challenges such as uncertainties in object size, orientation, and type, necessitating effective feature augmentation to improve grasping detection performance. However, many prior studies inadequately emphasize grasp-related features, resulting in suboptimal grasping performance. To address this limitation, this paper proposes a new grasping approach termed the Feature-Augmented Grasp Detection Network (FAGD-Net). The proposed network incorporates two modules designed to enhance spatial information features and multi-scale features. Firstly, we introduce the Residual Efficient Multi-Scale Attention (Res-EMA) module, which effectively adjusts the importance of feature channels while preserving precise spatial information within those channels. Additionally, we present a Feature Fusion Pyramidal Module (FFPM) that serves as an intermediary between the encoder and decoder, effectively addressing potential oversights or losses of grasp-related features as the encoder network deepens. As a result, FAGD-Net achieved advanced levels of grasping accuracy, with 98.9% and 96.5% on the Cornell and Jacquard datasets, respectively. The grasp detection model was deployed on a physical robot for real-world grasping experiments, where we conducted a series of trials in diverse scenarios. In these experiments, we randomly selected various unknown household items and adversarial objects. Remarkably, we achieved high success rates, with a 95.0% success rate for single-object household items, 93.3% for multi-object scenarios, and 91.0% for cluttered scenes.
引用
收藏
页数:26
相关论文
共 42 条
  • [1] End-to-end Trainable Deep Neural Network for Robotic Grasp Detection and Semantic Segmentation from RGB
    Ainetter, Stefan
    Fraundorfer, Friedrich
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13452 - 13458
  • [2] Asif U, 2019, AAAI CONF ARTIF INTE, P8085
  • [3] Efficient Grasp Detection Network With Gaussian-Based Grasp Representation for Robotic Manipulation
    Cao, Hu
    Chen, Guang
    Li, Zhijun
    Feng, Qian
    Lin, Jianjie
    Knoll, Alois
    [J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2023, 28 (03) : 1384 - 1394
  • [4] Residual Squeeze-and-Excitation Network with Multi-scale Spatial Pyramid Module for Fast Robotic Grasping Detection
    Cao, Hu
    Chen, Guang
    Li, Zhijun
    Lin, Jianjie
    Knoll, Alois
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13445 - 13451
  • [5] Edge-Dependent Efficient Grasp Rectangle Search in Robotic Grasp Detection
    Chen, Lu
    Huang, Panfeng
    Li, Yuanhao
    Meng, Zhongjie
    [J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2021, 26 (06) : 2922 - 2931
  • [6] Real-World Multiobject, Multigrasp Detection
    Chu, Fu-Jen
    Xu, Ruinian
    Vela, Patricio A.
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04): : 3355 - 3362
  • [7] Depierre A, 2018, IEEE INT C INT ROBOT, P3511, DOI 10.1109/IROS.2018.8593950
  • [8] JAUNet: A U-Shape Network with Jump Attention for Semantic Segmentation of Road Scenes
    Fan, Zhiyong
    Liu, Kailai
    Hou, Jianmin
    Yan, Fei
    Zang, Qiang
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (03):
  • [9] Light-Weight Convolutional Neural Networks for Generative Robotic Grasping
    Fu, Kui
    Dang, Xuanju
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (04) : 6696 - 6707
  • [10] Identity Mappings in Deep Residual Networks
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 630 - 645