An Object Recognition Grasping Approach Using Proximal Policy Optimization With YOLOv5

被引:2
作者
Zheng, Qingchun [1 ]
Peng, Zhi [2 ]
Zhu, Peihao [1 ]
Zhao, Yangyang [3 ]
Zhai, Ran [2 ]
Ma, Wenpeng [4 ]
机构
[1] Tianjin Univ Technol, Sch Mech Engn, Tianjin Key Lab Adv Mechatron Syst Design & Intell, Tianjin 300384, Peoples R China
[2] Tianjin Univ Technol, Sch Mech Engn, Tianjin 300384, Peoples R China
[3] Tianjin Univ Technol, Sch Comp Sci & Engn, Tianjin 300384, Peoples R China
[4] Tianjin Univ Technol, Natl Demonstrat Ctr Expt Mech & Elect Engn Educ, Tianjin 300384, Peoples R China
基金
中国国家自然科学基金;
关键词
Manipulators; Grasping; Robots; Wheels; Kinematics; Deep learning; Reinforcement learning; Deep reinforcement learning; manipulator; object grasping; proximal policy optimization; YOLOv5;
D O I
10.1109/ACCESS.2023.3305339
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Aiming at the problems of traditional grasping methods for mobile manipulators, such as single application scenarios, low accuracy, and complex grasping tasks, this paper proposes an object recognition grasping approach using Proximal Policy Optimization (PPO) with You Only Look Once v5 (YOLOv5), which combines a vision recognition algorithm with a deep reinforcement learning algorithm to achieve object recognition grasping. First, YOLOv5 is adopted to identify the object and obtain the location information. Second, the PPO algorithm is used for object grasping to obtain the grasping strategy. Third, the PPO algorithm is compared with the Soft Actor-Critic (SAC) and Trust Region Policy Optimization (TRPO) algorithms in batches 16 and 128, respectively. The average reward training results of the PPO, SAC, and TRPO algorithms are obtained in our work. Experimental results show that the proposed method, in terms of object recognition speed, outperforms the original YOLOv4 model. The YOLOv5 model achieves 96% precision on our own built recognition dataset, which has higher detection precision and lower hardware requirements than the YOLOv4 model. Our proposed method outperforms SAC and TRPO algorithms in object grasping, and the average reward of the PPO algorithm is improved by 93.3% and 41% compared to SAC and TRPO algorithms, respectively. Finally, through the comparison of ablation experiments, our method has the highest accuracy and mean average precision (mAP)@0.5 value of 92.3%. We demonstrate in actual physical experiments that the grasping success rate under our proposed approach reaches 100%, providing a new research strategy for object grasping by the robot manipulator.
引用
收藏
页码:87330 / 87343
页数:14
相关论文
共 39 条
  • [1] Object Detection Recognition and Robot Grasping Based on Machine Learning: A Survey
    Bai, Qiang
    Li, Shaobo
    Yang, Jing
    Song, Qisong
    Li, Zhiang
    Zhang, Xingxing
    [J]. IEEE ACCESS, 2020, 8 : 181855 - 181879
  • [2] Cao GQ, 2022, J MECH SCI TECHNOL, V36, P2019
  • [3] Deep reinforcement learning based moving object grasping
    Chen, Pengzhan
    Lu, Weiqing
    [J]. INFORMATION SCIENCES, 2021, 565 : 62 - 76
  • [4] An Improved Lightweight YOLOv5 Algorithm for Detecting Strawberry Diseases
    Chen, Shunlong
    Liao, Yinghua
    Lin, Feng
    Huang, Bo
    [J]. IEEE ACCESS, 2023, 11 : 54080 - 54092
  • [5] Soft Actor-Critic for Navigation of Mobile Robots
    de Jesus, Junior Costa
    Kich, Victor Augusto
    Kolling, Alisson Henrique
    Grando, Ricardo Bedin
    Cuadros, Marco Antonio de Souza Leite
    Gamarra, Daniel Fernando Tello
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 102 (02)
  • [6] Geometry and kinematics of the Mecanum wheel
    Gfrerrer, A.
    [J]. COMPUTER AIDED GEOMETRIC DESIGN, 2008, 25 (09) : 784 - 791
  • [7] Swin-Transformer-Enabled YOLOv5 with Attention Mechanism for Small Object Detection on Satellite Images
    Gong, Hang
    Mu, Tingkui
    Li, Qiuxia
    Dai, Haishan
    Li, Chunlai
    He, Zhiping
    Wang, Wenjing
    Han, Feng
    Tuniyazi, Abudusalamu
    Li, Haoyang
    Lang, Xuechan
    Li, Zhiyuan
    Wang, Bin
    [J]. REMOTE SENSING, 2022, 14 (12)
  • [8] Proximal Policy Optimization With Policy Feedback
    Gu, Yang
    Cheng, Yuhu
    Chen, C. L. Philip
    Wang, Xuesong
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (07): : 4600 - 4610
  • [9] Guan Z., 2022, WIRELESS COMMUN MOBI, P1
  • [10] ALLIANCE-ROS: A Software Framework on ROS for Fault-Tolerant and Cooperative Mobile Robots
    Guo Zhongyuan
    Yang Wenjing
    Li Minglong
    Yi Xiaodong
    Cai Zhongxuan
    Wang Yanzhen
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2018, 27 (03) : 467 - 475