Robot multi-action cooperative grasping strategy based on deep reinforcement learning

被引:0
作者
He, Huiteng [1 ]
Zhou, Yong [1 ]
Hu, Kaixiong [1 ]
Li, Weidong [2 ]
机构
[1] School of Transportation and Logistics Engineering, Wuhan University of Technology, Wuhan
[2] School of Mechanical Engineering, University of Shanghai for Science and Technology, Shanghai
来源
Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS | 2024年 / 30卷 / 05期
基金
中国国家自然科学基金;
关键词
3D point cloud; deep reinforcement learning; multi-action collaboration; robot grasping;
D O I
10.13196/j.cims.2023.0280
中图分类号
学科分类号
摘要
To address the problem of decreased grasping success rates in complex scenarios where there are obvious obstructions and overlapping objects, a multi-view grasping and pushing collaborative strategy based on Deep Reinforcement Learning was proposed. This strategy utilized 3D point cloud obtained from an RGB-D camera as a state input, and used two deep Q-network algorithms to fit the grasping and pushing strategies separately. Through the reasonable design of the reward function, the coordination between grasping and pushing was learnt to change the distribution of objects through pushing actions and provide favorable conditions for grasping. Furthermore, to address the slow training speed resulting from the complexity of action space, a normal mask-based action space optimization strategy was proposed by limiting the exploration space of grasping and pushing actions based on prior knowledge of estimated normal direction. Finally, experiments were conducted with UR5 robot in the real world to verify the effectiveness of the proposed cooperative grasping strategy. © 2024 CIMS. All rights reserved.
引用
收藏
页码:1789 / 1797
页数:8
相关论文
共 25 条
  • [1] WANG Ke, ZHANG Hui, CAO Yihong, Et al., Intelligent robots and key technologies for pharmaceutical production[J ], Computer Integrated Manufacturing Systems, 28, 7, pp. 1981-1995, (2022)
  • [2] MENG Minghui, ZHOU Chuande, CHEN Libin, Et al., A review of the research and development of industrial robots, 50, (2016)
  • [3] SHIMOGA K B., Robot grasp synthesis algorithms
  • [4] A survey, The International Journal of Robotics Research, 15, 3, pp. 230-266, (1996)
  • [5] BOHG J, MORALES A, ASEOUR T, Et al., Data-driven grasp synthesis A survey, IEEE Transactions on robotics, 30, 2, pp. 289-309, (2013)
  • [6] QIN Zhiqiang, ZHAO Xifang, LI Zexiang, Et al., Grasping force analysis and optimization for robotic multifingered manipulation, Journal of Mechanical Engineering, 3, pp. 8-12, (2000)
  • [7] MAO Lmgbo, SHI Jmlong, ZHOU Zhiqiang, Et al., Robot grasping method based on single view key point voting [J], Computer Integrated Manufacturing Systems, 29, 11, pp. 3572-3581, (2023)
  • [8] SCHAAL S., Is imitation learning the route to humanoid robots? [J ], Trends in Cognitive Sciences, 3, 6, pp. 233-242, (1999)
  • [9] LIN Jiahao, ZHANG Zongzhang, JIANG Chong, Et al., A survey of imitation learning based on generative adversarial nets[j], Chinese Journal of Computers, 43, 2, pp. 326-351, (2020)
  • [10] LI Shuailong, ZHANG Huiwen, ZHOU Weijia, Review of imitation learning methods and its application in robotics[j], Computer Engineering and Applications, 55, 4, pp. 17-30, (2019)