Living Object Grasping Using Two-Stage Graph Reinforcement Learning

被引:10
作者
Hu, Zhe [1 ,2 ]
Zheng, Yu [2 ]
Pan, Jia [3 ]
机构
[1] City Univ Hong Kong, Dept Biomed Engn, Kowloon Tong, Hong Kong 999077, Peoples R China
[2] Tencent Robot X, Shenzhen 518057, Guangdong, Peoples R China
[3] Univ Hong Kong, Dept Comp Sci, Pokfulam, Hong Kong, Peoples R China
关键词
Deep learning in grasping and manipulation; dexterous manipulation; grasping; in-hand manipulation; reinforcement learning;
D O I
10.1109/LRA.2021.3060636
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Living objects are hard to grasp because they can actively dodge and struggle by writhing or deforming while or even prior to being contacted and modeling or predicting their responses to grasping is extremely difficult. This letter presents an algorithm based on reinforcement learning (RL) to attack this challenging problem. Considering the complexity of living object grasping, we divide the whole task into pre-grasp and in-hand stages and let the algorithm switch between the stages automatically. The pre-grasp stage is aimed at finding a good pose of a robot hand approaching a living object for performing a grasp. Dense reward functions are proposed for facilitating the learning of right hand actions based on the poses of both hand and object. Since an object held in hand may struggle to escape, the robot hand needs to adjust its configuration and respond correctly to the object's movement. Hence, the goal of the in-hand stage is to determine an appropriate adjustment of finger configuration in order for the robot hand to keep holding the object. At this stage, we treat the robot hand as a graph and use the graph convolutional network (GCN) to determine the hand action. We test our algorithm with both simulation and real experiments, which show its good performance in living object grasping. More results are available on our website: https://sites.google.com/view/graph-rl.
引用
收藏
页码:1950 / 1957
页数:8
相关论文
共 18 条
[1]  
[Anonymous], 2018, ARXIV180110247
[2]  
Baier-Löwenstein T, 2007, 2007 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-9, P1557
[3]  
Ciocarlie M, 2007, 2007 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-9, P3276
[4]   Hand Posture Subspaces for Dexterous Robotic Grasping [J].
Ciocarlie, Matei T. ;
Allen, Peter K. .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2009, 28 (07) :851-867
[5]  
Kalashnikov D., 2018, P MACHINE LEARNING R, P651
[6]   SOPHIE velocimetry of Kepler transit candidates XVII. The physical properties of giant exoplanets within 400 days of period [J].
Santerne, A. ;
Moutou, C. ;
Tsantaki, M. ;
Bouchy, F. ;
Hebrard, G. ;
Adibekyan, V. ;
Almenara, J. -M. ;
Amard, L. ;
Barros, S. C. C. ;
Boisse, I. ;
Bonomo, A. S. ;
Bruno, G. ;
Courcol, B. ;
Deleuil, M. ;
Demangeon, O. ;
Diaz, R. F. ;
Guillot, T. ;
Havel, M. ;
Montagnier, G. ;
Rajpurohit, A. S. ;
Rey, J. ;
Santos, N. C. .
ASTRONOMY & ASTROPHYSICS, 2016, 587
[7]  
Lampe T, 2013, IEEE IJCNN
[8]   Dexterous grasping under shape uncertainty [J].
Li, Miao ;
Hang, Kaiyu ;
Kragic, Danica ;
Billard, Aude .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 75 :352-364
[9]  
Okamura A. M., 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065), P255, DOI 10.1109/ROBOT.2000.844067
[10]  
Quillen D, 2018, IEEE INT CONF ROBOT, P6284