Skill Learning for Robotic Insertion Based on One-shot Demonstration and Reinforcement Learning

被引：8

作者：

Li, Ying ^{[1
,2
]}

Xu, De ^{[1
,2
]}

机构：

[1] Chinese Acad Sci, Inst Automat, Res Ctr Precis Sensing & Control, Beijing 100190, Peoples R China

[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China

来源：

INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING | 2021年 / 18卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Force Jacobian matrix; one-shot demonstration; dynamic exploration strategy; insertion skill learning; reinforcement learning;

D O I：

10.1007/s11633-021-1290-3

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, an efficient skill learning framework is proposed for robotic insertion, based on one-shot demonstration and reinforcement learning. First, the robot action is composed of two parts: expert action and refinement action. A force Jacobian matrix is calibrated with only one demonstration, based on which stable and safe expert action can be generated. The deep deterministic policy gradients (DDPG) method is employed to learn the refinement action, which aims to improve the assembly efficiency. Second, an episode-step exploration strategy is developed, which uses the expert action as a benchmark and adjusts the exploration intensity dynamically. A safety-efficiency reward function is designed for the compliant insertion. Third, to improve the adaptability with different components, a skill saving and selection mechanism is proposed. Several typical components are used to train the skill models. And the trained models and force Jacobian matrices are saved in a skill pool. Given a new component, the most appropriate model is selected from the skill pool according to the force Jacobian matrix and directly used to accomplish insertion tasks. Fourth, a simulation environment is established under the guidance of the force Jacobian matrix, which avoids tedious training process on real robotic systems. Simulation and experiments are conducted to validate the effectiveness of the proposed methods.

引用

页码：457 / 467

页数：11

共 21 条

[1] A Survey on Modelling and Compensation for Hysteresis in High Speed Nanopositioning of AFMs: Observation and Future Recommendation
Armin, Maniza
Roy, Priyo Nath
Das, Sajal Kumar
[J]. INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2020, 17 (04) : 479 - 501
[2] A Study on Error Recovery Search Strategies of Electronic Connector Mating for Robotic Fault-Tolerant Assembly
Chen, Fei
Cannella, Ferdinando
Huang, Jian
Sasaki, Hironobu
Fukuda, Toshio
[J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2016, 81 (02) : 257 - 271
[3] Fan YX, 2019, IEEE INT CONF ROBOT, P811, DOI [10.1109/ICRA.2019.8793659, 10.1109/icra.2019.8793659]
[4] Hou ZM, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), P256, DOI 10.1109/ROBIO.2018.8665255
[5] Inoue T, 2017, IEEE INT C INT ROBOT, P819, DOI 10.1109/IROS.2017.8202244
[6] Johannink T, 2019, IEEE INT CONF ROBOT, P6023, DOI [10.1109/icra.2019.8794127, 10.1109/ICRA.2019.8794127]
[7] Robot skill acquisition in assembly process using deep reinforcement learning
Li, Fengming
Jiang, Qi
Zhang, Sisi
Wei, Meng
Song, Rui
[J]. NEUROCOMPUTING, 2019, 345 : 92 - 102
[8] Lillicrap TP, 2016, P 4 INT C LEARN REPR
[9] Sensing and Control for Simultaneous Precision Peg-in-Hole Assembly of Multiple Objects
Liu, Song
Li, You-Fu
Xing, Dengpeng
[J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2020, 17 (01) : 310 - 324
[10] High Precision Automatic Assembly Based on Microscopic Vision and Force Information
Liu, Song
Xu, De
Zhang, Dapeng
Zhang, Zhengtao
[J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2016, 13 (01) : 382 - 393

← 1 2 3 →