Skill Learning for Robotic Insertion Based on One-shot Demonstration and Reinforcement Learning

被引:8
作者
Li, Ying [1 ,2 ]
Xu, De [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Res Ctr Precis Sensing & Control, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Force Jacobian matrix; one-shot demonstration; dynamic exploration strategy; insertion skill learning; reinforcement learning;
D O I
10.1007/s11633-021-1290-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, an efficient skill learning framework is proposed for robotic insertion, based on one-shot demonstration and reinforcement learning. First, the robot action is composed of two parts: expert action and refinement action. A force Jacobian matrix is calibrated with only one demonstration, based on which stable and safe expert action can be generated. The deep deterministic policy gradients (DDPG) method is employed to learn the refinement action, which aims to improve the assembly efficiency. Second, an episode-step exploration strategy is developed, which uses the expert action as a benchmark and adjusts the exploration intensity dynamically. A safety-efficiency reward function is designed for the compliant insertion. Third, to improve the adaptability with different components, a skill saving and selection mechanism is proposed. Several typical components are used to train the skill models. And the trained models and force Jacobian matrices are saved in a skill pool. Given a new component, the most appropriate model is selected from the skill pool according to the force Jacobian matrix and directly used to accomplish insertion tasks. Fourth, a simulation environment is established under the guidance of the force Jacobian matrix, which avoids tedious training process on real robotic systems. Simulation and experiments are conducted to validate the effectiveness of the proposed methods.
引用
收藏
页码:457 / 467
页数:11
相关论文
共 21 条
  • [1] A Survey on Modelling and Compensation for Hysteresis in High Speed Nanopositioning of AFMs: Observation and Future Recommendation
    Armin, Maniza
    Roy, Priyo Nath
    Das, Sajal Kumar
    [J]. INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2020, 17 (04) : 479 - 501
  • [2] A Study on Error Recovery Search Strategies of Electronic Connector Mating for Robotic Fault-Tolerant Assembly
    Chen, Fei
    Cannella, Ferdinando
    Huang, Jian
    Sasaki, Hironobu
    Fukuda, Toshio
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2016, 81 (02) : 257 - 271
  • [3] Fan YX, 2019, IEEE INT CONF ROBOT, P811, DOI [10.1109/ICRA.2019.8793659, 10.1109/icra.2019.8793659]
  • [4] Hou ZM, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), P256, DOI 10.1109/ROBIO.2018.8665255
  • [5] Inoue T, 2017, IEEE INT C INT ROBOT, P819, DOI 10.1109/IROS.2017.8202244
  • [6] Johannink T, 2019, IEEE INT CONF ROBOT, P6023, DOI [10.1109/icra.2019.8794127, 10.1109/ICRA.2019.8794127]
  • [7] Robot skill acquisition in assembly process using deep reinforcement learning
    Li, Fengming
    Jiang, Qi
    Zhang, Sisi
    Wei, Meng
    Song, Rui
    [J]. NEUROCOMPUTING, 2019, 345 : 92 - 102
  • [8] Lillicrap TP, 2016, P 4 INT C LEARN REPR
  • [9] Sensing and Control for Simultaneous Precision Peg-in-Hole Assembly of Multiple Objects
    Liu, Song
    Li, You-Fu
    Xing, Dengpeng
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2020, 17 (01) : 310 - 324
  • [10] High Precision Automatic Assembly Based on Microscopic Vision and Force Information
    Liu, Song
    Xu, De
    Zhang, Dapeng
    Zhang, Zhengtao
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2016, 13 (01) : 382 - 393