Dynamic movement primitives based on positive and negative demonstrations

被引:2
作者
Dong, Shuai [1 ]
Yang, Zhihua [1 ]
Zhang, Weixi [1 ]
Zou, Kun [1 ,2 ]
机构
[1] Univ Elect Sci & Technol China, Zhongshan Inst, Zhongshan, Peoples R China
[2] Univ Elect Sci & Technol China, Zhongshan Inst, Xueyuan Rd 1, Zhongshan 528400, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Expectation-maximization algorithm; Gaussian mixed regression; negative demonstrations; skill transfer learning; data set aggregation; TASK; SAFETY;
D O I
10.1177/17298806231152997
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Dynamic motion primitive has been the most prevalent model-based imitation learning method in the last few decades. Gaussian mixed regression dynamic motion primitive, which draws upon the strengths of both the motion model and the probability model to cope with multiple demonstrations, is a very practical and conspicuous branch in the dynamic motion primitive family. As Gaussian mixed regression dynamic motion primitive only learns from expert demonstrations and requires full environmental information, it is incapable of handling tasks with unmodeled obstacles. Aiming at this problem, we proposed the positive and negative demonstrations-based dynamic motion primitive, for which the introduction of negative demonstrations can bring additional flexibility. Positive and negative demonstrations-based dynamic motion primitive extends Gaussian mixed regression dynamic motion primitive in three aspects. The first aspect is a new maximum log-likelihood function that balances the probabilities on positive and negative demonstrations. The second one is the positive and negative demonstrations-based expectation-maximum, which involves iteratively calculating the lower bound of a new Q-function. And the last is the application framework of data set aggregation for positive and negative demonstrations-based dynamic motion primitive to handle unmodeled obstacles. Experiments on several typical robot manipulating tasks, which include letter writing, obstacle avoidance, and grasping in a grid box, are conducted to validate the performance of positive and negative demonstrations-based dynamic motion primitive.
引用
收藏
页数:19
相关论文
共 49 条
[41]   Learning Robot Manipulation Tasks With Task-Parameterized Semitied Hidden Semi-Markov Model [J].
Tanwani, Ajay Kumar ;
Calinon, Sylvain .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2016, 1 (01) :235-242
[42]  
Tessler C., 2018, ARXIV PREPRINT ARXIV, P1
[43]  
Tessler C, 2018, Arxiv, DOI arXiv:1805.11074
[44]   Task-Specific Generalization of Discrete and Periodic Dynamic Movement Primitives [J].
Ude, Ales ;
Gams, Andrej ;
Asfour, Tamim ;
Morimoto, Jun .
IEEE TRANSACTIONS ON ROBOTICS, 2010, 26 (05) :800-815
[45]   Compliant Parametric Dynamic Movement Primitives [J].
Ugur, Emre ;
Girgin, Hakan .
ROBOTICA, 2020, 38 (03) :457-474
[46]  
Wang ZY, 2017, ADV NEUR IN, V30
[47]  
Wen M., 2020, C NEURAL INFORM PROC, P1
[48]  
Yang C., 2019, IEEE T NEUR NET LEAR, V25, P581
[49]   A General Framework to Increase Safety of Learning Algorithms for Dynamical Systems Based on Region of Attraction Estimation [J].
Zhou, Zhehua ;
Oguz, Ozgur S. ;
Leibold, Marion ;
Buss, Martin .
IEEE TRANSACTIONS ON ROBOTICS, 2020, 36 (05) :1472-1490