Dynamic movement primitives based on positive and negative demonstrations

被引:2
作者
Dong, Shuai [1 ]
Yang, Zhihua [1 ]
Zhang, Weixi [1 ]
Zou, Kun [1 ,2 ]
机构
[1] Univ Elect Sci & Technol China, Zhongshan Inst, Zhongshan, Peoples R China
[2] Univ Elect Sci & Technol China, Zhongshan Inst, Xueyuan Rd 1, Zhongshan 528400, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Expectation-maximization algorithm; Gaussian mixed regression; negative demonstrations; skill transfer learning; data set aggregation; TASK; SAFETY;
D O I
10.1177/17298806231152997
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Dynamic motion primitive has been the most prevalent model-based imitation learning method in the last few decades. Gaussian mixed regression dynamic motion primitive, which draws upon the strengths of both the motion model and the probability model to cope with multiple demonstrations, is a very practical and conspicuous branch in the dynamic motion primitive family. As Gaussian mixed regression dynamic motion primitive only learns from expert demonstrations and requires full environmental information, it is incapable of handling tasks with unmodeled obstacles. Aiming at this problem, we proposed the positive and negative demonstrations-based dynamic motion primitive, for which the introduction of negative demonstrations can bring additional flexibility. Positive and negative demonstrations-based dynamic motion primitive extends Gaussian mixed regression dynamic motion primitive in three aspects. The first aspect is a new maximum log-likelihood function that balances the probabilities on positive and negative demonstrations. The second one is the positive and negative demonstrations-based expectation-maximum, which involves iteratively calculating the lower bound of a new Q-function. And the last is the application framework of data set aggregation for positive and negative demonstrations-based dynamic motion primitive to handle unmodeled obstacles. Experiments on several typical robot manipulating tasks, which include letter writing, obstacle avoidance, and grasping in a grid box, are conducted to validate the performance of positive and negative demonstrations-based dynamic motion primitive.
引用
收藏
页数:19
相关论文
共 49 条
[1]   Simultaneously learning actions and goals from demonstration [J].
Akgun, Baris ;
Thomaz, Andrea .
AUTONOMOUS ROBOTS, 2016, 40 (02) :211-227
[2]  
Alizadeh T, 2016, IEEE ASME INT C ADV, P889, DOI 10.1109/AIM.2016.7576881
[3]  
[Anonymous], 1996, The EM algorithm and extensions
[4]  
Berkenkamp F, 2017, ADV NEUR IN, V30
[5]   On learning, representing, and generalizing a task in a humanoid robot [J].
Calinon, Sylvain ;
Guenter, Florent ;
Billard, Aude .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02) :286-298
[6]   A tutorial on task-parameterized movement learning and retrieval [J].
Calinon, Sylvain .
INTELLIGENT SERVICE ROBOTICS, 2016, 9 (01) :1-29
[7]   Active learning with statistical models [J].
Cohn, DA ;
Ghahramani, Z ;
Jordan, MI .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 :129-145
[8]   Dimensionality Reduction for Dynamic Movement Primitives and Application to Bimanual Manipulation of Clothes [J].
Colome, Adria ;
Torras, Carme .
IEEE TRANSACTIONS ON ROBOTICS, 2018, 34 (03) :602-615
[9]  
Colomé A, 2014, IEEE-RAS INT C HUMAN, P794, DOI 10.1109/HUMANOIDS.2014.7041454
[10]  
Du GG, 2020, Arxiv, DOI arXiv:1905.06658