Dynamic movement primitives based on positive and negative demonstrations

被引：2

作者：

Dong, Shuai ^{[1
]}

Yang, Zhihua ^{[1
]}

Zhang, Weixi ^{[1
]}

Zou, Kun ^{[1
,2
]}

机构：

[1] Univ Elect Sci & Technol China, Zhongshan Inst, Zhongshan, Peoples R China

[2] Univ Elect Sci & Technol China, Zhongshan Inst, Xueyuan Rd 1, Zhongshan 528400, Guangdong, Peoples R China

来源：

INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS | 2023年 / 20卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Expectation-maximization algorithm; Gaussian mixed regression; negative demonstrations; skill transfer learning; data set aggregation; TASK; SAFETY;

D O I：

10.1177/17298806231152997

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Dynamic motion primitive has been the most prevalent model-based imitation learning method in the last few decades. Gaussian mixed regression dynamic motion primitive, which draws upon the strengths of both the motion model and the probability model to cope with multiple demonstrations, is a very practical and conspicuous branch in the dynamic motion primitive family. As Gaussian mixed regression dynamic motion primitive only learns from expert demonstrations and requires full environmental information, it is incapable of handling tasks with unmodeled obstacles. Aiming at this problem, we proposed the positive and negative demonstrations-based dynamic motion primitive, for which the introduction of negative demonstrations can bring additional flexibility. Positive and negative demonstrations-based dynamic motion primitive extends Gaussian mixed regression dynamic motion primitive in three aspects. The first aspect is a new maximum log-likelihood function that balances the probabilities on positive and negative demonstrations. The second one is the positive and negative demonstrations-based expectation-maximum, which involves iteratively calculating the lower bound of a new Q-function. And the last is the application framework of data set aggregation for positive and negative demonstrations-based dynamic motion primitive to handle unmodeled obstacles. Experiments on several typical robot manipulating tasks, which include letter writing, obstacle avoidance, and grasping in a grid box, are conducted to validate the performance of positive and negative demonstrations-based dynamic motion primitive.

引用

页数：19

共 49 条

[21]

Karlsson Martin, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P316, DOI 10.1109/ICRA.2017.7989040

[22]

Karlsson M., 2018, 2018 17 EUROPEAN CON

[23] Incorporating Safety Into Parametric Dynamic Movement Primitives [J].

Kim, Hyoin ;

Seo, Hoseong ;

Choi, Seungwon ;

Tomlin, Claire J. ;

Kim, H. Jin .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (03) :2260-2267

[24]

Kroemer O, 2015, IEEE INT CONF ROBOT, P1503, DOI 10.1109/ICRA.2015.7139389

[25]

Kuefler A., 2017, P 17 INT C AUTONOMOU, P1

[26]

Lee KH, 2018, ECO-EFFIC IND SCI, V33, P1, DOI 10.1007/978-3-319-70899-7_1

[27] An enhanced teaching interface for a robot using DMP and GMR [J].

Li, Chunxu ;

Yang, Chenguang ;

Ju, Zhaojie ;

Annamalai, Andy S. K. .

INTERNATIONAL JOURNAL OF INTELLIGENT ROBOTICS AND APPLICATIONS, 2018, 2 (01) :110-121

[28]

Li H., 2019, STAT LEARNING METHOD

[29] Learning movement primitive libraries through probabilistic segmentation [J].

Lioutikov, Rudolf ;

Neumann, Gerhard ;

Maeda, Guilherme ;

Peters, Jan .

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2017, 36 (08) :879-894

[30]

[刘环 Liu Huan], 2019, [机器人, Robot], V41, P574

← 1 2 3 4 5 →