f-Divergence Optimization for Task-Parameterized Learning from Demonstrations Algorithm

被引：2

作者：

Prados, Adrian ^{[1
]}

Mendez, Alberto ^{[1
]}

Espinoza, Gonzalo ^{[1
]}

Fernandez, Noelia ^{[1
]}

Barber, Ramon ^{[1
]}

机构：

[1] Univ Carlos III, RoboticsLab Syst & Automat, Madrid, Spain

来源：

2024 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS, ICARSC | 2024年

关键词：

Learning from Demonstration; Imitation Learning; Manipulation; Task Parameterized Gaussian Mixture Model;

D O I：

10.1109/ICARSC61747.2024.10535920

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Programming robots through demonstration in unstructured environments is often a challenging task, requiring consideration of various parameters. One of the main challenges in an unstructured environment is the ability to extrapolate from user-provided demonstrations, which are sometimes limited. To address this issue, the idea of using task parameterization emerges, assuming that trajectory movements are modulated by different parameters (such as orientation or position) of relevant points, such as the initial and final points of a trajectory. However, some of these task parameters (TPs) may not be relevant for task resolution, especially in environments where various types of movements can occur, introducing additional difficulties in task learning. Additionally, it may happen that two demonstrations contain similar information for task execution (redundancies). This article proposes an approach based on a Task Parameterized Gaussian Mixture Model (TPGMM) for Learning from Demonstrations (LfD) that, through the use of an f-Divergence method (Kullback-Leibler), eliminates redundancy and irrelevance in certain tasks. This allows for a optimal learning model that avoids unnecessary information. The efficiency of the proposed approach has been tested in simulation environments and compared against state-of-the-art algorithms within the LfD domain, demonstrating high efficiency in both cases.

引用

页码：9 / 14

页数：6

共 19 条

[1]

Alizadeh T., 2018, ICMA, P94

[2]

Alizadeh T, 2016, IEEE/SICE I S SYS IN, P453, DOI 10.1109/SII.2016.7844040

[3]

Calinan S, 2012, IEEE-RAS INT C HUMAN, P323, DOI 10.1109/HUMANOIDS.2012.6651539

[4] On learning, representing, and generalizing a task in a humanoid robot [J].

Calinon, Sylvain ;

Guenter, Florent ;

Billard, Aude .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02) :286-298

[5] A tutorial on task-parameterized movement learning and retrieval [J].

Calinon, Sylvain .

INTELLIGENT SERVICE ROBOTICS, 2016, 9 (01) :1-29

[6]

Correia A, 2023, Arxiv, DOI arXiv:2303.11191

[7] An improved approach of task-parameterized learning from demonstrations for cobots in dynamic manufacturing [J].

El Zaatari, Shirine ;

Wang, Yuqi ;

Hu, Yudie ;

Li, Weidong .

JOURNAL OF INTELLIGENT MANUFACTURING, 2022, 33 (05) :1503-1519

[8] Approximating the Kullback Leibler Divergence between Gaussian Mixture Models [J].

Hershey, John R. ;

Olsen, Peder A. .

2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, :317-320

[9] Kernelized movement primitives [J].

Huang, Yanlong ;

Rozo, Leonel ;

Silverio, Joao ;

Caldwell, Darwin G. .

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2019, 38 (07) :833-852

[10]

Huang YL, 2018, IEEE INT CONF ROBOT, P5667

← 1 2 →