The Art of Imitation: Learning Long-Horizon Manipulation Tasks From Few Demonstrations

Cited by: 1

Authors:

von Hartz, Jan Ole [1]

Welschehold, Tim [1]

Valada, Abhinav [1]

Boedecker, Joschka [1]

Affiliation:

[1] Univ Freiburg, Dept Comp Sci, D-79085 Freiburg, Germany

Source:

IEEE ROBOTICS AND AUTOMATION LETTERS | 2024, Vol. 9, No. 12

Keywords:

Hidden Markov models; Trajectory; Visualization; Adaptation models; Robot kinematics; Gaussian mixture model; End effectors; Robot sensing systems; Motion segmentation; Data models; Imitation learning; learning from demonstration; sensorimotor learning; MOVEMENT; MIXTURE;

DOI:

10.1109/LRA.2024.3487506

Chinese Library Classification (CLC) number:

TP24 [Robotics]

Discipline classification codes:

080202; 1405

Abstract:

Task Parametrized Gaussian Mixture Models (TP-GMM) are a sample-efficient method for learning object-centric robot manipulation tasks. However, there are several open challenges to applying TP-GMMs in the wild. In this work, we tackle three crucial challenges synergistically. First, end-effector velocities are non-Euclidean and thus hard to model using standard GMMs. We thus propose to factorize the robot's end-effector velocity into its direction and magnitude, and model them using Riemannian GMMs. Second, we leverage the factorized velocities to segment and sequence skills from complex demonstration trajectories. Through the segmentation, we further align skill trajectories and hence leverage time as a powerful inductive bias. Third, we present a method to automatically detect relevant task parameters per skill from visual observations. Our approach enables learning complex manipulation tasks from just five demonstrations while using only RGB-D observations. Extensive experimental evaluations on RLBench demonstrate that our approach achieves state-of-the-art performance with 20-fold improved sample efficiency. Our policies generalize across different environments, object instances, and object positions, while the learned skills are reusable.
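
As a rough illustration of the velocity factorization mentioned in the abstract, the sketch below splits a 3-D end-effector velocity into a unit direction and a scalar magnitude, then cuts a trajectory wherever the magnitude drops below a threshold. This is not the authors' implementation: the function names, the `eps` and `threshold` values, and the simple speed-threshold segmentation rule are all assumptions made for illustration, and the Riemannian GMM fitting itself is not shown.

```python
import math


def factorize_velocity(v, eps=1e-9):
    """Split a 3-D velocity into (unit direction, magnitude).

    Hypothetical helper: for near-zero speed the direction is undefined,
    so this sketch returns None for it.
    """
    mag = math.sqrt(sum(c * c for c in v))
    if mag < eps:
        return None, 0.0
    return tuple(c / mag for c in v), mag


def segment_by_speed(velocities, threshold=0.01):
    """Return (start, end) index pairs, end exclusive, of contiguous runs
    whose speed is at least `threshold`.

    Assumption: a pause (low speed) separates two skills. The paper's
    actual segmentation criterion may differ.
    """
    segments, start = [], None
    for i, v in enumerate(velocities):
        _, mag = factorize_velocity(v)
        moving = mag >= threshold
        if moving and start is None:
            start = i  # a new moving segment begins
        elif not moving and start is not None:
            segments.append((start, i))  # the segment ended at the pause
            start = None
    if start is not None:
        segments.append((start, len(velocities)))
    return segments
```

For example, a trajectory that moves, pauses, and moves again would yield two segments, each of which could then be modelled as a separate skill.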

Pages: 11369-11376

Number of pages: 8