What Matters in Language Conditioned Robotic Imitation Learning Over Unstructured Data

被引：25

作者：

Mees, Oier ^{[1
]}

Hermann, Lukas ^{[1
]}

Burgard, Wolfram ^{[2
]}

机构：

[1] Univ Freiburg, D-79085 Freiburg, Germany

[2] Tech Univ Nuremberg, D-90489 Nurnberg, Germany

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2022年 / 7卷 / 04期

关键词：

Imitation learning; learning categories and concepts; machine learning for robot control;

D O I：

10.1109/LRA.2022.3196123

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

A long-standing goal in robotics is to build robots that can perform a wide range of daily tasks from perceptions obtained with their onboard sensors and specified only via natural language. While recently substantial advances have been achieved in language-driven robotics by leveraging end-to-end learning from pixels, there is no clear and well-understood process for making various design choices due to the underlying variation in setups. In this letter, we conduct an extensive study of the most critical challenges in learning language conditioned policies from offline free-form imitation datasets. We further identify architectural and algorithmic techniques that improve performance, such as a hierarchical decomposition of the robot control learning, a multimodal transformer encoder, discrete latent plans and a self-supervised contrastive loss that aligns video and language representations. By combining the results of our investigation with our improved model components, we are able to present a novel approach that significantly outperforms the state of the art on the challenging language conditioned long-horizon robot manipulation CALVIN benchmark. We have open-sourced our implementation to facilitate future research in learning to perform many complex manipulation skills in a row specified with natural language.

引用

页码：11205 / 11212

页数：8

共 37 条

[1] Abramson Josh, 2021, arXiv
[2] Andrychowicz M., 2017, Proceedings of the Advances in Neural Information Processing Systems, V30, P1
[3] Bengio Y, 2013, Arxiv, DOI arXiv:1308.3432
[4] Affordance Learning from Play for Sample-Efficient Policy Learning
Borja-Diaz, Jessica
Mees, Oier
Kalweit, Gabriel
Hermann, Lukas
Boedecker, Joschka
Burgard, Wolfram
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 6372 - 6378
[5] Dasari S., 2021, P 2020 C ROBOT LEARN, P2071
[6] de Haan P., 2019, PROC 33 INT C NEURAL
[7] Devlin J, 2019, Arxiv, DOI arXiv:1810.04805
[8] Hafner D., 2020, PROC INT C LEARN REP
[9] HARNAD S, 1990, PHYSICA D, V42, P335, DOI 10.1016/0167-2789(90)90087-6
[10] Hatori J, 2018, IEEE INT CONF ROBOT, P3774

← 1 2 3 4 →