PRIME: Scaffolding Manipulation Tasks With Behavior Primitives for Data-Efficient Imitation Learning

被引：0

作者：

Gao, Tian ^{[1
]}

Nasiriany, Soroush ^{[2
]}

Liu, Huihan ^{[2
]}

Yang, Quantao ^{[3
]}

Zhu, Yuke ^{[2
]}

机构：

[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA

[2] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA

[3] KTH Royal Inst Technol, Dept Comp Sci, S-11428 Stockholm, Sweden

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 10期

基金：

美国国家科学基金会;

关键词：

Task analysis; Imitation learning; Trajectory; Motors; Dynamic programming; Training; Robot sensing systems; deep learning in grasping and manipulation; deep learning methods;

D O I：

10.1109/LRA.2024.3443610

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Imitation learning has shown great potential for enabling robots to acquire complex manipulation behaviors. However, these algorithms suffer from high sample complexity in long-horizon tasks, where compounding errors accumulate over the task horizons. We present PRIME (PRimitive-based IMitation with data Efficiency), a behavior primitive-based framework designed for improving the data efficiency of imitation learning. PRIME scaffolds robot tasks by decomposing task demonstrations into primitive sequences, followed by learning a high-level control policy to sequence primitives through imitation learning. Our experiments demonstrate that PRIME achieves a significant performance improvement in multi-stage manipulation tasks, with 10-34% higher success rates in simulation over state-of-the-art baselines and 20-48% on physical hardware.

引用

页码：8322 / 8329

页数：8

共 59 条

[1]

Ajay A., 2021, P INT C LEARN REPR, P1

[2] Data-Driven Grasp Synthesis-A Survey [J].

Bohg, Jeannette ;

Morales, Antonio ;

Asfour, Tamim ;

Kragic, Danica .

IEEE TRANSACTIONS ON ROBOTICS, 2014, 30 (02) :289-309

[3]

Brandfonbrener D., 2023, 37 P C NEUR INF PROC

[4]

Brohan A, 2022, P ROB SCI SYST

[5]

Chen H., 2023, P C ROB LEARN, P358

[6]

Chernova Sonia., 2007, Proc. AAMAS, p233:1, DOI [DOI 10.1145/1329125.1329407, 10.1145/1329125.1329407]

[7]

Chi C., 2023, P ROB SCI SYST

[8]

Chitnis R, 2020, IEEE INT CONF ROBOT, P1149, DOI [10.1109/icra40945.2020.9196958, 10.1109/ICRA40945.2020.9196958]

[9]

Dalal Murtaza, 2021, Advances in Neural Information Processing Systems, V34

[10]

Du Y, 2024, PROC 37 INT C NEURAL, V36

← 1 2 3 4 5 6 →