PRIME: Scaffolding Manipulation Tasks With Behavior Primitives for Data-Efficient Imitation Learning

被引:0
作者
Gao, Tian [1 ]
Nasiriany, Soroush [2 ]
Liu, Huihan [2 ]
Yang, Quantao [3 ]
Zhu, Yuke [2 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[2] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
[3] KTH Royal Inst Technol, Dept Comp Sci, S-11428 Stockholm, Sweden
基金
美国国家科学基金会;
关键词
Task analysis; Imitation learning; Trajectory; Motors; Dynamic programming; Training; Robot sensing systems; deep learning in grasping and manipulation; deep learning methods;
D O I
10.1109/LRA.2024.3443610
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Imitation learning has shown great potential for enabling robots to acquire complex manipulation behaviors. However, these algorithms suffer from high sample complexity in long-horizon tasks, where compounding errors accumulate over the task horizons. We present PRIME (PRimitive-based IMitation with data Efficiency), a behavior primitive-based framework designed for improving the data efficiency of imitation learning. PRIME scaffolds robot tasks by decomposing task demonstrations into primitive sequences, followed by learning a high-level control policy to sequence primitives through imitation learning. Our experiments demonstrate that PRIME achieves a significant performance improvement in multi-stage manipulation tasks, with 10-34% higher success rates in simulation over state-of-the-art baselines and 20-48% on physical hardware.
引用
收藏
页码:8322 / 8329
页数:8
相关论文
共 59 条
[1]  
Ajay A., 2021, P INT C LEARN REPR, P1
[2]   Data-Driven Grasp Synthesis-A Survey [J].
Bohg, Jeannette ;
Morales, Antonio ;
Asfour, Tamim ;
Kragic, Danica .
IEEE TRANSACTIONS ON ROBOTICS, 2014, 30 (02) :289-309
[3]  
Brandfonbrener D., 2023, 37 P C NEUR INF PROC
[4]  
Brohan A, 2022, P ROB SCI SYST
[5]  
Chen H., 2023, P C ROB LEARN, P358
[6]  
Chernova Sonia., 2007, Proc. AAMAS, p233:1, DOI [DOI 10.1145/1329125.1329407, 10.1145/1329125.1329407]
[7]  
Chi C., 2023, P ROB SCI SYST
[8]  
Chitnis R, 2020, IEEE INT CONF ROBOT, P1149, DOI [10.1109/icra40945.2020.9196958, 10.1109/ICRA40945.2020.9196958]
[9]  
Dalal Murtaza, 2021, Advances in Neural Information Processing Systems, V34
[10]  
Du Y, 2024, PROC 37 INT C NEURAL, V36