Waypoint-Based Imitation Learning for Robotic Manipulation

被引:0
作者
Shi, Lucy Xiaoyang [1 ]
Sharma, Archit [1 ]
Zhao, Tony Z. [1 ]
Finn, Chelsea [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
来源
CONFERENCE ON ROBOT LEARNING, VOL 229 | 2023年 / 229卷
关键词
imitation learning; waypoints; long-horizon;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While imitation learning methods have seen a resurgent interest for robotic manipulation, the well-known problem of compounding errors continues to afflict behavioral cloning (BC). Waypoints can help address this problem by reducing the horizon of the learning problem for BC, and thus, the errors compounded over time. However, waypoint labeling is underspecified, and requires additional human supervision. Can we generate waypoints automatically without any additional human supervision? Our key insight is that if a trajectory segment can be approximated by linear motion, the endpoints can be used as waypoints. We propose Automatic Waypoint Extraction (AWE) for imitation learning, a preprocessing module to decompose a demonstration into a minimal set of waypoints which when interpolated linearly can approximate the trajectory up to a specified error threshold. AWE can be combined with any BC algorithm, and we find that AWE can increase the success rate of state-of-the-art algorithms by up to 25% in simulation and by 4-28% on real-world bimanual manipulation tasks, reducing the decision making horizon by up to a factor of 10. Videos and code are available at https://lucys0.github.io/awe/.
引用
收藏
页数:15
相关论文
共 50 条
[31]   Robotic Manipulation Skill Acquisition Via Demonstration Policy Learning [J].
Liu, Dong ;
Lu, Binpeng ;
Cong, Ming ;
Yu, Honghua ;
Zou, Qiang ;
Du, Yu .
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (03) :1054-1065
[32]   Watch and Act: Learning Robotic Manipulation From Visual Demonstration [J].
Yang, Shuo ;
Zhang, Wei ;
Song, Ran ;
Cheng, Jiyu ;
Wang, Hesheng ;
Li, Yibin .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (07) :4404-4416
[33]   A Motion Generation Strategy of Robotic Rat Using Imitation Learning for Behavioral Interaction [J].
Xie, Hongzhao ;
Jia, Guanglu ;
Al-Khulaqui, Mohamed ;
Gao, Zihang ;
Guo, Xiaowen ;
Fukuda, Toshio ;
Shi, Qing .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) :7351-7358
[34]   What Matters in Language Conditioned Robotic Imitation Learning Over Unstructured Data [J].
Mees, Oier ;
Hermann, Lukas ;
Burgard, Wolfram .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) :11205-11212
[35]   Imitation Learning from a Single Demonstration Leveraging Vector Quantization for Robotic Harvesting [J].
Porichis, Antonios ;
Inglezou, Myrto ;
Kegkeroglou, Nikolaos ;
Mohan, Vishwanathan ;
Chatzakos, Panagiotis .
ROBOTICS, 2024, 13 (07)
[36]   The Art of Imitation: Learning Long-Horizon Manipulation Tasks From Few Demonstrations [J].
von Hartz, Jan Ole ;
Welschehold, Tim ;
Valada, Abhinav ;
Boedecker, Joschka .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (12) :11369-11376
[37]   PRIME: Scaffolding Manipulation Tasks With Behavior Primitives for Data-Efficient Imitation Learning [J].
Gao, Tian ;
Nasiriany, Soroush ;
Liu, Huihan ;
Yang, Quantao ;
Zhu, Yuke .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (10) :8322-8329
[38]   A Novel Obstacle Traversal Method for Multiple Robotic Fish Based on Cross-Modal Variational Autoencoders and Imitation Learning [J].
Wang, Ruilong ;
Wang, Ming ;
Zhao, Qianchuan ;
Gong, Yanling ;
Zuo, Lingchen ;
Zheng, Xuehan ;
Gao, He .
BIOMIMETICS, 2024, 9 (04)
[39]   Correct Me If I am Wrong: Interactive Learning for Robotic Manipulation [J].
Chisari, Eugenio ;
Welschehold, Tim ;
Boedecker, Joschka ;
Burgard, Wolfram ;
Valada, Abhinav .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) :3695-3702
[40]   An Adaptive Imitation Learning Framework for Robotic Complex Contact-Rich Insertion Tasks [J].
Wang, Yan ;
Beltran-Hernandez, Cristian C. ;
Wan, Weiwei ;
Harada, Kensuke .
FRONTIERS IN ROBOTICS AND AI, 2022, 8