Waypoint-Based Imitation Learning for Robotic Manipulation

被引:0
作者
Shi, Lucy Xiaoyang [1 ]
Sharma, Archit [1 ]
Zhao, Tony Z. [1 ]
Finn, Chelsea [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
来源
CONFERENCE ON ROBOT LEARNING, VOL 229 | 2023年 / 229卷
关键词
imitation learning; waypoints; long-horizon;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While imitation learning methods have seen a resurgent interest for robotic manipulation, the well-known problem of compounding errors continues to afflict behavioral cloning (BC). Waypoints can help address this problem by reducing the horizon of the learning problem for BC, and thus, the errors compounded over time. However, waypoint labeling is underspecified, and requires additional human supervision. Can we generate waypoints automatically without any additional human supervision? Our key insight is that if a trajectory segment can be approximated by linear motion, the endpoints can be used as waypoints. We propose Automatic Waypoint Extraction (AWE) for imitation learning, a preprocessing module to decompose a demonstration into a minimal set of waypoints which when interpolated linearly can approximate the trajectory up to a specified error threshold. AWE can be combined with any BC algorithm, and we find that AWE can increase the success rate of state-of-the-art algorithms by up to 25% in simulation and by 4-28% on real-world bimanual manipulation tasks, reducing the decision making horizon by up to a factor of 10. Videos and code are available at https://lucys0.github.io/awe/.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Imitation Learning based Soft Robotic Grasping Control without Precise Estimation of Target Posture
    Cortes, David Santiago Diaz
    Hwang, Geonwoo
    Kyung, Ki-Uk
    2021 IEEE 4TH INTERNATIONAL CONFERENCE ON SOFT ROBOTICS (ROBOSOFT), 2021, : 149 - 154
  • [22] Autonomous Teleoperated Robotic Arm Based on Imitation Learning Using Instance Segmentation and Haptics Information
    Imai, Kota
    Takahashi, Yasutake
    Tsuichihara, Satoki
    Haruna, Masaki
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2025, 29 (01) : 79 - 94
  • [23] IMITATION LEARNING OF DUAL-ARM MANIPULATION TASKS IN HUMANOID ROBOTS
    Asfour, T.
    Azad, P.
    Gyarfas, F.
    Dillmann, R.
    INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2008, 5 (02) : 183 - 202
  • [24] FMB: A functional manipulation benchmark for generalizable robotic learning
    Luo, Jianlan
    Xu, Charles
    Liu, Fangchen
    Tan, Liam
    Lin, Zipeng
    Wu, Jeffrey
    Abbeel, Pieter
    Levine, Sergey
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2025, 44 (04) : 592 - 606
  • [25] A Computational Framework for Integrating Robotic Exploration and Human Demonstration in Imitation Learning
    Tan, Huan
    Kawamura, Kazuhiko
    2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 2501 - 2506
  • [26] Imitation Learning for Nonprehensile Manipulation Through Self-Supervised Learning Considering Motion Speed
    Saigusa, Yuki
    Sakaino, Sho
    Tsuji, Toshiaki
    IEEE ACCESS, 2022, 10 : 68291 - 68306
  • [27] Gaze-Based Dual Resolution Deep Imitation Learning for High-Precision Dexterous Robot Manipulation
    Kim, Heecheol
    Ohmura, Yoshiyuki
    Kuniyoshi, Yasuo
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 1630 - 1637
  • [28] Imitation Learning System Design with Small Training Data for Flexible Tool Manipulation
    Sasatake, Harumo
    Tasaki, Ryosuke
    Yamashita, Takahito
    Uchiyama, Naoki
    INTERNATIONAL JOURNAL OF AUTOMATION TECHNOLOGY, 2021, 15 (05) : 669 - 677
  • [29] Bayesian Disturbance Injection: Robust imitation learning of flexible policies for robot manipulation
    Oh, Hanbit
    Sasaki, Hikaru
    Michael, Brendan
    Matsubara, Takamitsu
    NEURAL NETWORKS, 2023, 158 : 42 - 58
  • [30] Using human gaze in few-shot imitation learning for robot manipulation
    Hamano, Shogo
    Kim, Heecheol
    Ohmura, Yoshiyuki
    Kuniyoshi, Yasuo
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 8622 - 8629