Learning Agile Skills via Adversarial Imitation of Rough Partial Demonstrations

被引:0
|
作者
Li, Chenhao [1 ,2 ]
Vlastelica, Marin [1 ]
Blaes, Sebastian [1 ]
Frey, Jonas [1 ,2 ]
Grimminger, Felix [1 ]
Martius, Georg [1 ]
机构
[1] Max Planck Inst Intelligent Syst, Stuttgart, Germany
[2] Swiss Fed Inst Technol, Robot Syst Lab, Zurich, Switzerland
来源
CONFERENCE ON ROBOT LEARNING, VOL 205 | 2022年 / 205卷
关键词
Adversarial; Imitation Learning; Legged Robots; LOCOMOTION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning agile skills is one of the main challenges in robotics. To this end, reinforcement learning approaches have achieved impressive results. These methods require explicit task information in terms of a reward function or an expert that can be queried in simulation to provide a target control output, which limits their applicability. In this work, we propose a generative adversarial method for inferring reward functions from partial and potentially physically incompatible demonstrations for successful skill acquirement where reference or expert demonstrations are not easily accessible. Moreover, we show that by using a Wasserstein GAN formulation and transitions from demonstrations with rough and partial information as input, we are able to extract policies that are robust and capable of imitating demonstrated behaviors. Finally, the obtained skills such as a backflip are tested on an agile quadruped robot called Solo 8 and present faithful replication of hand-held human demonstrations.
引用
收藏
页码:342 / 352
页数:11
相关论文
共 50 条
  • [1] Adversarial Imitation Learning from State-only Demonstrations
    Torabi, Faraz
    Warnell, Garrett
    Stone, Peter
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2229 - 2231
  • [2] A Novel Robust Imitation Learning Framework for Complex Skills With Limited Demonstrations
    Wang, Weiyong
    Zeng, Chao
    Zhan, Hong
    Yang, Chenguang
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 1 - 13
  • [3] Learning Category-Level Generalizable Object Manipulation Policy Via Generative Adversarial Self-Imitation Learning From Demonstrations
    Shen, Hao
    Wan, Weikang
    Wang, He
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 11166 - 11173
  • [4] Perception-Aware-Based UAV Trajectory Planner via Generative Adversarial Self-Imitation Learning From Demonstrations
    Zhang, Hanxuan
    Huo, Ju
    Huang, Yulong
    Cheng, Jiajun
    Li, Xiaofeng
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (03): : 3248 - 3260
  • [5] Imitation learning for agile autonomous driving
    Pan, Yunpeng
    Cheng, Ching-An
    Saigol, Kamil
    Lee, Keuntaek
    Yan, Xinyan
    Theodorou, Evangelos A.
    Boots, Byron
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2020, 39 (2-3): : 286 - 302
  • [6] Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrations
    Brown, Daniel S.
    Goo, Wonjoon
    Niekum, Scott
    CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
  • [7] Deterministic generative adversarial imitation learning
    Zuo, Guoyu
    Chen, Kexin
    Lu, Jiahao
    Huang, Xiangsheng
    NEUROCOMPUTING, 2020, 388 : 60 - 69
  • [8] Skill Disentanglement for Imitation Learning from Suboptimal Demonstrations
    Zhao, Tianxiang
    Yu, Wenchao
    Wang, Suhang
    Wang, Lu
    Zhang, Xiang
    Chen, Yuncong
    Liu, Yanchi
    Cheng, Wei
    Chen, Haifeng
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 3513 - 3524
  • [9] Model predictive optimization for imitation learning from demonstrations
    Hu, Yingbai
    Cui, Mingyang
    Duan, Jianghua
    Liu, Wenjun
    Huang, Dianye
    Knoll, Alois
    Chen, Guang
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2023, 163
  • [10] Learning from Suboptimal Demonstration via Trajectory-Ranked Adversarial Imitation
    Chen, Luyao
    Xie, Shaorong
    Pang, Tao
    Yu, Hang
    Luo, Xiangfeng
    Zhang, Zhenyu
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 486 - 493