Learning Agile Skills via Adversarial Imitation of Rough Partial Demonstrations

被引:0
|
作者
Li, Chenhao [1 ,2 ]
Vlastelica, Marin [1 ]
Blaes, Sebastian [1 ]
Frey, Jonas [1 ,2 ]
Grimminger, Felix [1 ]
Martius, Georg [1 ]
机构
[1] Max Planck Inst Intelligent Syst, Stuttgart, Germany
[2] Swiss Fed Inst Technol, Robot Syst Lab, Zurich, Switzerland
来源
CONFERENCE ON ROBOT LEARNING, VOL 205 | 2022年 / 205卷
关键词
Adversarial; Imitation Learning; Legged Robots; LOCOMOTION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning agile skills is one of the main challenges in robotics. To this end, reinforcement learning approaches have achieved impressive results. These methods require explicit task information in terms of a reward function or an expert that can be queried in simulation to provide a target control output, which limits their applicability. In this work, we propose a generative adversarial method for inferring reward functions from partial and potentially physically incompatible demonstrations for successful skill acquirement where reference or expert demonstrations are not easily accessible. Moreover, we show that by using a Wasserstein GAN formulation and transitions from demonstrations with rough and partial information as input, we are able to extract policies that are robust and capable of imitating demonstrated behaviors. Finally, the obtained skills such as a backflip are tested on an agile quadruped robot called Solo 8 and present faithful replication of hand-held human demonstrations.
引用
收藏
页码:342 / 352
页数:11
相关论文
共 50 条
  • [21] Adversarial Cooperative Imitation Learning for Dynamic Treatment Regimes
    Wang, Lu
    Yu, Wenchao
    Cheng, Wei
    Min, Martin Renqiang
    Zong, Bo
    He, Xiaofeng
    Zha, Hongyuan
    Wang, Wei
    Chen, Haifeng
    WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 1785 - 1795
  • [22] Interactive imitation learning of object movement skills
    Manuel Mühlig
    Michael Gienger
    Jochen J. Steil
    Autonomous Robots, 2012, 32 : 97 - 114
  • [23] BAGAIL: Multi-modal imitation learning from imbalanced demonstrations
    Gu, Sijia
    Zhu, Fei
    NEURAL NETWORKS, 2024, 174
  • [24] Interactive imitation learning of object movement skills
    Muehlig, Manuel
    Gienger, Michael
    Steil, Jochen J.
    AUTONOMOUS ROBOTS, 2012, 32 (02) : 97 - 114
  • [25] Generative Adversarial Network for Imitation Learning from Single Demonstration
    Tho Nguyen Duc
    Chanh Minh Tran
    Phan Xuan Tan
    Kamioka, Eiji
    BAGHDAD SCIENCE JOURNAL, 2021, 18 (04) : 1350 - 1355
  • [26] Saliency Prediction on Omnidirectional Image With Generative Adversarial Imitation Learning
    Xu, Mai
    Yang, Li
    Tao, Xiaoming
    Duan, Yiping
    Wang, Zulin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2087 - 2102
  • [27] Imitation Learning for Playing Shogi Based on Generative Adversarial Networks
    Wan, Shanchuan
    Kaneko, Tomoyuki
    2017 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2017, : 92 - 95
  • [28] Improve generated adversarial imitation learning with reward variance regularization
    Zhang, Yi-Feng
    Luo, Fan-Ming
    Yu, Yang
    MACHINE LEARNING, 2022, 111 (03) : 977 - 995
  • [29] Semi-Supervised Imitation Learning with Mixed Qualities of Demonstrations for Autonomous Driving
    Lee, Gunmin
    Oh, Wooseok
    Oh, Jeongwoo
    Shin, Seungyoun
    Kim, Dohyeong
    Jeong, Jaeyeon
    Choi, Sungjoon
    Oh, Songhwai
    2022 22ND INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2022), 2022, : 20 - 25
  • [30] Domain Adaptation for Imitation Learning Using Generative Adversarial Network
    Duc, Tho Nguyen
    Tran, Chanh Minh
    Tan, Phan Xuan
    Kamioka, Eiji
    SENSORS, 2021, 21 (14)