Learning Agile Skills via Adversarial Imitation of Rough Partial Demonstrations

Cited by: 0

Authors:

Li, Chenhao [1,2]

Vlastelica, Marin [1]

Blaes, Sebastian [1]

Frey, Jonas [1,2]

Grimminger, Felix [1]

Martius, Georg [1]

Affiliations:

[1] Max Planck Inst Intelligent Syst, Stuttgart, Germany

[2] Swiss Fed Inst Technol, Robot Syst Lab, Zurich, Switzerland

Source:

CONFERENCE ON ROBOT LEARNING, VOL 205 | 2022 / Vol. 205

Keywords:

Adversarial; Imitation Learning; Legged Robots; Locomotion

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Learning agile skills is one of the main challenges in robotics. To this end, reinforcement learning approaches have achieved impressive results. These methods require explicit task information in terms of a reward function or an expert that can be queried in simulation to provide a target control output, which limits their applicability. In this work, we propose a generative adversarial method for inferring reward functions from partial and potentially physically incompatible demonstrations for successful skill acquirement where reference or expert demonstrations are not easily accessible. Moreover, we show that by using a Wasserstein GAN formulation and transitions from demonstrations with rough and partial information as input, we are able to extract policies that are robust and capable of imitating demonstrated behaviors. Finally, the obtained skills such as a backflip are tested on an agile quadruped robot called Solo 8 and present faithful replication of hand-held human demonstrations.
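The abstract mentions a Wasserstein GAN formulation that takes transitions from demonstrations as input and yields a reward signal for the policy. As a rough illustration of that general idea only, the sketch below trains a critic to score demonstration transitions higher than policy transitions and then reuses the score as a reward. Everything in it is an assumption made for illustration: the linear critic, the feature layout of a transition, the weight clipping used as a Lipschitz constraint, and all function names are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence

# Hypothetical layout: a transition is the concatenated features of (s, s').
Transition = Sequence[float]


def critic_score(weights: Sequence[float], transition: Transition) -> float:
    """Linear critic f(s, s') = w . [s, s']; a stand-in for a neural network."""
    return sum(w * x for w, x in zip(weights, transition))


def mean_score(weights: Sequence[float], batch: Sequence[Transition]) -> float:
    """Average critic score over a batch of transitions."""
    return sum(critic_score(weights, t) for t in batch) / len(batch)


def wasserstein_critic_loss(
    weights: Sequence[float],
    demo_batch: Sequence[Transition],
    policy_batch: Sequence[Transition],
) -> float:
    """Loss the critic minimizes: E_policy[f] - E_demo[f]."""
    return mean_score(weights, policy_batch) - mean_score(weights, demo_batch)


def critic_step(
    weights: Sequence[float],
    demo_batch: Sequence[Transition],
    policy_batch: Sequence[Transition],
    lr: float = 0.1,
    clip: float = 1.0,
) -> List[float]:
    """One gradient-descent step on the loss, followed by weight clipping.

    For a linear critic the gradient of the loss with respect to the weights
    is simply mean(policy features) - mean(demo features).
    Weight clipping is an assumed way to bound the critic here.
    """
    dim = len(weights)
    demo_mean = [sum(t[i] for t in demo_batch) / len(demo_batch) for i in range(dim)]
    policy_mean = [sum(t[i] for t in policy_batch) / len(policy_batch) for i in range(dim)]
    grad = [policy_mean[i] - demo_mean[i] for i in range(dim)]
    updated = [w - lr * g for w, g in zip(weights, grad)]
    return [max(-clip, min(clip, w)) for w in updated]


def imitation_reward(weights: Sequence[float], transition: Transition) -> float:
    """Reward handed to the policy: the critic's score of its own transition."""
    return critic_score(weights, transition)
```

In this toy setting, repeatedly calling `critic_step` and then optimizing the policy against `imitation_reward` alternates the two sides of the adversarial game; a practical system would replace the linear critic with a network and the plain gradient step with a standard optimizer.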


Pages: 342-352

Page count: 11
