Learning Agile Skills via Adversarial Imitation of Rough Partial Demonstrations

被引：0

作者：

Li, Chenhao ^{[1
,2
]}

Vlastelica, Marin ^{[1
]}

Blaes, Sebastian ^{[1
]}

Frey, Jonas ^{[1
,2
]}

Grimminger, Felix ^{[1
]}

Martius, Georg ^{[1
]}

机构：

[1] Max Planck Inst Intelligent Syst, Stuttgart, Germany

[2] Swiss Fed Inst Technol, Robot Syst Lab, Zurich, Switzerland

来源：

CONFERENCE ON ROBOT LEARNING, VOL 205 | 2022年 / 205卷

关键词：

Adversarial; Imitation Learning; Legged Robots; LOCOMOTION;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Learning agile skills is one of the main challenges in robotics. To this end, reinforcement learning approaches have achieved impressive results. These methods require explicit task information in terms of a reward function or an expert that can be queried in simulation to provide a target control output, which limits their applicability. In this work, we propose a generative adversarial method for inferring reward functions from partial and potentially physically incompatible demonstrations for successful skill acquirement where reference or expert demonstrations are not easily accessible. Moreover, we show that by using a Wasserstein GAN formulation and transitions from demonstrations with rough and partial information as input, we are able to extract policies that are robust and capable of imitating demonstrated behaviors. Finally, the obtained skills such as a backflip are tested on an agile quadruped robot called Solo 8 and present faithful replication of hand-held human demonstrations.

引用

页码：342 / 352

页数：11

共 50 条

[1] Adversarial Imitation Learning from State-only Demonstrations
Torabi, Faraz
Warnell, Garrett
Stone, Peter
AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2229 - 2231
[2] A Novel Robust Imitation Learning Framework for Complex Skills With Limited Demonstrations
Wang, Weiyong
Zeng, Chao
Zhan, Hong
Yang, Chenguang
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 1 - 13
[3] Learning Category-Level Generalizable Object Manipulation Policy Via Generative Adversarial Self-Imitation Learning From Demonstrations
Shen, Hao
Wan, Weikang
Wang, He
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 11166 - 11173
[4] Perception-Aware-Based UAV Trajectory Planner via Generative Adversarial Self-Imitation Learning From Demonstrations
Zhang, Hanxuan
Huo, Ju
Huang, Yulong
Cheng, Jiajun
Li, Xiaofeng
IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (03): : 3248 - 3260
[5] Imitation learning for agile autonomous driving
Pan, Yunpeng
Cheng, Ching-An
Saigol, Kamil
Lee, Keuntaek
Yan, Xinyan
Theodorou, Evangelos A.
Boots, Byron
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2020, 39 (2-3): : 286 - 302
[6] Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrations
Brown, Daniel S.
Goo, Wonjoon
Niekum, Scott
CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
[7] Deterministic generative adversarial imitation learning
Zuo, Guoyu
Chen, Kexin
Lu, Jiahao
Huang, Xiangsheng
NEUROCOMPUTING, 2020, 388 : 60 - 69
[8] Skill Disentanglement for Imitation Learning from Suboptimal Demonstrations
Zhao, Tianxiang
Yu, Wenchao
Wang, Suhang
Wang, Lu
Zhang, Xiang
Chen, Yuncong
Liu, Yanchi
Cheng, Wei
Chen, Haifeng
PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 3513 - 3524
[9] Model predictive optimization for imitation learning from demonstrations
Hu, Yingbai
Cui, Mingyang
Duan, Jianghua
Liu, Wenjun
Huang, Dianye
Knoll, Alois
Chen, Guang
ROBOTICS AND AUTONOMOUS SYSTEMS, 2023, 163
[10] Learning from Suboptimal Demonstration via Trajectory-Ranked Adversarial Imitation
Chen, Luyao
Xie, Shaorong
Pang, Tao
Yu, Hang
Luo, Xiangfeng
Zhang, Zhenyu
2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 486 - 493

← 1 2 3 4 5 →