Adversarial Imitation Learning with Trajectorial Augmentation and Correction

被引：6

作者：

Antotsiou, Dafni ^{[1
]}

Ciliberto, Carlo ^{[1
]}

Kim, Tae-Kyun ^{[1
]}

机构：

[1] Imperial Coll London, EEE Dept, London, England

来源：

2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021) | 2021年

关键词：

D O I：

10.1109/ICRA48506.2021.9561915

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep Imitation Learning requires a large number of expert demonstrations, which are not always easy to obtain, especially for complex tasks. A way to overcome this shortage of labels is through data augmentation. However, this cannot be easily applied to control tasks due to the sequential nature of the problem. In this work, we introduce a novel augmentation method which preserves the success of the augmented trajectories. To achieve this, we introduce a semi-supervised correction network that aims to correct distorted expert actions. To adequately test the abilities of the correction network, we develop an adversarial data augmented imitation architecture to train an imitation agent using synthetic experts. Additionally, we introduce a metric to measure diversity in trajectory datasets. Experiments show that our data augmentation strategy can improve accuracy and convergence time of adversarial imitation while preserving the diversity between the generated and real trajectories.

引用

页码：4724 / 4730

页数：7

共 50 条

[31] TextGAIL: Generative Adversarial Imitation Learning for Text Generation
Wu, Qingyang
Li, Lei
Yu, Zhou
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14067 - 14075
[32] Provably Efficient Adversarial Imitation Learning with Unknown Transitions
Xu, Tian
Li, Ziniu
Yu, Yang
Luo, Zhi-Quan
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 2367 - 2378
[33] Adversarial Option-Aware Hierarchical Imitation Learning
Jing, Mingxuan
Huang, Wenbing
Sunk, Fuchun
Ma, Xiaojian
Kong, Tao
Gan, Chuang
Li, Lei
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[34] Complexity of bird song caused by adversarial imitation learning
Seiya Yamazaki
Hiroyuki Iizuka
Masahito Yamamoto
Artificial Life and Robotics, 2020, 25 : 124 - 132
[35] Adversarial Cooperative Imitation Learning for Dynamic Treatment Regimes
Wang, Lu
Yu, Wenchao
Cheng, Wei
Min, Martin Renqiang
Zong, Bo
He, Xiaofeng
Zha, Hongyuan
Wang, Wei
Chen, Haifeng
WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 1785 - 1795
[36] Learning Temporal Strategic Relationships using Generative Adversarial Imitation Learning
Fernando, Tharindu
Denman, Simon
Sridharan, Sridha
Fookes, Clinton
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 113 - 121
[37] Imitation Learning based on Data Augmentation for Robotic Reaching
Hoshino, Satoshi
Hisada, Tomoki
Oikawa, Ryota
2021 60TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2021, : 417 - 424
[38] Addressing Delays in Reinforcement Learning via Delayed Adversarial Imitation Learning
Xie, Minzhi
Xia, Bo
Yu, Yalou
Wang, Xueqian
Chang, Yongzhe
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT III, 2023, 14256 : 271 - 282
[39] Adversarial Imitation Learning from State-only Demonstrations
Torabi, Faraz
Warnell, Garrett
Stone, Peter
AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2229 - 2231
[40] GAILPG: Multiagent Policy Gradient With Generative Adversarial Imitation Learning
Li, Wei
Huang, Shiyi
Qiu, Ziming
Song, Aiguo
IEEE TRANSACTIONS ON GAMES, 2025, 17 (01) : 62 - 75

← 1 2 3 4 5 →