Adversarial Imitation Learning with Trajectorial Augmentation and Correction

被引:6
|
作者
Antotsiou, Dafni [1 ]
Ciliberto, Carlo [1 ]
Kim, Tae-Kyun [1 ]
机构
[1] Imperial Coll London, EEE Dept, London, England
来源
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021) | 2021年
关键词
D O I
10.1109/ICRA48506.2021.9561915
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Imitation Learning requires a large number of expert demonstrations, which are not always easy to obtain, especially for complex tasks. A way to overcome this shortage of labels is through data augmentation. However, this cannot be easily applied to control tasks due to the sequential nature of the problem. In this work, we introduce a novel augmentation method which preserves the success of the augmented trajectories. To achieve this, we introduce a semi-supervised correction network that aims to correct distorted expert actions. To adequately test the abilities of the correction network, we develop an adversarial data augmented imitation architecture to train an imitation agent using synthetic experts. Additionally, we introduce a metric to measure diversity in trajectory datasets. Experiments show that our data augmentation strategy can improve accuracy and convergence time of adversarial imitation while preserving the diversity between the generated and real trajectories.
引用
收藏
页码:4724 / 4730
页数:7
相关论文
共 50 条
  • [31] TextGAIL: Generative Adversarial Imitation Learning for Text Generation
    Wu, Qingyang
    Li, Lei
    Yu, Zhou
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14067 - 14075
  • [32] Provably Efficient Adversarial Imitation Learning with Unknown Transitions
    Xu, Tian
    Li, Ziniu
    Yu, Yang
    Luo, Zhi-Quan
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 2367 - 2378
  • [33] Adversarial Option-Aware Hierarchical Imitation Learning
    Jing, Mingxuan
    Huang, Wenbing
    Sunk, Fuchun
    Ma, Xiaojian
    Kong, Tao
    Gan, Chuang
    Li, Lei
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [34] Complexity of bird song caused by adversarial imitation learning
    Seiya Yamazaki
    Hiroyuki Iizuka
    Masahito Yamamoto
    Artificial Life and Robotics, 2020, 25 : 124 - 132
  • [35] Adversarial Cooperative Imitation Learning for Dynamic Treatment Regimes
    Wang, Lu
    Yu, Wenchao
    Cheng, Wei
    Min, Martin Renqiang
    Zong, Bo
    He, Xiaofeng
    Zha, Hongyuan
    Wang, Wei
    Chen, Haifeng
    WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 1785 - 1795
  • [36] Learning Temporal Strategic Relationships using Generative Adversarial Imitation Learning
    Fernando, Tharindu
    Denman, Simon
    Sridharan, Sridha
    Fookes, Clinton
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 113 - 121
  • [37] Imitation Learning based on Data Augmentation for Robotic Reaching
    Hoshino, Satoshi
    Hisada, Tomoki
    Oikawa, Ryota
    2021 60TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2021, : 417 - 424
  • [38] Addressing Delays in Reinforcement Learning via Delayed Adversarial Imitation Learning
    Xie, Minzhi
    Xia, Bo
    Yu, Yalou
    Wang, Xueqian
    Chang, Yongzhe
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT III, 2023, 14256 : 271 - 282
  • [39] Adversarial Imitation Learning from State-only Demonstrations
    Torabi, Faraz
    Warnell, Garrett
    Stone, Peter
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2229 - 2231
  • [40] GAILPG: Multiagent Policy Gradient With Generative Adversarial Imitation Learning
    Li, Wei
    Huang, Shiyi
    Qiu, Ziming
    Song, Aiguo
    IEEE TRANSACTIONS ON GAMES, 2025, 17 (01) : 62 - 75