Adversarial Imitation Learning with Trajectorial Augmentation and Correction

被引:6
作者
Antotsiou, Dafni [1 ]
Ciliberto, Carlo [1 ]
Kim, Tae-Kyun [1 ]
机构
[1] Imperial Coll London, EEE Dept, London, England
来源
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021) | 2021年
关键词
D O I
10.1109/ICRA48506.2021.9561915
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Imitation Learning requires a large number of expert demonstrations, which are not always easy to obtain, especially for complex tasks. A way to overcome this shortage of labels is through data augmentation. However, this cannot be easily applied to control tasks due to the sequential nature of the problem. In this work, we introduce a novel augmentation method which preserves the success of the augmented trajectories. To achieve this, we introduce a semi-supervised correction network that aims to correct distorted expert actions. To adequately test the abilities of the correction network, we develop an adversarial data augmented imitation architecture to train an imitation agent using synthetic experts. Additionally, we introduce a metric to measure diversity in trajectory datasets. Experiments show that our data augmentation strategy can improve accuracy and convergence time of adversarial imitation while preserving the diversity between the generated and real trajectories.
引用
收藏
页码:4724 / 4730
页数:7
相关论文
共 46 条
[1]  
[Anonymous], 2017, CVPR
[2]  
[Anonymous], 2018, INT C LEARN REPR, DOI DOI 10.1002/PROT.25414
[3]  
[Anonymous], 1988, Advances in neural information processing systems
[4]  
[Anonymous], 2017, CVPR, DOI DOI 10.1109/CVPR.2017.696
[5]  
Antotsiou D., 2018, ECCV
[6]  
Bojarski Mariusz, 2016, arXiv
[7]  
Bowles C., 2018, CORR
[8]  
Brill M., 2019, DMKD
[9]  
Brockman Greg, 2016, arXiv
[10]  
Buhet T., 2019, ICCV