Variational Adversarial Kernel Learned Imitation Learning

被引:0
|
作者
Yang, Fan [1 ]
Vereshchaka, Alma [1 ]
Zhou, Yufan [1 ]
Chen, Changyou [1 ]
Dong, Wen [1 ]
机构
[1] SUNY Buffalo, Buffalo, NY 14260 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imitation learning refers to the problem where an agent learns to perform a task through observing and mimicking expert demonstrations, without knowledge of the cost function. State-of-the-art imitation learning algorithms reduce imitation learning to distribution-matching problems by minimizing some distance measures. However, the distance measure may not always provide informative signals for a policy update. To this end, we propose the variational adversarial kernel learned imitation learning (VAKLIL), which measures the distance using the maximum mean discrepancy with variational kernel learning. Our method optimizes over a large cost-function space and is sample efficient and robust to overfitting. We demonstrate the performance of our algorithm through benchmarking with four state-of-the-art imitation learning algorithms over five high-dimensional control tasks, and a complex transportation control task. Experimental results indicate that our algorithm significantly outperforms related algorithms in all scenarios.
引用
收藏
页码:6599 / 6606
页数:8
相关论文
共 50 条
  • [1] Visual Adversarial Imitation Learning using Variational Models
    Rafailov, Rafael
    Yu, Tianhe
    Rajeswaran, Aravind
    Finn, Chelsea
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [2] Distributional generative adversarial imitation learning with reproducing kernel generalization
    Zhou, Yirui
    Lu, Mengxiao
    Liu, Xiaowei
    Che, Zhengping
    Xu, Zhiyuan
    Tang, Jian
    Zhang, Yangchun
    Peng, Yan
    Peng, Yaxin
    NEURAL NETWORKS, 2023, 165 : 43 - 59
  • [3] Generative Adversarial Imitation Learning
    Ho, Jonathan
    Ermon, Stefano
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [4] What Matters for Adversarial Imitation Learning?
    Orsini, Manu
    Raichuk, Anton
    Hussenot, Leonard
    Vincent, Damien
    Dadashi, Robert
    Girgin, Sertan
    Geist, Matthieu
    Bachem, Olivier
    Pietquin, Olivier
    Andrychowicz, Marcin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [5] Quantum generative adversarial imitation learning
    Xiao, Tailong
    Huang, Jingzheng
    Li, Hongjing
    Fan, Jianping
    Zeng, Guihua
    NEW JOURNAL OF PHYSICS, 2023, 25 (03):
  • [6] DiffAIL: Diffusion Adversarial Imitation Learning
    Wang, Bingzheng
    Wu, Guoqiang
    Pang, Teng
    Zhang, Yan
    Yin, Yilong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 14, 2024, : 15447 - 15455
  • [7] Deterministic generative adversarial imitation learning
    Zuo, Guoyu
    Chen, Kexin
    Lu, Jiahao
    Huang, Xiangsheng
    NEUROCOMPUTING, 2020, 388 : 60 - 69
  • [8] Variational Adversarial Active Learning
    Sinha, Samarth
    Ebrahimi, Sayna
    Darrell, Trevor
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5971 - 5980
  • [9] A Bayesian Approach to Generative Adversarial Imitation Learning
    Jeon, Wonseok
    Seo, Seokin
    Kim, Kee-Eung
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [10] Sample-efficient Adversarial Imitation Learning
    Jung, Dahuin
    Lee, Hyungyu
    Yoon, Sungroh
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25