Model-based Imitation Learning by Probabilistic Trajectory Matching

被引:0
|
作者
Englert, Peter [1 ]
Paraschos, Alexandros [1 ]
Peters, Jan [1 ]
Deisenroth, Marc Peter [1 ]
机构
[1] Tech Univ Darmstadt, Dept Comp Sci, Darmstadt, Germany
来源
2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2013年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the most elegant ways of teaching new skills to robots is to provide demonstrations of a task and let the robot imitate this behavior. Such imitation learning is a non-trivial task: Different anatomies of robot and teacher, and reduced robustness towards changes in the control task are two major difficulties in imitation learning. We present an imitation-learning approach to efficiently learn a task from expert demonstrations. Instead of finding policies indirectly, either via state-action mappings (behavioral cloning), or cost function learning (inverse reinforcement learning), our goal is to find policies directly such that predicted trajectories match observed ones. To achieve this aim, we model the trajectory of the teacher and the predicted robot trajectory by means of probability distributions. We match these distributions by minimizing their Kullback-Leibler divergence. In this paper, we propose to learn probabilistic forward models to compute a probability distribution over trajectories. We compare our approach to model-based reinforcement learning methods with hand-crafted cost functions. Finally, we evaluate our method with experiments on a real compliant robot.
引用
收藏
页码:1922 / 1927
页数:6
相关论文
共 50 条
  • [1] Probabilistic model-based imitation learning
    Englert, Peter
    Paraschos, Alexandros
    Deisenroth, Marc Peter
    Peters, Jan
    ADAPTIVE BEHAVIOR, 2013, 21 (05) : 388 - 403
  • [2] A Probabilistic Framework for Model-Based Imitation Learning
    Shon, Aaron P.
    Grimes, David B.
    Baker, Chris L.
    Rao, Rajesh P. N.
    PROCEEDINGS OF THE TWENTY-SIXTH ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, 2004, : 1237 - 1242
  • [3] GeoGail: A Model-Based Imitation Learning Framework for Human Trajectory Synthesizing
    Wu, Yuchen
    Wang, Huandong
    Gao, Changzheng
    Jin, Depeng
    Li, Yong
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2025, 19 (01)
  • [4] Model-Based Imitation Learning for Urban Driving
    Hu, Anthony
    Corrado, Gianluca
    Griffiths, Nicolas
    Murez, Zak
    Gurau, Corina
    Yeo, Hudson
    Kendall, Alex
    Cipolla, Roberto
    Shotton, Jamie
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [5] Imitation Game: A Model-based and Imitation Learning Deep Reinforcement Learning Hybrid
    Veith, Eric Msp
    Logemann, Torben
    Berezin, Aleksandr
    Wellssow, Arlena
    Balduin, Stephan
    2024 12TH WORKSHOP ON MODELING AND SIMULATION OF CYBER-PHYSICAL ENERGY SYSTEMS, MSCPES, 2024,
  • [6] Practical Probabilistic Model-Based Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory Sampling
    Huang, Wenjun
    Cui, Yunduan
    Li, Huiyun
    Wu, Xinyu
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [7] Model-Based Imitation Learning Using Entropy Regularization of Model and Policy
    Uchibe, Eiji
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 10922 - 10929
  • [8] MobILE: Model-Based Imitation Learning From Observation Alone
    Kidambi, Rahul
    Chang, Jonathan D.
    Sun, Wen
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [9] Hierarchical Model-Based Imitation Learning for Planning in Autonomous Driving
    Bronstein, Eli
    Palatucci, Mark
    Notz, Dominik
    White, Brandyn
    Kuefler, Alex
    Lu, Yiren
    Paul, Supratik
    Nikdel, Payam
    Mougin, Paul
    Chen, Hongge
    Fu, Justin
    Abrams, Austin
    Shah, Punit
    Racah, Evan
    Frenkel, Benjamin
    Whiteson, Shimon
    Anguelov, Dragomir
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 8652 - 8659
  • [10] Discriminator-Guided Model-Based Offline Imitation Learning
    Zhang, Wenjia
    Xu, Haoran
    Niu, Haoyi
    Cheng, Peng
    Li, Ming
    Zhang, Heming
    Zhou, Guyue
    Zhan, Xianyuan
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1266 - 1276