Model-based Imitation Learning by Probabilistic Trajectory Matching

被引：0

作者：

Englert, Peter ^{[1
]}

Paraschos, Alexandros ^{[1
]}

Peters, Jan ^{[1
]}

Deisenroth, Marc Peter ^{[1
]}

机构：

[1] Tech Univ Darmstadt, Dept Comp Sci, Darmstadt, Germany

来源：

2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2013年

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

One of the most elegant ways of teaching new skills to robots is to provide demonstrations of a task and let the robot imitate this behavior. Such imitation learning is a non-trivial task: Different anatomies of robot and teacher, and reduced robustness towards changes in the control task are two major difficulties in imitation learning. We present an imitation-learning approach to efficiently learn a task from expert demonstrations. Instead of finding policies indirectly, either via state-action mappings (behavioral cloning), or cost function learning (inverse reinforcement learning), our goal is to find policies directly such that predicted trajectories match observed ones. To achieve this aim, we model the trajectory of the teacher and the predicted robot trajectory by means of probability distributions. We match these distributions by minimizing their Kullback-Leibler divergence. In this paper, we propose to learn probabilistic forward models to compute a probability distribution over trajectories. We compare our approach to model-based reinforcement learning methods with hand-crafted cost functions. Finally, we evaluate our method with experiments on a real compliant robot.

引用

页码：1922 / 1927

页数：6

共 50 条

[1] Probabilistic model-based imitation learning
Englert, Peter
Paraschos, Alexandros
Deisenroth, Marc Peter
Peters, Jan
ADAPTIVE BEHAVIOR, 2013, 21 (05) : 388 - 403
[2] A Probabilistic Framework for Model-Based Imitation Learning
Shon, Aaron P.
Grimes, David B.
Baker, Chris L.
Rao, Rajesh P. N.
PROCEEDINGS OF THE TWENTY-SIXTH ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, 2004, : 1237 - 1242
[3] GeoGail: A Model-Based Imitation Learning Framework for Human Trajectory Synthesizing
Wu, Yuchen
Wang, Huandong
Gao, Changzheng
Jin, Depeng
Li, Yong
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2025, 19 (01)
[4] Model-Based Imitation Learning for Urban Driving
Hu, Anthony
Corrado, Gianluca
Griffiths, Nicolas
Murez, Zak
Gurau, Corina
Yeo, Hudson
Kendall, Alex
Cipolla, Roberto
Shotton, Jamie
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[5] Imitation Game: A Model-based and Imitation Learning Deep Reinforcement Learning Hybrid
Veith, Eric Msp
Logemann, Torben
Berezin, Aleksandr
Wellssow, Arlena
Balduin, Stephan
2024 12TH WORKSHOP ON MODELING AND SIMULATION OF CYBER-PHYSICAL ENERGY SYSTEMS, MSCPES, 2024,
[6] Practical Probabilistic Model-Based Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory Sampling
Huang, Wenjun
Cui, Yunduan
Li, Huiyun
Wu, Xinyu
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
[7] Model-Based Imitation Learning Using Entropy Regularization of Model and Policy
Uchibe, Eiji
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 10922 - 10929
[8] MobILE: Model-Based Imitation Learning From Observation Alone
Kidambi, Rahul
Chang, Jonathan D.
Sun, Wen
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[9] Hierarchical Model-Based Imitation Learning for Planning in Autonomous Driving
Bronstein, Eli
Palatucci, Mark
Notz, Dominik
White, Brandyn
Kuefler, Alex
Lu, Yiren
Paul, Supratik
Nikdel, Payam
Mougin, Paul
Chen, Hongge
Fu, Justin
Abrams, Austin
Shah, Punit
Racah, Evan
Frenkel, Benjamin
Whiteson, Shimon
Anguelov, Dragomir
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 8652 - 8659
[10] Discriminator-Guided Model-Based Offline Imitation Learning
Zhang, Wenjia
Xu, Haoran
Niu, Haoyi
Cheng, Peng
Li, Ming
Zhang, Heming
Zhou, Guyue
Zhan, Xianyuan
CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1266 - 1276

← 1 2 3 4 5 →