Transfer in Inverse Reinforcement Learning for Multiple Strategies

被引:0
|
作者
Tanwani, Ajay Kumar [1 ]
Billard, Aude [1 ]
机构
[1] Ecole Polytech Fed Lausanne, LASA, CH-1015 Lausanne, Switzerland
来源
2013 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2013年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the problem of incrementally learning different strategies of performing a complex sequential task from multiple demonstrations of an expert or a set of experts. While the task is the same, each expert differs in his/her way of performing it. We assume that this variety across experts' demonstration is due to the fact that each expert/strategy is driven by a different reward function, where reward function is expressed as a linear combination of a set of known features. Consequently, we can learn all the expert strategies by forming a convex set of optimal deterministic policies, from which one can match any unseen expert strategy drawn from this set. Instead of learning from scratch every optimal policy in this set, the learner transfers knowledge from the set of learned policies to bootstrap its search for new optimal policy. We demonstrate our approach on a simulated mini-golf task where the 7 degrees of freedom Barrett WAM robot arm learns to sequentially putt on different holes in accordance with the playing strategies of the expert.
引用
收藏
页码:3244 / 3250
页数:7
相关论文
共 50 条
  • [31] Inverse Reinforcement Learning with Gaussian Process
    Qiao, Qifeng
    Beling, Peter A.
    2011 AMERICAN CONTROL CONFERENCE, 2011, : 113 - 118
  • [32] Active Exploration for Inverse Reinforcement Learning
    Lindner, David
    Krause, Andreas
    Ramponi, Giorgia
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [33] Recent Advancements in Inverse Reinforcement Learning
    Metelli, Alberto Maria
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 20, 2024, : 22680 - 22680
  • [34] Multiagent Adversarial Inverse Reinforcement Learning
    Wei, Ermo
    Wicke, Drew
    Luke, Sean
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2265 - 2266
  • [35] Preference Elicitation and Inverse Reinforcement Learning
    Rothkopf, Constantin A.
    Dimitrakakis, Christos
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT III, 2011, 6913 : 34 - 48
  • [36] Inverse Reinforcement Learning with Constraint Recovery
    Das, Nirjhar
    Chattopadhyay, Arpan
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2023, 2023, 14301 : 179 - 188
  • [37] Training parsers by inverse reinforcement learning
    Neu, Gergely
    Szepesvari, Csaba
    MACHINE LEARNING, 2009, 77 (2-3) : 303 - 337
  • [38] A survey of inverse reinforcement learning techniques
    Shao Zhifei
    Joo, Er Meng
    INTERNATIONAL JOURNAL OF INTELLIGENT COMPUTING AND CYBERNETICS, 2012, 5 (03) : 293 - 311
  • [39] Inverse Reinforcement Learning for Strategy Identification
    Rucker, Mark
    Adams, Stephen
    Hayes, Roy
    Beling, Peter A.
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 3067 - 3074
  • [40] Inverse reinforcement learning in contextual MDPs
    Belogolovsky, Stav
    Korsunsky, Philip
    Mannor, Shie
    Tessler, Chen
    Zahavy, Tom
    MACHINE LEARNING, 2021, 110 (09) : 2295 - 2334