Contextual Action with Multiple Policies Inverse Reinforcement Learning for Behavior Simulation

被引:0
|
作者
Alvarez, Nahum [1 ]
Noda, Itsuki [1 ]
机构
[1] Natl Inst Adv Ind Sci & Technol, Tokyo, Japan
来源
PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2 | 2019年
关键词
Inverse Reinforcement Learning; Behavioral Agents; Pedestrian Simulation;
D O I
10.5220/0007684908870894
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning is a discipline with many simulator-driven applications oriented to learn behavior. However, behavior simulation it comes with a number of associated difficulties, like the lack of a clear reward function, actions that depend of the state of the actor and the alternation of different policies. We present a method for behavior learning called Contextual Action Multiple Policy Inverse Reinforcement Learning (CAMP-IRL) that tackles those factors. Our method allows to extract multiple reward functions and generates different behavior profiles from them. We applied our method to a large scale crowd simulator using intelligent agents to imitate pedestrian behavior, making the virtual pedestrians able to switch between behaviors depending of the goal they have and navigating efficiently across unknown environments.
引用
收藏
页码:887 / 894
页数:8
相关论文
共 50 条
  • [1] Inverse reinforcement learning in contextual MDPs
    Belogolovsky, Stav
    Korsunsky, Philip
    Mannor, Shie
    Tessler, Chen
    Zahavy, Tom
    MACHINE LEARNING, 2021, 110 (09) : 2295 - 2334
  • [2] Inverse reinforcement learning in contextual MDPs
    Stav Belogolovsky
    Philip Korsunsky
    Shie Mannor
    Chen Tessler
    Tom Zahavy
    Machine Learning, 2021, 110 : 2295 - 2334
  • [3] Adversarial Inverse Reinforcement Learning to Estimate Policies from Multiple Experts
    Yamashita K.
    Hamagami T.
    Yamashita, Kodai, 2021, Institute of Electrical Engineers of Japan (141) : 1405 - 1410
  • [4] Learning Behavior Styles with Inverse Reinforcement Learning
    Lee, Seong Jae
    popovic, Zoran
    ACM TRANSACTIONS ON GRAPHICS, 2010, 29 (04):
  • [5] Transfer in Inverse Reinforcement Learning for Multiple Strategies
    Tanwani, Ajay Kumar
    Billard, Aude
    2013 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2013, : 3244 - 3250
  • [6] Modular inverse reinforcement learning for visuomotor behavior
    Rothkopf, Constantin A.
    Ballard, Dana H.
    BIOLOGICAL CYBERNETICS, 2013, 107 (04) : 477 - 490
  • [7] Modular inverse reinforcement learning for visuomotor behavior
    Constantin A. Rothkopf
    Dana H. Ballard
    Biological Cybernetics, 2013, 107 : 477 - 490
  • [8] Regularizing Action Policies for Smooth Control with Reinforcement Learning
    Mysore, Siddharth
    Mabsout, Bassel
    Mancuso, Renato
    Saenko, Kate
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 1810 - 1816
  • [9] Deep Reinforcement Learning Based Mobility Load Balancing Under Multiple Behavior Policies
    Xu, Yue
    Xu, Wenjun
    Wang, Zhi
    Lin, Jiaru
    Cui, Shuguang
    ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [10] Contextual Reinforcement Learning
    Langford, John
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 3 - 3