Contextual Action with Multiple Policies Inverse Reinforcement Learning for Behavior Simulation

被引:0
|
作者
Alvarez, Nahum [1 ]
Noda, Itsuki [1 ]
机构
[1] Natl Inst Adv Ind Sci & Technol, Tokyo, Japan
来源
PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2 | 2019年
关键词
Inverse Reinforcement Learning; Behavioral Agents; Pedestrian Simulation;
D O I
10.5220/0007684908870894
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning is a discipline with many simulator-driven applications oriented to learn behavior. However, behavior simulation it comes with a number of associated difficulties, like the lack of a clear reward function, actions that depend of the state of the actor and the alternation of different policies. We present a method for behavior learning called Contextual Action Multiple Policy Inverse Reinforcement Learning (CAMP-IRL) that tackles those factors. Our method allows to extract multiple reward functions and generates different behavior profiles from them. We applied our method to a large scale crowd simulator using intelligent agents to imitate pedestrian behavior, making the virtual pedestrians able to switch between behaviors depending of the goal they have and navigating efficiently across unknown environments.
引用
收藏
页码:887 / 894
页数:8
相关论文
共 50 条
  • [21] Inverse Contextual Bandits: Learning How Behavior Evolves over Time
    Huyuk, Alihan
    Jarrett, Daniel
    van der Schaar, Mihaela
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [22] Predicting driving behavior using inverse reinforcement learning with multiple reward functions towards environmental diversity
    Shimosaka, Masamichi
    Nishi, Kentaro
    Sato, Junichi
    Kataoka, Hirokatsu
    2015 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2015, : 567 - 572
  • [23] Robust Bayesian Inverse Reinforcement Learning with Sparse Behavior Noise
    Zheng, Jiangchuan
    Liu, Siyuan
    Ni, Lionel M.
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 2198 - 2205
  • [24] Modeling Driver Behavior using Adversarial Inverse Reinforcement Learning
    Sackmann, Moritz
    Bey, Henrik
    Hofmann, Ulrich
    Thielecke, Joern
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 1683 - 1690
  • [25] Reconfigurable Embedded Devices Using Reinforcement Learning to Develop Action Policies
    Burger, Alwyn
    Schiele, Gregor
    King, David W.
    ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2021, 15 (04)
  • [26] Reinforcement learning in discrete action space applied to inverse defect design
    Loeffler, Troy D.
    Banik, Suvo
    Patra, Tarak K.
    Sternberg, Michael
    Sankaranarayanan, Subramanian K. R. S.
    JOURNAL OF PHYSICS COMMUNICATIONS, 2021, 5 (03):
  • [27] Objective-aware Traffic Simulation via Inverse Reinforcement Learning
    Zheng, Guanjie
    Liu, Hanyang
    Xu, Kai
    Li, Zhenhui
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3771 - 3777
  • [28] Learning Curriculum Policies for Reinforcement Learning
    Narvekar, Sanmit
    Stone, Peter
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 25 - 33
  • [29] Multiple-Step Greedy Policies in Online and Approximate Reinforcement Learning
    Efroni, Yonathan
    Dalal, Gal
    Scherrer, Bruno
    Mannor, Shie
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [30] Federated Reinforcement Learning for Training Control Policies on Multiple IoT Devices
    Lim, Hyun-Kyo
    Kim, Ju-Bong
    Heo, Joo-Seong
    Han, Youn-Hee
    SENSORS, 2020, 20 (05)