Contextual Action with Multiple Policies Inverse Reinforcement Learning for Behavior Simulation

Cited by: 0

Authors:

Alvarez, Nahum [1]

Noda, Itsuki [1]

Affiliations:

[1] Natl Inst Adv Ind Sci & Technol, Tokyo, Japan

Source:

PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2 | 2019

Keywords:

Inverse Reinforcement Learning; Behavioral Agents; Pedestrian Simulation

DOI:

10.5220/0007684908870894

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Machine learning is a discipline with many simulator-driven applications oriented toward learning behavior. However, behavior simulation comes with a number of associated difficulties, such as the lack of a clear reward function, actions that depend on the state of the actor, and the alternation of different policies. We present a method for behavior learning called Contextual Action Multiple Policy Inverse Reinforcement Learning (CAMP-IRL) that tackles those factors. Our method allows the extraction of multiple reward functions and generates different behavior profiles from them. We applied our method to a large-scale crowd simulator using intelligent agents to imitate pedestrian behavior, making the virtual pedestrians able to switch between behaviors depending on their goal and to navigate efficiently across unknown environments.

Pages: 887-894

Page count: 8

