Contextual Action with Multiple Policies Inverse Reinforcement Learning for Behavior Simulation

被引：0

作者：

Alvarez, Nahum ^{[1
]}

Noda, Itsuki ^{[1
]}

机构：

[1] Natl Inst Adv Ind Sci & Technol, Tokyo, Japan

来源：

PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2 | 2019年

关键词：

Inverse Reinforcement Learning; Behavioral Agents; Pedestrian Simulation;

D O I：

10.5220/0007684908870894

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Machine learning is a discipline with many simulator-driven applications oriented to learn behavior. However, behavior simulation it comes with a number of associated difficulties, like the lack of a clear reward function, actions that depend of the state of the actor and the alternation of different policies. We present a method for behavior learning called Contextual Action Multiple Policy Inverse Reinforcement Learning (CAMP-IRL) that tackles those factors. Our method allows to extract multiple reward functions and generates different behavior profiles from them. We applied our method to a large scale crowd simulator using intelligent agents to imitate pedestrian behavior, making the virtual pedestrians able to switch between behaviors depending of the goal they have and navigating efficiently across unknown environments.

引用

页码：887 / 894

页数：8

共 50 条

[1] Inverse reinforcement learning in contextual MDPs
Belogolovsky, Stav
Korsunsky, Philip
Mannor, Shie
Tessler, Chen
Zahavy, Tom
MACHINE LEARNING, 2021, 110 (09) : 2295 - 2334
[2] Inverse reinforcement learning in contextual MDPs
Stav Belogolovsky
Philip Korsunsky
Shie Mannor
Chen Tessler
Tom Zahavy
Machine Learning, 2021, 110 : 2295 - 2334
[3] Adversarial Inverse Reinforcement Learning to Estimate Policies from Multiple Experts
Yamashita K.
Hamagami T.
Yamashita, Kodai, 2021, Institute of Electrical Engineers of Japan (141) : 1405 - 1410
[4] Learning Behavior Styles with Inverse Reinforcement Learning
Lee, Seong Jae
popovic, Zoran
ACM TRANSACTIONS ON GRAPHICS, 2010, 29 (04):
[5] Transfer in Inverse Reinforcement Learning for Multiple Strategies
Tanwani, Ajay Kumar
Billard, Aude
2013 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2013, : 3244 - 3250
[6] Modular inverse reinforcement learning for visuomotor behavior
Rothkopf, Constantin A.
Ballard, Dana H.
BIOLOGICAL CYBERNETICS, 2013, 107 (04) : 477 - 490
[7] Modular inverse reinforcement learning for visuomotor behavior
Constantin A. Rothkopf
Dana H. Ballard
Biological Cybernetics, 2013, 107 : 477 - 490
[8] Regularizing Action Policies for Smooth Control with Reinforcement Learning
Mysore, Siddharth
Mabsout, Bassel
Mancuso, Renato
Saenko, Kate
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 1810 - 1816
[9] Deep Reinforcement Learning Based Mobility Load Balancing Under Multiple Behavior Policies
Xu, Yue
Xu, Wenjun
Wang, Zhi
Lin, Jiaru
Cui, Shuguang
ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
[10] Contextual Reinforcement Learning
Langford, John
2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 3 - 3

← 1 2 3 4 5 →