Pedestrian simulation as multi-objective reinforcement learning

Cited by: 5
Authors
Ravichandran, Naresh Balaji [1]
Yang, Fangkai [1]
Peters, Christopher [1]
Lansner, Anders [1]
Herman, Pawel [1]
Affiliations
[1] KTH Royal Inst Technol, Stockholm, Sweden
Source
18th ACM International Conference on Intelligent Virtual Agents (IVA'18) | 2018
Keywords
reinforcement learning; agent-based simulation; multi-objective learning; parallel learning; force model
DOI
10.1145/3267851.3267914
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Modelling and simulation of pedestrian crowds require agents to reach pre-determined goals and avoid collisions with static obstacles and dynamic pedestrians, while maintaining natural gait behaviour. We model pedestrians as autonomous, learning, and reactive agents employing Reinforcement Learning (RL). Typical RL-based agent simulations suffer from poor generalization because they rely on hand-crafted reward functions to ensure realistic behaviour. In this work, we model pedestrians in a modular framework that integrates the navigation and collision-avoidance tasks as separate modules. Each module has its own state space and reward, while all modules share a common action space. Empirical results suggest that such modular learning models can achieve satisfactory performance without parameter tuning, and we compare them with state-of-the-art crowd simulation methods.
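The modular idea in the abstract (separate state spaces and rewards per module, one shared action space) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which learning algorithm is used or how the modules' preferences are combined. The sketch assumes tabular Q-learning per module and assumes that the agent picks the action with the largest summed Q-value across modules. All names (`QModule`, `ModularAgent`, the state labels, and the hyperparameters) are hypothetical.

```python
import random
from collections import defaultdict


class QModule:
    """One tabular Q-learning module with its own state space and reward.

    Assumption: tabular Q-learning; the abstract does not name the algorithm.
    """

    def __init__(self, n_actions, alpha=0.1, gamma=0.9):
        self.alpha = alpha  # learning rate (hypothetical value)
        self.gamma = gamma  # discount factor (hypothetical value)
        self.q = defaultdict(lambda: [0.0] * n_actions)

    def values(self, state):
        return self.q[state]

    def update(self, state, action, reward, next_state):
        # Standard one-step Q-learning update using this module's own reward.
        target = reward + self.gamma * max(self.q[next_state])
        self.q[state][action] += self.alpha * (target - self.q[state][action])


class ModularAgent:
    """Agent whose modules share one action space.

    Assumption: actions are chosen by summing the modules' Q-values.
    """

    def __init__(self, module_names, n_actions, epsilon=0.1, seed=0):
        self.n_actions = n_actions
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        self.modules = {name: QModule(n_actions) for name in module_names}

    def combined_values(self, states):
        # states maps each module name to that module's own state.
        totals = [0.0] * self.n_actions
        for name, module in self.modules.items():
            for action, value in enumerate(module.values(states[name])):
                totals[action] += value
        return totals

    def act(self, states):
        # Epsilon-greedy over the summed Q-values.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_actions)
        totals = self.combined_values(states)
        return max(range(self.n_actions), key=totals.__getitem__)

    def learn(self, states, action, rewards, next_states):
        # Every module learns from the same executed action,
        # but from its own state and its own reward signal.
        for name, module in self.modules.items():
            module.update(states[name], action, rewards[name], next_states[name])


# Usage: two modules, three shared actions, purely greedy selection.
agent = ModularAgent(["navigation", "collision"], n_actions=3, epsilon=0.0)
states = {"navigation": "goal_ahead", "collision": "obstacle_ahead"}
agent.learn(states, 0, {"navigation": 1.0, "collision": -5.0}, states)
agent.learn(states, 1, {"navigation": 0.5, "collision": 0.0}, states)
chosen = agent.act(states)
```

In this toy run, action 0 is rewarded by the navigation module but penalized more strongly by the collision module, so the summed values favour action 1. No reward weights are tuned between the two objectives, which is the property the abstract highlights, although the paper's actual combination rule may differ.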
Pages: 307-312
Number of pages: 6