Heterogeneous Crowd Simulation Using Parametric Reinforcement Learning

被引：12

作者：

Hu, Kaidong ^{[1
]}

Haworth, Brandon ^{[2
]}

Berseth, Glen ^{[3
]}

Pavlovic, Vladimir ^{[1
]}

Faloutsos, Petros ^{[4
,5
]}

Kapadia, Mubbasir ^{[1
]}

机构：

[1] Univ Victoria, Dept Comp Sci, Victoria, BC V8P 5C2, Canada

[2] Univ Victoria, Dept Comp Sci, Victoria, BC V8P 5C2, Canada

[3] Univ Montreal, Dept Comp Sci & Operat Res, MILA, Montreal, PQ H3T 1J4, Canada

[4] York Univ, Dept Elect Engn & Comp Sci, Toronto, ON M3J 1P3, Canada

[5] Univ Hlth Network, Toronto Rehabil Inst, Toronto, ON M5G 2A2, Canada

来源：

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS | 2023年 / 29卷 / 04期

基金：

加拿大自然科学与工程研究理事会;

关键词：

Computational modeling; Reinforcement learning; Neural networks; Collision avoidance; Training; Predictive models; Navigation; Multi-agent navigation; crowd simulation; reinforcement learning; parametric policy learning; NEURAL-NETWORK; NAVIGATION; MODEL;

D O I：

10.1109/TVCG.2021.3139031

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Agent-based synthetic crowd simulation affords the cost-effective large-scale simulation and animation of interacting digital humans. Model-based approaches have successfully generated a plethora of simulators with a variety of foundations. However, prior approaches have been based on statically defined models predicated on simplifying assumptions, limited video-based datasets, or homogeneous policies. Recent works have applied reinforcement learning to learn policies for navigation. However, these approaches may learn static homogeneous rules, are typically limited in their generalization to trained scenarios, and limited in their usability in synthetic crowd domains. In this article, we present a multi-agent reinforcement learning-based approach that learns a parametric predictive collision avoidance and steering policy. We show that training over a parameter space produces a flexible model across crowd configurations. That is, our goal-conditioned approach learns a parametric policy that affords heterogeneous synthetic crowds. We propose a model-free approach without centralization of internal agent information, control signals, or agent communication. The model is extensively evaluated. The results show policy generalization across unseen scenarios, agent parameters, and out-of-distribution parameterizations. The learned model has comparable computational performance to traditional methods. Qualitatively the model produces both expected (laminar flow, shuffling, bottleneck) and unexpected (side-stepping) emergent qualitative behaviours, and quantitatively the approach is performant across measures of movement quality.

引用

页码：2036 / 2052

页数：17

共 89 条

[1] Social LSTM: Human Trajectory Prediction in Crowded Spaces
Alahi, Alexandre
Goel, Kratarth
Ramanathan, Vignesh
Robicquet, Alexandre
Li Fei-Fei
Savarese, Silvio
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 961 - 971
[2] Social Ways: Learning Multi-Modal Distributions of Pedestrian Trajectories with GANs
Amirian, Javad
Hayet, Jean-Bernard
Pettre, Julien
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 2964 - 2972
[3] [Anonymous], P ACM SIGGRAPH
[4] [Anonymous], 2011, P 2011 ACM SIGGRAPH, DOI DOI 10.1145/2019406.2019414
[5] B HAWORTH., 2015, P 8 ACM SIGGRAPH C M, P91
[6] Berseth G., 2015, Computer Animation and Virtual Worlds
[7] Berseth Glen., 2014, P ACM SIGGRAPH EUR S, P113
[8] Virtual Crowds: An LSTM-Based Framework for Crowd Simulation
Bisagno, Niccolo
Garau, Nicola
Montagner, Andrea
Conci, Nicola
[J]. IMAGE ANALYSIS AND PROCESSING - ICIAP 2019, PT I, 2019, 11751 : 117 - 127
[9] Brito B, 2020, Arxiv, DOI arXiv:2010.09056
[10] A Perceptually-Validated Metric for Crowd Trajectory Quality Evaluation
Cabrero Daniel, Beatriz
Marques, Ricardo
Hoyet, Ludovic
Pettre, Julien
Blat, Josep
[J]. PROCEEDINGS OF THE ACM ON COMPUTER GRAPHICS AND INTERACTIVE TECHNIQUES, 2021, 4 (03)

← 1 2 3 4 5 6 7 8 9 →