A path planning method based on deep reinforcement learning for crowd evacuation

被引：0

作者：

Meng X. ^{[1
,2
]}

Liu H. ^{[1
,2
]}

Li W. ^{[1
,2
]}

机构：

[1] School of Information Science and Engineering, Shandong Normal University, Jinan

[2] Shandong Provincial Key Laboratory for Novel Distributed Computer Software Technology, Jinan

来源：

Journal of Ambient Intelligence and Humanized Computing | 2024年 / 15卷 / 6期

基金：

中国国家自然科学基金;

关键词：

Crowd evacuation; Deep reinforcement learning; Optimized multi-agent deep deterministic policy gradient; Path planning;

D O I：

10.1007/s12652-024-04787-x

中图分类号：

学科分类号：

摘要：

Deep reinforcement learning (DRL) is suitable for solving complex path-planning problems due to its excellent ability to make continuous decisions in a complex environment. However, the increase in the population size in the crowd evacuation path-planning problem causes a substantial computational burden for the algorithm, which leads to an unsatisfactory efficiency of the current DRL algorithm. This paper presents a path planning method based on DRL for crowd evacuation to solve the problem. First, we divide crowds into groups based on their relationship and distance from each other and select leaders from them. Next, we expand the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) to propose an Optimized Multi-Agent Deep Deterministic Policy Gradient (OMADDPG) algorithm to obtain the global evacuation path. The OMADDPG algorithm uses the Cross-Entropy Method (CEM) to optimize policy and improve the neural network’s training efficiency by applying the Data Pruning (DP) algorithm. In addition, the social force model is improved, incorporating the relationship between individuals and psychological factors into the model. Finally, this paper combines the improved social force model and the OMADDPG algorithm. The OMADDPG algorithm transmits the path information to the leaders. Pedestrians in the environment are driven by the improved social force model to follow the leaders to complete the evacuation simulation. The method can use a leader to guide pedestrians safely arrive the exit and reduce evacuation time in different environments. The simulation results prove the efficiency of the path planning method. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.

引用

页码：2925 / 2939

页数：14

共 50 条

[21] Dynamic Scene Path Planning of UAVs Based on Deep Reinforcement Learning
Tang, Jin
Liang, Yangang
Li, Kebo
DRONES, 2024, 8 (02)
[22] AUV path planning based on improved IFDS and deep reinforcement learning
Fan, Yiqun
Li, Hongna
Xie, Jiaqi
Zhou, Yunfu
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2024, 21 (06):
[23] Data-driven crowd evacuation: A reinforcement learning method
Yao, Zhenzhen
Zhang, Guijuan
Lu, Dianjie
Liu, Hong
NEUROCOMPUTING, 2019, 366 : 314 - 327
[24] Path Planning for Mobile Robot Based on Deep Reinforcement Learning and Fuzzy Control
Liu, Chunling
Xu, Jun
Guo, Kaiwen
2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 533 - 537
[25] UCAV Path Planning Algorithm Based on Deep Reinforcement Learning
Zheng, Kaiyuan
Gao, Jingpeng
Shen, Liangxi
IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 702 - 714
[26] Deep reinforcement learning-based path planning of underactuated surface vessels
Xu H.
Wang N.
Zhao H.
Zheng Z.
Cyber-Physical Systems, 2019, 5 (01): : 1 - 17
[27] Path planning in an unknown environment based on deep reinforcement learning with prior knowledge
Lou, Ping
Xu, Kun
Jiang, Xuemei
Xiao, Zheng
Yan, Junwei
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (06) : 5773 - 5789
[28] Dynamic Path Planning for Mobile Robots with Deep Reinforcement Learning
Yang, Laiyi
Bi, Jing
Yuan, Haitao
IFAC PAPERSONLINE, 2022, 55 (11): : 19 - 24
[29] Deep Reinforcement Learning for Indoor Mobile Robot Path Planning
Gao, Junli
Ye, Weijie
Guo, Jing
Li, Zhongjuan
SENSORS, 2020, 20 (19) : 1 - 15
[30] Path planning of stratospheric airship in dynamic wind field based on deep reinforcement learning
Zheng, Baojin
Zhu, Ming
Guo, Xiao
Ou, Jiajun
Yuan, Jiace
AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 150

← 1 2 3 4 5 →