A path planning method based on deep reinforcement learning for crowd evacuation

Cited by: 0

Authors:

Meng X. [1,2]

Liu H. [1,2]

Li W. [1,2]

Affiliations:

[1] School of Information Science and Engineering, Shandong Normal University, Jinan

[2] Shandong Provincial Key Laboratory for Novel Distributed Computer Software Technology, Jinan

Source:

Journal of Ambient Intelligence and Humanized Computing | 2024, Vol. 15, No. 6

Funding:

National Natural Science Foundation of China

Keywords:

Crowd evacuation; Deep reinforcement learning; Optimized multi-agent deep deterministic policy gradient; Path planning

DOI:

10.1007/s12652-024-04787-x

Abstract:

Deep reinforcement learning (DRL) is well suited to complex path-planning problems because of its ability to make continuous decisions in a complex environment. However, in crowd evacuation path planning, a larger population imposes a substantial computational burden on the algorithm, which makes current DRL algorithms inefficient. This paper presents a DRL-based path planning method for crowd evacuation to address this problem. First, we divide the crowd into groups based on the relationships and distances between individuals and select a leader from each group. Next, we extend the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm to propose an Optimized Multi-Agent Deep Deterministic Policy Gradient (OMADDPG) algorithm that obtains the global evacuation path. The OMADDPG algorithm uses the Cross-Entropy Method (CEM) to optimize the policy and applies the Data Pruning (DP) algorithm to improve the neural network's training efficiency. In addition, we improve the social force model by incorporating the relationships between individuals and psychological factors. Finally, we combine the improved social force model with the OMADDPG algorithm: OMADDPG transmits the path information to the leaders, and the improved social force model drives the other pedestrians to follow the leaders and complete the evacuation simulation. The method uses leaders to guide pedestrians safely to the exit and reduces evacuation time in different environments. The simulation results demonstrate the efficiency of the path planning method. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.
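
The abstract names the Cross-Entropy Method (CEM) as the policy optimizer but does not say how it is wired into OMADDPG. As a rough illustration only, the sketch below shows the generic CEM loop (sample candidates, keep the best-scoring "elite" fraction, refit the sampling distribution) on a toy objective. The function names, the hyperparameters, and the toy "exit" target are all assumptions made for this example and are not taken from the paper.

```python
import random
import statistics


def cross_entropy_method(score, dim, iterations=30, population=50,
                         elite_frac=0.2, init_std=5.0, seed=0):
    """Generic CEM: search for a real vector of length `dim` that maximizes `score`.

    All names and defaults are illustrative assumptions, not the paper's settings.
    """
    rng = random.Random(seed)
    mean = [0.0] * dim
    std = [init_std] * dim
    n_elite = max(2, int(population * elite_frac))
    for _ in range(iterations):
        # Sample candidate parameter vectors from the current diagonal Gaussian.
        samples = [[rng.gauss(m, s) for m, s in zip(mean, std)]
                   for _ in range(population)]
        # Keep the best-scoring candidates (the elite set).
        elites = sorted(samples, key=score, reverse=True)[:n_elite]
        # Refit the Gaussian to the elites; the floor keeps std from reaching zero.
        columns = list(zip(*elites))
        mean = [statistics.fmean(c) for c in columns]
        std = [max(statistics.pstdev(c), 1e-3) for c in columns]
    return mean


def toy_score(params):
    # Hypothetical objective: negative squared distance to a target at (3, -2).
    return -((params[0] - 3.0) ** 2 + (params[1] + 2.0) ** 2)


best = cross_entropy_method(toy_score, dim=2)
print(best)  # should end up close to the target (3, -2)
```

In a reinforcement learning setting, `score` would typically be the return obtained by a policy with the sampled parameters; that mapping is left out here because the abstract does not describe it.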

Pages: 2925-2939

Page count: 14
