A path planning method based on deep reinforcement learning for crowd evacuation

被引:0
|
作者
Meng X. [1 ,2 ]
Liu H. [1 ,2 ]
Li W. [1 ,2 ]
机构
[1] School of Information Science and Engineering, Shandong Normal University, Jinan
[2] Shandong Provincial Key Laboratory for Novel Distributed Computer Software Technology, Jinan
基金
中国国家自然科学基金;
关键词
Crowd evacuation; Deep reinforcement learning; Optimized multi-agent deep deterministic policy gradient; Path planning;
D O I
10.1007/s12652-024-04787-x
中图分类号
学科分类号
摘要
Deep reinforcement learning (DRL) is suitable for solving complex path-planning problems due to its excellent ability to make continuous decisions in a complex environment. However, the increase in the population size in the crowd evacuation path-planning problem causes a substantial computational burden for the algorithm, which leads to an unsatisfactory efficiency of the current DRL algorithm. This paper presents a path planning method based on DRL for crowd evacuation to solve the problem. First, we divide crowds into groups based on their relationship and distance from each other and select leaders from them. Next, we expand the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) to propose an Optimized Multi-Agent Deep Deterministic Policy Gradient (OMADDPG) algorithm to obtain the global evacuation path. The OMADDPG algorithm uses the Cross-Entropy Method (CEM) to optimize policy and improve the neural network’s training efficiency by applying the Data Pruning (DP) algorithm. In addition, the social force model is improved, incorporating the relationship between individuals and psychological factors into the model. Finally, this paper combines the improved social force model and the OMADDPG algorithm. The OMADDPG algorithm transmits the path information to the leaders. Pedestrians in the environment are driven by the improved social force model to follow the leaders to complete the evacuation simulation. The method can use a leader to guide pedestrians safely arrive the exit and reduce evacuation time in different environments. The simulation results prove the efficiency of the path planning method. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.
引用
收藏
页码:2925 / 2939
页数:14
相关论文
共 50 条
  • [21] A decentralized path planning model based on deep reinforcement learning
    Guo, Dong
    Ji, Shouwen
    Yao, Yanke
    Chen, Cheng
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 117
  • [22] Deep deterministic policy gradient algorithm for crowd-evacuation path planning
    Li, Xinjin
    Liu, Hong
    Li, Junqing
    Li, Yan
    COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 161
  • [23] Research on Path Planning Algorithm for Crowd Evacuation
    Wang, Zhenfei
    Zhang, Chuchu
    Wang, Junfeng
    Zheng, Zhiyun
    Li, Lun
    SYMMETRY-BASEL, 2021, 13 (08):
  • [24] Crowd Evacuation Guidance Based on Combined Action Reinforcement Learning
    Xue, Yiran
    Wu, Rui
    Liu, Jiafeng
    Tang, Xianglong
    ALGORITHMS, 2021, 14 (01)
  • [25] A Guided-to-Autonomous Policy Learning method of Deep Reinforcement Learning in Path Planning
    Zhao, Wang
    Zhang, Ye
    Li, Haoyu
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION, ICCA 2024, 2024, : 665 - 672
  • [26] Multi-objective path planning based on deep reinforcement learning
    Xu, Jian
    Huang, Fei
    Cui, Yunfei
    Du, Xue
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 3273 - 3279
  • [27] Dynamic Scene Path Planning of UAVs Based on Deep Reinforcement Learning
    Tang, Jin
    Liang, Yangang
    Li, Kebo
    DRONES, 2024, 8 (02)
  • [28] AUV path planning based on improved IFDS and deep reinforcement learning
    Fan, Yiqun
    Li, Hongna
    Xie, Jiaqi
    Zhou, Yunfu
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2024, 21 (06):
  • [29] Path planning of robotic arm based on deep reinforcement learning algorithm
    Al-Gabalawy M.
    Advanced Control for Applications: Engineering and Industrial Systems, 2022, 4 (01):
  • [30] Ship path planning based on Deep Reinforcement Learning and weather forecast
    Artusi, Eva
    2021 22ND IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2021), 2021, : 258 - 260