A path planning method based on deep reinforcement learning for crowd evacuation

被引:0
|
作者
Meng X. [1 ,2 ]
Liu H. [1 ,2 ]
Li W. [1 ,2 ]
机构
[1] School of Information Science and Engineering, Shandong Normal University, Jinan
[2] Shandong Provincial Key Laboratory for Novel Distributed Computer Software Technology, Jinan
基金
中国国家自然科学基金;
关键词
Crowd evacuation; Deep reinforcement learning; Optimized multi-agent deep deterministic policy gradient; Path planning;
D O I
10.1007/s12652-024-04787-x
中图分类号
学科分类号
摘要
Deep reinforcement learning (DRL) is suitable for solving complex path-planning problems due to its excellent ability to make continuous decisions in a complex environment. However, the increase in the population size in the crowd evacuation path-planning problem causes a substantial computational burden for the algorithm, which leads to an unsatisfactory efficiency of the current DRL algorithm. This paper presents a path planning method based on DRL for crowd evacuation to solve the problem. First, we divide crowds into groups based on their relationship and distance from each other and select leaders from them. Next, we expand the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) to propose an Optimized Multi-Agent Deep Deterministic Policy Gradient (OMADDPG) algorithm to obtain the global evacuation path. The OMADDPG algorithm uses the Cross-Entropy Method (CEM) to optimize policy and improve the neural network’s training efficiency by applying the Data Pruning (DP) algorithm. In addition, the social force model is improved, incorporating the relationship between individuals and psychological factors into the model. Finally, this paper combines the improved social force model and the OMADDPG algorithm. The OMADDPG algorithm transmits the path information to the leaders. Pedestrians in the environment are driven by the improved social force model to follow the leaders to complete the evacuation simulation. The method can use a leader to guide pedestrians safely arrive the exit and reduce evacuation time in different environments. The simulation results prove the efficiency of the path planning method. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.
引用
收藏
页码:2925 / 2939
页数:14
相关论文
共 50 条
  • [1] Crowd evacuation path planning and simulation method based on deep reinforcement learning and repulsive force field
    Wang, Hongyue
    Liu, Hong
    Li, Wenhao
    APPLIED INTELLIGENCE, 2025, 55 (04)
  • [2] AFSA based path planning method for crowd evacuation
    Lu, Dianjie
    Zhang, Guijuan
    Liu, Yiliang
    Wang, Dequan
    Liu, Hong
    Journal of Information and Computational Science, 2014, 11 (11): : 3815 - 3823
  • [3] A double-layer crowd evacuation simulation method based on deep reinforcement learning
    Zhang, Yong
    Yang, Bo
    Zhu, Jianlin
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2024, 35 (03)
  • [4] A UAV Path Planning Method Based on Deep Reinforcement Learning
    Li, Yibing
    Zhang, Sitong
    Ye, Fang
    Jiang, Tao
    Li, Yingsong
    2020 IEEE USNC-CNC-URSI NORTH AMERICAN RADIO SCIENCE MEETING (JOINT WITH AP-S SYMPOSIUM), 2020, : 93 - 94
  • [5] Improved Robot Path Planning Method Based on Deep Reinforcement Learning
    Han, Huiyan
    Wang, Jiaqi
    Kuang, Liqun
    Han, Xie
    Xue, Hongxin
    SENSORS, 2023, 23 (12)
  • [6] Path planning of manipulator based on deep reinforcement learning and screw method
    Wang Y.
    Wang Y.-H.
    Yin Z.-Z.
    Wan P.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2023, 40 (03): : 516 - 524
  • [7] An entropy-based path planning method for crowd evacuation in complex environments
    Dong, Shiyu
    Huang, Ping
    Wu, Fan
    Wang, Wei
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION, ICCA 2024, 2024, : 954 - 959
  • [8] Robot path planning based on deep reinforcement learning
    Long, Yinxin
    He, Huajin
    2020 IEEE CONFERENCE ON TELECOMMUNICATIONS, OPTICS AND COMPUTER SCIENCE (TOCS), 2020, : 151 - 154
  • [9] Robot Path Planning Based on Deep Reinforcement Learning
    Zhang, Rui
    Jiang, Yuhao
    Wu Fenghua
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 1697 - 1701
  • [10] Crowd Evacuation Simulation Using Hierarchical Deep Reinforcement Learning
    Zhang, Zheng
    Lu, Dianjie
    Li, Jialiuyuan
    Liu, Pingshan
    Zhang, Guijuan
    PROCEEDINGS OF THE 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2021, : 563 - 568