Path planning of mobile robot based on improved double deep Q-network algorithm

Authors
Wang, Zhenggang [1]
Song, Shuhong [1]
Cheng, Shenghui [1]
Affiliations
[1] Anhui Polytech Univ, Coll Elect Engn, Wuhu, Peoples R China
Source
FRONTIERS IN NEUROROBOTICS | 2025 / Volume 19
Keywords
deep reinforcement learning; mobile robot; path planning; BiLSTM; Dueling Network;
DOI
10.3389/fnbot.2025.1512953
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
To address the slow network convergence, unstable reward convergence, and low path planning efficiency of traditional deep reinforcement learning algorithms, this paper proposes BiLSTM-D3QN (Bidirectional Long Short-Term Memory Dueling Double Deep Q-Network), a path planning algorithm based on the DDQN (Double Deep Q-Network) decision model. First, a Bidirectional Long Short-Term Memory (BiLSTM) network is introduced to give the network memory, which increases the stability of decision making and makes the reward converge more stably. Second, a Dueling Network is introduced to further mitigate the overestimation of Q-values by the neural network, which allows the network to be updated quickly. Third, an adaptive reprioritized experience replay based on a frequency penalty function is proposed, which extracts important and fresh data from the experience pool to accelerate the convergence of the neural network. Finally, an adaptive action selection mechanism is introduced to further optimize action exploration. Simulation experiments show that, in simple environments, the BiLSTM-D3QN path planning algorithm outperforms the traditional deep reinforcement learning algorithm in network convergence speed, planning efficiency, stability of reward convergence, and success rate. In complex environments, compared with the improved ERDDQN (Experience Replay Double Deep Q-Network) algorithm, BiLSTM-D3QN produces a path that is 20 m shorter with 7 fewer turning points, plans 0.54 s faster, and achieves a 10.4% higher success rate. These results demonstrate the superiority of the BiLSTM-D3QN algorithm in network convergence speed and path planning performance.
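The abstract names two standard building blocks, the Dueling Network and the Double DQN target. The sketch below illustrates only their textbook forms in NumPy and is not the authors' implementation: the function names, array shapes, and the discount factor `gamma=0.99` are assumptions made for illustration, and the BiLSTM layer, the frequency-penalty experience replay, and the adaptive action selection are not shown.

```python
import numpy as np

def dueling_q(value, advantage):
    """Combine a state value V(s) with advantages A(s, a) into Q(s, a).

    value: shape (batch, 1); advantage: shape (batch, n_actions).
    Subtracting the mean advantage is the usual dueling aggregation;
    the paper's exact variant is not stated in the abstract.
    """
    return value + advantage - advantage.mean(axis=1, keepdims=True)

def double_dqn_target(reward, done, q_next_online, q_next_target, gamma=0.99):
    """Standard Double DQN target (gamma is an assumed value).

    The online network selects the next action and the target network
    evaluates it, which is what reduces Q-value overestimation.
    """
    best_action = np.argmax(q_next_online, axis=1)
    next_q = q_next_target[np.arange(len(best_action)), best_action]
    return reward + gamma * (1.0 - done) * next_q

# Hypothetical toy values, for illustration only.
q = dueling_q(np.array([[1.0]]), np.array([[1.0, 2.0, 3.0]]))
target = double_dqn_target(
    reward=np.array([1.0]),
    done=np.array([0.0]),
    q_next_online=np.array([[0.1, 0.9]]),
    q_next_target=np.array([[5.0, 2.0]]),
)
```

In the toy call, the online network prefers action 1, so the target uses the target network's estimate for action 1 (2.0) rather than its own maximum (5.0).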
Pages: 17