Path planning of mobile robot based on improved double deep Q-network algorithm

被引：0

作者：

Wang, Zhenggang ^{[1
]}

Song, Shuhong ^{[1
]}

Cheng, Shenghui ^{[1
]}

机构：

[1] Anhui Polytech Univ, Coll Elect Engn, Wuhu, Peoples R China

来源：

FRONTIERS IN NEUROROBOTICS | 2025年 / 19卷

关键词：

deep reinforcement learning; mobile robot; path planning; BiLSTM; Dueling Network;

D O I：

10.3389/fnbot.2025.1512953

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Aiming at the problems of slow network convergence, poor reward convergence stability, and low path planning efficiency of traditional deep reinforcement learning algorithms, this paper proposes a BiLSTM-D3QN (Bidirectional Long and Short-Term Memory Dueling Double Deep Q-Network) path planning algorithm based on the DDQN (Double Deep Q-Network) decision model. Firstly, a Bidirectional Long Short-Term Memory network (BiLSTM) is introduced to make the network have memory, increase the stability of decision making and make the reward converge more stably; secondly, Dueling Network is introduced to further solve the problem of overestimating the Q-value of the neural network, which makes the network able to be updated quickly; Adaptive reprioritization based on the frequency penalty function is proposed. Experience Playback, which extracts important and fresh data from the experience pool to accelerate the convergence of the neural network; finally, an adaptive action selection mechanism is introduced to further optimize the action exploration. Simulation experiments show that the BiLSTM-D3QN path planning algorithm outperforms the traditional Deep Reinforcement Learning algorithm in terms of network convergence speed, planning efficiency, stability of reward convergence, and success rate in simple environments; in complex environments, the path length of BiLSTM-D3QN is 20 m shorter than that of the improved ERDDQN (Experience Replay Double Deep Q-Network) algorithm, the number of turning points is 7 fewer, the planning time is 0.54 s shorter, and the success rate is 10.4% higher. The superiority of the BiLSTM-D3QN algorithm in terms of network convergence speed and path planning performance is demonstrated.

引用

页数：17

共 50 条

[1] Path Planning Method for Mobile Robot Based on Curiosity Distillation Double Q-Network
Zhang, Feng
Gu, Qiran
Yuan, Shuai
Computer Engineering and Applications, 2023, 59 (19) : 316 - 322
[2] Transport robot path planning based on an advantage dueling double deep Q-network
He Q.
Wang Q.
Li J.
Wang Z.
Wang T.
Qinghua Daxue Xuebao/Journal of Tsinghua University, 2022, 62 (11): : 1751 - 1757
[3] Path Planning for Mobile Robot Considering Turnabouts on Narrow Road by Deep Q-Network
Nakamura, Tomoaki
Kobayashi, Masato
Motoi, Naoki
IEEE ACCESS, 2023, 11 : 19111 - 19121
[4] A Novel Path Planning Approach for Mobile Robot in Radioactive Environment Based on Improved Deep Q Network Algorithm
Wu, Zhiqiang
Yin, Yebo
Liu, Jie
Zhang, De
Chen, Jie
Jiang, Wei
SYMMETRY-BASEL, 2023, 15 (11):
[5] UAV Coverage Path Planning With Limited Battery Energy Based on Improved Deep Double Q-network
Ni, Jianjun
Gu, Yu
Gu, Yang
Zhao, Yonghao
Shi, Pengfei
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2024, 22 (08) : 2591 - 2601
[6] Mobile robot path planning based on improved Q learning algorithm
Peng, Jiansheng
International Journal of Multimedia and Ubiquitous Engineering, 2015, 10 (07): : 285 - 294
[7] Improved Double Deep Q-Network Algorithm Applied to Multi-Dimensional Environment Path Planning of Hexapod Robots
Chen, Liuhongxu
Wang, Qibiao
Deng, Chao
Xie, Bo
Tuo, Xianguo
Jiang, Gang
SENSORS, 2024, 24 (07)
[8] Noisy Dueling Double Deep Q-Network algorithm for autonomous underwater vehicle path planning
Liao, Xu
Li, Le
Huang, Chuangxia
Zhao, Xian
Tan, Shumin
FRONTIERS IN NEUROROBOTICS, 2024, 18
[9] Improved Double Deep Q Network Algorithm Based on Average Q-Value Estimation and Reward Redistribution for Robot Path Planning
Yin, Yameng
Zhang, Lieping
Shi, Xiaoxu
Wang, Yilin
Peng, Jiansheng
Zou, Jianchu
CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (02): : 2769 - 2790
[10] PATH PLANNING OF MOBILE ROBOT BASED ON THE IMPROVED Q-LEARNING ALGORITHM
Chen, Chaorui
Wang, Dongshu
INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2022, 18 (03): : 687 - 702

← 1 2 3 4 5 →