Robot Dynamic Path Planning Based on Prioritized Experience Replay and LSTM Network

被引：0

作者：

Li, Hongqi ^{[1
]}

Zhong, Peisi ^{[1
]}

Liu, Li ^{[2
]}

Wang, Xiao ^{[1
]}

Liu, Mei ^{[3
]}

Yuan, Jie ^{[1
]}

机构：

[1] Shandong Univ Sci & Technol, Coll Mech & Elect Engn, Qingdao 266590, Peoples R China

[2] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China

[3] Shandong Univ Sci & Technol, Coll Energy Storage Technol, Qingdao 266590, Peoples R China

来源：

IEEE ACCESS | 2025年 / 13卷

基金：

中国国家自然科学基金;

关键词：

Heuristic algorithms; Long short term memory; Path planning; Convergence; Robots; Training; Planning; Adaptation models; Accuracy; Deep reinforcement learning; DDQN; LSTM network; mobile robot; path planning; prioritized experience replay; LEARNING ALGORITHM;

D O I：

10.1109/ACCESS.2025.3532449

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To address the issues of slow convergence speed, poor dynamic adaptability, and path redundancy in the Double Deep Q Network (DDQN) within complex obstacle environments, this paper proposes an enhanced algorithm within the deep reinforcement learning framework. This algorithm, termed LPDDQN, integrates Prioritized Experience Replay (PER) and the Long Short Term Memory (LSTM) network to improve upon the DDQN algorithm. First, Prioritized Experience Replay (PER) is utilized to prioritize experience data and optimize storage and sampling operations through the SumTree structure, rather than the conventional experience queue. Second, the LSTM network is introduced to enhance the dynamic adaptability of the DDQN algorithm. Owing to the introduction of the LSTM model, the experience samples must be sliced and populated. The performance of the proposed LPDDQN algorithm is compared with five other path planning algorithms in both static and dynamic environments. Simulation analysis shows that in a static environment, LPDDQN demonstrates significant improvements over traditional DDQN in terms of convergence, number of moving steps, success rate, and number of turns, with respective improvements of 24.07%, 17.49%, 37.73%, and 61.54%. In dynamic and complex environments, the success rates of all algorithms, except TLD3 and the LPDDQN, decreased significantly. Further analysis reveals that the LPDDQN outperforms the TLD3 by 18.87%, 2.41%, and 39.02% in terms of moving steps, success rate, and number of turns, respectively.

引用

页码：22283 / 22299

页数：17

共 50 条

[31] Research on Path Planning of Mobile Robot Based on Neural Network Algorithm
Duan, Chenxu
Tang, Xiaojie
PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON MACHINE INTELLIGENCE AND DIGITAL APPLICATIONS, MIDA2024, 2024, : 717 - 723
[32] Robot path planning based on artificial immune network
Hu, Xuanzi
Xie, Cunxi
Xu, Qingui
2007 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS, VOLS 1-5, 2007, : 1053 - +
[33] Path planning of mobile robot based on improved TD3 algorithm in dynamic environment
Li, Peng
Chen, Donghui
Wang, Yuchen
Zhang, Lanyong
Zhao, Shiquan
HELIYON, 2024, 10 (11)
[34] MOD-RRT*: A Sampling-Based Algorithm for Robot Path Planning in Dynamic Environment
Qi, Jie
Yang, Hui
Sun, Haixin
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, 68 (08) : 7244 - 7251
[35] Path planning of mobile robot based on improved double deep Q-network algorithm
Wang, Zhenggang
Song, Shuhong
Cheng, Shenghui
FRONTIERS IN NEUROROBOTICS, 2025, 19
[36] Neural Machine Translation Based on Prioritized Experience Replay
Sun, Shuo
Hou, Hongxu
Wu, Nier
Guo, Ziyue
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 358 - 368
[37] An Improved Dyna-Q Algorithm for Mobile Robot Path Planning in Unknown Dynamic Environment
Pei, Muleilan
An, Hao
Liu, Bo
Wang, Changhong
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (07): : 4415 - 4425
[38] Dynamic Multi-Role Adaptive Collaborative Ant Colony Optimization for Robot Path Planning
Zhang, Dehui
You, Xiaoming
Liu, Sheng
Pan, Han
IEEE ACCESS, 2020, 8 : 129958 - 129974
[39] Past, Present and Future of Path-Planning Algorithms for Mobile Robot Navigation in Dynamic Environments
Hewawasam, H. S.
Ibrahim, M. Yousef
Appuhamillage, Gayan Kahandawa
IEEE OPEN JOURNAL OF THE INDUSTRIAL ELECTRONICS SOCIETY, 2022, 3 : 353 - 365
[40] Mobile Robot Path Planning Based on Improved Localized Particle Swarm Optimization
Zhang, Lin
Zhang, Yingjie
Li, Yangfan
IEEE SENSORS JOURNAL, 2021, 21 (05) : 6962 - 6972

← 1 2 3 4 5 →