Robot Dynamic Path Planning Based on Prioritized Experience Replay and LSTM Network

Cited by: 0
Authors
Li, Hongqi [1]
Zhong, Peisi [1]
Liu, Li [2]
Wang, Xiao [1]
Liu, Mei [3]
Yuan, Jie [1]
Affiliations
[1] Shandong Univ Sci & Technol, Coll Mech & Elect Engn, Qingdao 266590, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
[3] Shandong Univ Sci & Technol, Coll Energy Storage Technol, Qingdao 266590, Peoples R China
Source
IEEE ACCESS | 2025 / Volume 13
Funding
National Natural Science Foundation of China
Keywords
Heuristic algorithms; Long short term memory; Path planning; Convergence; Robots; Training; Planning; Adaptation models; Accuracy; Deep reinforcement learning; DDQN; LSTM network; mobile robot; path planning; prioritized experience replay; LEARNING ALGORITHM;
DOI
10.1109/ACCESS.2025.3532449
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
To address the slow convergence, poor dynamic adaptability, and path redundancy of the Double Deep Q Network (DDQN) in complex obstacle environments, this paper proposes an enhanced algorithm within the deep reinforcement learning framework. The algorithm, termed LPDDQN, integrates Prioritized Experience Replay (PER) and a Long Short Term Memory (LSTM) network into DDQN. First, PER prioritizes experience data and uses a SumTree structure, rather than the conventional experience queue, to optimize storage and sampling. Second, the LSTM network is introduced to enhance the dynamic adaptability of DDQN; because of the LSTM model, the experience samples must be sliced and padded. The performance of LPDDQN is compared with five other path planning algorithms in both static and dynamic environments. Simulation analysis shows that in a static environment, LPDDQN improves on traditional DDQN in convergence, number of moving steps, success rate, and number of turns by 24.07%, 17.49%, 37.73%, and 61.54%, respectively. In dynamic and complex environments, the success rates of all algorithms except TLD3 and LPDDQN decreased significantly. Further analysis reveals that LPDDQN outperforms TLD3 by 18.87%, 2.41%, and 39.02% in moving steps, success rate, and number of turns, respectively.
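The abstract names a SumTree as the storage and sampling structure for PER but gives no implementation. The following is a minimal Python sketch of the generic SumTree technique, not the authors' code: leaves hold transition priorities, internal nodes hold the sum of their children, and a transition is drawn with probability proportional to its priority. The class name `SumTree`, the helper `sample_batch`, and all parameter names are hypothetical and chosen only for illustration.

```python
import random


class SumTree:
    """Array-backed binary tree: leaves store priorities, parents store sums.

    Hypothetical illustration of the generic structure; the paper's own
    implementation details are not given in the abstract.
    """

    def __init__(self, capacity):
        self.capacity = capacity
        self.tree = [0.0] * (2 * capacity - 1)  # internal nodes, then leaves
        self.data = [None] * capacity           # stored transitions
        self.write = 0                          # next slot to overwrite

    def total(self):
        # The root holds the sum of all leaf priorities.
        return self.tree[0]

    def update(self, leaf, priority):
        # Set a leaf priority and propagate the difference up to the root.
        change = priority - self.tree[leaf]
        self.tree[leaf] = priority
        while leaf != 0:
            leaf = (leaf - 1) // 2
            self.tree[leaf] += change

    def add(self, priority, transition):
        # Store a transition, overwriting the oldest one when full.
        leaf = self.write + self.capacity - 1
        self.data[self.write] = transition
        self.update(leaf, priority)
        self.write = (self.write + 1) % self.capacity

    def get(self, value):
        # Walk down from the root to the leaf whose cumulative
        # priority interval contains `value`.
        idx = 0
        while 2 * idx + 1 < len(self.tree):
            left = 2 * idx + 1
            if value < self.tree[left]:
                idx = left
            else:
                value -= self.tree[left]
                idx = left + 1
        return idx, self.tree[idx], self.data[idx - self.capacity + 1]


def sample_batch(tree, batch_size):
    # Stratified sampling: split [0, total) into equal segments and
    # draw one value uniformly from each segment.
    segment = tree.total() / batch_size
    return [tree.get((i + random.random()) * segment) for i in range(batch_size)]
```

Both `update` and `get` touch one node per tree level, so storing and sampling cost O(log N) in the buffer capacity. A plain queue can be sampled uniformly in constant time, but sampling it in proportion to priority would require scanning every entry.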
Pages: 22283-22299
Number of pages: 17
Related papers
50 in total
  • [21] A modified dueling DQN algorithm for robot path planning incorporating priority experience replay and artificial potential fields
    Chang Li
    Xiaofeng Yue
    Zeyuan Liu
    Guoyuan Ma
    Hongbo Zhang
    Yuan Zhou
    Juan Zhu
    Applied Intelligence, 2025, 55 (6)
  • [22] Mobile Robot Navigation Based on Noisy N-Step Dueling Double Deep Q-Network and Prioritized Experience Replay
    Hu, Wenjie
    Zhou, Ye
    Ho, Hann Woei
    ELECTRONICS, 2024, 13 (12)
  • [23] A Dynamic Risk Level Based Bioinspired Neural Network Approach for Robot Path Planning
    Ni, Jianjun
    Li, Xinyun
    Fan, Xinnan
    Shen, Jinrong
    2014 WORLD AUTOMATION CONGRESS (WAC): EMERGING TECHNOLOGIES FOR A NEW PARADIGM IN SYSTEM OF SYSTEMS ENGINEERING, 2014
  • [24] Dynamic Path Planning for Mobile Robot Based on Improved Genetic Algorithm
    Liu Changan
    Yan Xiaohu
    Liu Chunyang
    Li Guodong
    CHINESE JOURNAL OF ELECTRONICS, 2010, 19 (02): 245-248
  • [25] Optimization of Dynamic Mobile Robot Path Planning based on Evolutionary Methods
    Fetanat, Masoud
    Haghzad, Sajjad
    Shouraki, Saeed Bagheri
    2015 AI & ROBOTICS (IRANOPEN), 2015
  • [26] Path planning based on improved A* and dynamic window approach for mobile robot
    Chen J.
    Xu L.
    Chen J.
    Liu Q.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2022, 28 (06): 1650-1658
  • [27] Mobile robot path planning based on optimized A* and dynamic window approach
    Wang B.
    Nie J.
    Li H.
    Xie X.
    Yan H.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2024, 30 (04): 1353-1363
  • [28] Dynamic path planning of mobile robot based on artificial potential field
    He, Naifeng
    Su, Yifan
    Guo, Jilu
    Fan, Xiaoliang
    Liu, Zihong
    Wang, Bolun
    2020 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND HUMAN-COMPUTER INTERACTION (ICHCI 2020), 2020: 259-264
  • [29] Research on Robot Path Planning Based on Fuzzy Neural Network Algorithm
    Wang, Hao
    Duan, Jie
    Wang, Maoli
    Zhao, Jingbo
    Dong, Zhenzhen
    PROCEEDINGS OF 2018 IEEE 3RD ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC 2018), 2018: 1800-1803
  • [30] Path Planning Methods of Mobile Robot Based on New Neural Network
    Lv Zhanyong
    Cao Jiangtao
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013: 3222-3226