Local Path Planning of Mobile Robots Based on the Improved SAC Algorithm

被引:0
作者
Zhou, Ruihong [1 ]
Li, Caihong [1 ]
Zhang, Guosheng [1 ]
Zhang, Yaoyu [1 ]
Liu, Jiajun [2 ]
机构
[1] Shandong Univ Technol, Sch Comp Sci & Technol, Zibo 255049, Peoples R China
[2] Lingnan Univ, Fac Business, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Mobile robots; local path planning; reinforcement learning; SAC algorithm; priority experience replay; experience pool adjustment; Robot Operating System (ROS);
D O I
10.14569/IJACSA.2024.01505100
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper proposes a new EP-PER-SAC algorithm to solve the problems of slow training speed and low learning efficiency of the SAC (Soft Actor Critic) algorithm in the local path planning of mobile robots by introducing the Priority Experience Replay (PER) strategy and Experience Pool (EP) adjustment technique. This algorithm replaces equal probability random sampling with sampling based on the priority experience to increase the frequency of extracting important samples, thereby improves the stability and convergence speed of model training. On this basis, it requires to continuously monitor the learning progress and exploration rate changes of the robot to dynamically adjust the experience pool, so the robot can adapt effectively to the environment changes and the storage requirements and learning efficiency of the algorithm are balanced. Then, the algorithm's reward and punishment function is improved to reduce the blindness of algorithm training. Finally, experiments are conducted under different obstacle environments to verify the feasibility of the algorithm based on ROS (Robot Operating System) simulation platform and real environment. The results show that the improved EP-PER-SAC algorithm has a shorter path length and faster model convergence speed than the original SAC algorithm and PER-SAC algorithm.
引用
收藏
页码:991 / 999
页数:9
相关论文
共 23 条
[1]  
Deng Z., 2023, Journal of New Media., V5
[2]  
Fujimoto S, 2018, PR MACH LEARN RES, V80
[3]   Motion path planning of 6-DOF industrial robot based on fuzzy control algorithm [J].
Gao, Huanbing ;
Lu, Shouyin ;
Wang, Tao .
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (04) :3773-3782
[4]   Efficient Path Planning for Mobile Robot Based on Deep Deterministic Policy Gradient [J].
Gong, Hui ;
Wang, Peng ;
Ni, Cui ;
Cheng, Nuo .
SENSORS, 2022, 22 (09)
[5]   Research on Path Planning of Mobile Robot with a Novel Improved Artificial Potential Field Algorithm [J].
Guo, Tiezheng ;
Wang, Jie ;
Wang, Zhiming ;
Chen, Wei ;
Chen, Guojun ;
Zhang, Shishi .
MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
[6]  
Haarnoja T, 2018, PR MACH LEARN RES, V80
[7]   Research on global path planning of electric disinfection vehicle based on improved A* algorithm [J].
Jiang, Haojie ;
Sun, Yuan .
ENERGY REPORTS, 2021, 7 :1270-1279
[8]   Path planning for underwater gliders in time-varying ocean current using deep reinforcement learning [J].
Lan, Wei ;
Jin, Xiang ;
Chang, Xin ;
Wang, Tianlin ;
Zhou, Han ;
Tian, Wei ;
Zhou, Lilei .
OCEAN ENGINEERING, 2022, 262
[9]   Research on Dynamic Path Planning of Mobile Robot Based on Improved DDPG Algorithm [J].
Li, Peng ;
Ding, Xiangcheng ;
Sun, Hongfang ;
Zhao, Shiquan ;
Cajo, Ricardo .
MOBILE INFORMATION SYSTEMS, 2021, 2021
[10]  
Li X., 2021, J. Phys., Conf. Ser., V2083