Mobile robot navigation based on intrinsic reward mechanism with TD3 algorithm

Cited by: 0

Authors:

Yang, Jianan [1]

Liu, Yu [1]

Zhang, Jie [2]

Guan, Yong [1]

Shao, Zhenzhou [1]

Affiliations:

[1] Capital Normal Univ, Coll Informat Engn, 105 West Third Ring Rd North, Beijing 100048, Peoples R China

[2] Beijing Univ Chem Technol, Coll Informat Sci & Technol, Beijing, Peoples R China

Source:

INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS | 2024, Vol. 21, No. 5

Funding:

National Natural Science Foundation of China;

Keywords:

Mobile robots; deep reinforcement learning; intrinsic reward; curiosity; random enhancement;

DOI:

10.1177/17298806241292893

Chinese Library Classification:

TP24 [Robotics];

Discipline classification code:

080202 ; 1405 ;

Abstract:

Deep reinforcement learning methods have been applied to mobile robot navigation to find the optimal path to the target. Rewards are usually given when the task is completed, which can lead to local optima during training. This seriously affects the training efficiency and navigation performance of the mobile robot. To address this, this paper proposes an intrinsic reward mechanism with an intrinsic curiosity module and a randomness enhanced module, combined with the TD3 (twin delayed deep deterministic policy gradient) reinforcement learning algorithm for mobile robot navigation. It effectively resolves the slow convergence caused by sparse rewards in continuous action spaces. It also encourages mobile robots to explore unknown areas and reduces the occurrence of local optima. The experimental results show that the proposed navigation method significantly improves the training efficiency of mobile robots. Out of 1000 test episodes, only 3 exceeded the maximum step limit, indicating that the approach significantly reduces the occurrence of local optima. Furthermore, it raises the success rate to 83.5%, outperforming existing navigation methods.
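As a rough illustration of the curiosity idea mentioned in the abstract, the sketch below computes an intrinsic reward as the scaled squared prediction error of a forward model, in the style of the commonly used intrinsic curiosity module formulation, and adds it to the task reward. The function names, the `eta` and `beta` scaling factors, and the plain-list feature vectors are assumptions for illustration; this record does not give the paper's actual formulas, and the randomness enhanced module is not modeled here.

```python
def intrinsic_reward(predicted_next_features, actual_next_features, eta=0.1):
    """Curiosity bonus: (eta / 2) * squared error of a forward model's prediction.

    Both arguments are equal-length sequences of floats (hypothetical
    state-feature vectors). A larger error means the transition was less
    predictable, so the agent is rewarded more for having explored it.
    """
    if len(predicted_next_features) != len(actual_next_features):
        raise ValueError("feature vectors must have the same length")
    squared_error = sum(
        (p - a) ** 2
        for p, a in zip(predicted_next_features, actual_next_features)
    )
    return 0.5 * eta * squared_error


def total_reward(extrinsic, intrinsic, beta=1.0):
    """Combine the sparse task reward with the weighted curiosity bonus."""
    return extrinsic + beta * intrinsic


# Example: the task reward is zero (goal not reached yet), but the forward
# model mispredicted the next state features, so the agent still gets a signal.
bonus = intrinsic_reward([0.0, 0.0], [3.0, 4.0], eta=1.0)
reward = total_reward(extrinsic=0.0, intrinsic=bonus, beta=0.5)
```

In a TD3 training loop, a combined reward of this kind would typically be what gets stored in the replay buffer in place of the raw task reward, so that the critics receive a learning signal even on steps where the environment pays nothing.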

Pages: 10