Mobile robot navigation based on intrinsic reward mechanism with TD3 algorithm

Cited by: 0
Authors
Yang, Jianan [1]
Liu, Yu [1]
Zhang, Jie [2]
Guan, Yong [1]
Shao, Zhenzhou [1]
Affiliations
[1] Capital Normal Univ, Coll Informat Engn, 105 West Third Ring Rd North, Beijing 100048, Peoples R China
[2] Beijing Univ Chem Technol, Coll Informat Sci & Technol, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Mobile robots; deep reinforcement learning; intrinsic reward; curiosity; random enhancement;
DOI
10.1177/17298806241292893
Chinese Library Classification (CLC) number
TP24 [Robotics]
Discipline classification code
080202; 1405
Abstract
Deep reinforcement learning methods have been applied to mobile robot navigation to find the optimal path to a target. Rewards are usually given only when the task is completed, which may trap training in local optima and seriously degrades the training efficiency and navigation performance of the mobile robot. To address this, this paper proposes an intrinsic reward mechanism comprising an intrinsic curiosity module and a randomness enhanced module, combined with the TD3 (twin delayed deep deterministic policy gradient) reinforcement learning algorithm, for mobile robot navigation. The mechanism addresses the slow convergence caused by sparse rewards in continuous action spaces, encourages the mobile robot to explore unknown areas, and reduces the occurrence of local optima. Experimental results show that the proposed navigation method significantly improves training efficiency. Out of 1000 test episodes, only 3 exceeded the maximum step limit, which indicates that local optima are rare. The success rate rises to 83.5%, outperforming existing navigation methods.
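The abstract names two ingredients, a curiosity-style intrinsic reward and TD3, without giving their formulas. The following is a minimal sketch of how such pieces are commonly written, not the authors' implementation. It assumes the curiosity reward is the scaled squared error of a forward model's next-state prediction, that the intrinsic and extrinsic rewards are summed with a weight, and that TD3 uses its usual clipped double-Q target with target policy smoothing. All function names and the parameters `eta`, `beta`, `sigma`, and `clip` are hypothetical. The randomness enhanced module is not sketched because the abstract does not describe it.

```python
import random
from typing import List, Optional, Sequence

Vector = Sequence[float]


def curiosity_reward(predicted_next: Vector, actual_next: Vector, eta: float = 0.5) -> float:
    """Assumed curiosity reward: eta times the squared forward-model prediction error."""
    if len(predicted_next) != len(actual_next):
        raise ValueError("feature vectors must have the same length")
    return eta * sum((p - a) ** 2 for p, a in zip(predicted_next, actual_next))


def shaped_reward(extrinsic: float, intrinsic: float, beta: float = 0.1) -> float:
    """Assumed combination: sparse task reward plus a weighted intrinsic bonus."""
    return extrinsic + beta * intrinsic


def smoothed_target_action(action: Vector, sigma: float = 0.2, clip: float = 0.5,
                           low: float = -1.0, high: float = 1.0,
                           rng: Optional[random.Random] = None) -> List[float]:
    """TD3 target policy smoothing: add clipped Gaussian noise, then clamp to action bounds."""
    rng = rng or random.Random()
    smoothed = []
    for a in action:
        noise = max(-clip, min(clip, rng.gauss(0.0, sigma)))
        smoothed.append(max(low, min(high, a + noise)))
    return smoothed


def td3_target(reward: float, done: bool, q1_next: float, q2_next: float,
               gamma: float = 0.99) -> float:
    """TD3 clipped double-Q target: bootstrap from the smaller of the twin target critics."""
    bootstrap = 0.0 if done else gamma * min(q1_next, q2_next)
    return reward + bootstrap


if __name__ == "__main__":
    # A step with no task reward still yields a learning signal when the
    # forward model mispredicts the next state.
    r_int = curiosity_reward([0.0, 1.0], [1.0, 1.0])
    r = shaped_reward(0.0, r_int)
    print("shaped reward:", r)
    print("critic target:", td3_target(r, False, q1_next=2.0, q2_next=1.0))
    print("smoothed action:", smoothed_target_action([0.9, -0.9], rng=random.Random(0)))
```

Under these assumptions the bonus shrinks as the forward model improves on visited states, so it mainly rewards transitions the robot has not yet learned to predict.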
Pages: 10
Related papers
50 in total
  • [1] Inspection Robot Navigation Based on Improved TD3 Algorithm
    Huang, Bo
    Xie, Jiacheng
    Yan, Jiawei
    SENSORS, 2024, 24 (08)
  • [2] Path Planning of Mobile Robot Based on Improved TD3 Algorithm
    Li, Peng
    Wang, Yuchen
    Gao, Zhenyan
    PROCEEDINGS OF 2022 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2022), 2022: 715-720
  • [3] GRU-Attention based TD3 Network for Mobile Robot Navigation
    Jia, Jiayao
    Xing, Xiaowei
    Chang, Dong Eui
    2022 22ND INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2022), 2022: 1642-1647
  • [4] Research and Application of an Improved TD3 Algorithm in Mobile Robot Environment Perception and Autonomous Navigation
    Fu, Bo
    Yao, Xulin
    2024 3RD INTERNATIONAL CONFERENCE ON ROBOTICS, ARTIFICIAL INTELLIGENCE AND INTELLIGENT CONTROL, RAIIC 2024, 2024: 158-162
  • [5] Path planning of mobile robot based on improved TD3 algorithm in dynamic environment
    Li, Peng
    Chen, Donghui
    Wang, Yuchen
    Zhang, Lanyong
    Zhao, Shiquan
    HELIYON, 2024, 10 (11)
  • [6] Intelligent Air Combat Maneuvering Decision Based on TD3 Algorithm
    Zhou Xiaoyu
    Huang Jiangtao
    Zhu Zhe
    Zhang Sheng
    Zhou Pan
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010: 1082-1094
  • [7] D3-TD3: Deep Dense Dueling Architectures in TD3 Algorithm for Robot Path Planning Based on 3D Point Cloud
    Gu, Yuwan
    Zhu, Zhitao
    Chu, Yongtao
    Lv, Jidong
    Wang, Xueyuan
    Xu, Shoukun
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2023, 32 (18)
  • [8] An Algorithm based on Autowaves for Navigation Control of a Mobile Robot
    Medina Hernandez, Jose Antonio
    Gomez Castaneda, Felipe
    Moreno Cadenas, Jose Antonio
    2009 6TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, COMPUTING SCIENCE AND AUTOMATION CONTROL (CCE 2009), 2009, : 512 - +
  • [9] Monocular Vision Based Navigation Algorithm for Mobile Robot
    Liu Hai-Bo
    Dong Yu-Jie
    Wang Fu-Zhong
    Niu Man-Cang
    2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011: 3937-3941
  • [10] The navigation of mobile robot based on hybrid Dijkstra algorithm
    Guo, Jinchao
    Gao, Yu
    Cui, Guangzhao
    Journal of Computational Information Systems, 2014, 10 (09): 3879-3886