Mobile robot navigation based on intrinsic reward mechanism with TD3 algorithm

被引:0
|
作者
Yang, Jianan [1 ]
Liu, Yu [1 ]
Zhang, Jie [2 ]
Guan, Yong [1 ]
Shao, Zhenzhou [1 ]
机构
[1] Capital Normal Univ, Coll Informat Engn, 105 West Third Ring Rd North, Beijing 100048, Peoples R China
[2] Beijing Univ Chem Technol, Coll Informat Sci & Technol, Beijing, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Mobile robots; deep reinforcement learning; intrinsic reward; curiosity; random enhancement;
D O I
10.1177/17298806241292893
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Deep reinforcement learning methods have been applied to mobile robot navigation to find the optimal path to the target. The rewards are usually given when the task is completed, which may lead to the local optima during the training procedure. It seriously affects the training efficiency and navigation performance of the mobile robot. To this end, this paper proposes an intrinsic reward mechanism with intrinsic curiosity module and randomness enhanced module, combining the TD3 (twin-delayed deep deterministic policy gradient) reinforcement learning algorithm for mobile robot navigation. It effectively resolves the issue of slow convergence caused by sparse rewards in continuous action spaces. It also encourages mobile robots to explore unknown areas and reduces the occurrence of local optima. The experimental results show that the proposed navigation method significantly improves the training efficiency of mobile robots. Out of 1000 test episodes, only 3 exceeded the maximum step limit. This approach significantly reduces the occurrence of local optima. Furthermore, it increases the success rate to an impressive 83.5%, outperforms the existing navigation methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] SLAM algorithm and Navigation for Indoor Mobile Robot Based on ROS
    Zhou, Ling
    Zhu, Chen
    Su, Xinyuan
    2022 2ND IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND ARTIFICIAL INTELLIGENCE (SEAI 2022), 2022, : 230 - 236
  • [22] Autonomous Navigation Experiment for Mobile Robot Based on IHDR Algorithm
    Li, Weiling
    Wu, Huaiyu
    Chen, Yang
    Cheng, Lei
    2015 IEEE INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2015, : 572 - 576
  • [23] Automatic control of mobile robot based on autonomous navigation algorithm
    Liping Wang
    Artificial Life and Robotics, 2019, 24 : 494 - 498
  • [24] Automatic control of mobile robot based on autonomous navigation algorithm
    Wang, Liping
    ARTIFICIAL LIFE AND ROBOTICS, 2019, 24 (04) : 494 - 498
  • [25] TD3 Algorithm of Dynamic Classification Replay Buffer Based PID Parameter Optimization
    Zhong, Haojun
    Wang, Zhenlei
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2024, 22 (10) : 3068 - 3082
  • [26] Research on SLAM Algorithm and Navigation of Mobile Robot Based on ROS
    Liu, Bin
    Guan, Zhiwei
    Li, Bin
    Wen, Guoqiang
    Zhao, Yu
    2021 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2021), 2021, : 119 - 124
  • [27] The Optimal Strategies of Maneuver Decision in Air Combat of UCAV Based on the Improved TD3 Algorithm
    Gao, Xianzhong
    Zhang, Yue
    Wang, Baolai
    Leng, Zhihui
    Hou, Zhongxi
    DRONES, 2024, 8 (09)
  • [28] Self-organization of place cells and reward-based navigation for a mobile robot
    Takahashi, T
    Tanaka, T
    Nishida, K
    Kurita, T
    8TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING, VOLS 1-3, PROCEEDING, 2001, : 1164 - 1169
  • [29] Energy management strategy based on an improved TD3 reinforcement algorithm with novel experience replay
    Niu, Zegong
    Huang, Ruchen
    He, Hongwen
    Zhou, Zhiqiang
    Su, Qicong
    2023 IEEE VEHICLE POWER AND PROPULSION CONFERENCE, VPPC, 2023,
  • [30] A Bionic Robot Navigation Algorithm Based on Cognitive Mechanism of Hippocampus
    Yu, Naigong
    Zhai, Yujia
    Yuan, Yunhe
    Wang, Zongxia
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2019, 16 (04) : 1640 - 1652