Mobile robot navigation based on intrinsic reward mechanism with TD3 algorithm

被引:0
|
作者
Yang, Jianan [1 ]
Liu, Yu [1 ]
Zhang, Jie [2 ]
Guan, Yong [1 ]
Shao, Zhenzhou [1 ]
机构
[1] Capital Normal Univ, Coll Informat Engn, 105 West Third Ring Rd North, Beijing 100048, Peoples R China
[2] Beijing Univ Chem Technol, Coll Informat Sci & Technol, Beijing, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Mobile robots; deep reinforcement learning; intrinsic reward; curiosity; random enhancement;
D O I
10.1177/17298806241292893
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Deep reinforcement learning methods have been applied to mobile robot navigation to find the optimal path to the target. The rewards are usually given when the task is completed, which may lead to the local optima during the training procedure. It seriously affects the training efficiency and navigation performance of the mobile robot. To this end, this paper proposes an intrinsic reward mechanism with intrinsic curiosity module and randomness enhanced module, combining the TD3 (twin-delayed deep deterministic policy gradient) reinforcement learning algorithm for mobile robot navigation. It effectively resolves the issue of slow convergence caused by sparse rewards in continuous action spaces. It also encourages mobile robots to explore unknown areas and reduces the occurrence of local optima. The experimental results show that the proposed navigation method significantly improves the training efficiency of mobile robots. Out of 1000 test episodes, only 3 exceeded the maximum step limit. This approach significantly reduces the occurrence of local optima. Furthermore, it increases the success rate to an impressive 83.5%, outperforms the existing navigation methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Vector Control of PMSM Using TD3 Reinforcement Learning Algorithm
    Yin, Fengyuan
    Yuan, Xiaoming
    Ma, Zhiao
    Xu, Xinyu
    ALGORITHMS, 2023, 16 (09)
  • [32] Marker Detection Algorithm for the Navigation of a Mobile Robot
    Annusewicz, Anna
    Zwierzchowski, Jaroslaw
    PROCEEDINGS OF 2020 27TH INTERNATIONAL CONFERENCE ON MIXED DESIGN OF INTEGRATED CIRCUITS AND SYSTEM (MIXDES), 2020, : 223 - 226
  • [33] Cuckoo Search Algorithm for the Mobile Robot Navigation
    Mohanty, Prases Kumar
    Parhi, Dayal R.
    SWARM, EVOLUTIONARY, AND MEMETIC COMPUTING, PT I (SEMCCO 2013), 2013, 8297 : 527 - 536
  • [34] A Sensor Based Navigation Algorithm for a Mobile Robot using the DVFF Approach
    Djekoune, A. Oualid
    Achour, Karim
    Toumi, Redouane
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2009, 6 (02): : 97 - 108
  • [35] Harmonic Potential Function Based Algorithm for Autonomous Mobile Robot Navigation
    Panati, Subbash
    Baasandorj, Bayanjargal
    Chong, Kil To
    ADVANCED SCIENCE LETTERS, 2015, 21 (12) : 3662 - 3666
  • [36] A sensor based navigation algorithm for a mobile robot using the DVFF approach
    Djekoune, A. Oualid
    Achour, Karim
    Toum, Redouane
    International Journal of Advanced Robotic Systems, 2009, 6 (02) : 97 - 108
  • [37] Model-Free Attitude Control of Spacecraft Based on PID-Guide TD3 Algorithm
    Zhang, ZhiBin
    Li, XinHong
    An, JiPing
    Man, WanXin
    Zhang, GuoHui
    INTERNATIONAL JOURNAL OF AEROSPACE ENGINEERING, 2020, 2020
  • [38] One-to-one Air-combat Maneuver Strategy Based on Improved TD3 Algorithm
    Qiu, Xuyi
    Yao, Ziyu
    Tan, Fuwei
    Zhu, Zhen
    Lu, Jun-Guo
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 5719 - 5725
  • [39] Whisker based mobile robot navigation
    Jung, D
    Zelinsky, A
    IROS 96 - PROCEEDINGS OF THE 1996 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS - ROBOTIC INTELLIGENCE INTERACTING WITH DYNAMIC WORLDS, VOLS 1-3, 1996, : 497 - 504
  • [40] Mobile Robot Navigation Based on Lidar
    Cheng, Yi
    Wang, Gong Ye
    PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 1243 - 1246