Mapless navigation via Hierarchical Reinforcement Learning with memory-decaying novelty

Times cited: 0
Authors
Gao, Yan [1]
Lin, Feiqiang [1]
Cai, Boliang [1]
Wu, Jing [2]
Wei, Changyun [3]
Grech, Raphael [4]
Ji, Ze [1]
Affiliations
[1] Cardiff Univ, Sch Engn, Cardiff CF24 3AA, Wales
[2] Cardiff Univ, Sch Comp Sci & Informat, Cardiff CF24 4AG, Wales
[3] Hohai Univ, Coll Mech & Elect Engn, Changzhou, Peoples R China
[4] Spirent Commun, Paignton TQ4 7QR, England
Keywords
Mapless navigation; Deep reinforcement learning; Collision avoidance; Hierarchical Reinforcement Learning; Path planning; DEEP NEURAL-NETWORKS;
DOI
10.1016/j.robot.2024.104815
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Hierarchical Reinforcement Learning (HRL) has shown superior performance for mapless navigation tasks. However, it remains limited in unstructured environments that might contain terrains like long corridors and dead corners, which can lead to local minima. This is because most HRL-based mapless navigation methods employ a simplified reward setting and exploration strategy. In this work, we propose a novel reward function for training the high-level (HL) policy, which contains two components: extrinsic reward and intrinsic reward. The extrinsic reward encourages the robot to move towards the target location, while the intrinsic reward is computed based on novelty, episode memory and memory decaying, making the agent capable of accomplishing spontaneous exploration. We also design a novel neural network structure that incorporates an LSTM network to augment the agent with memory and reasoning capabilities. We test our method in unknown environments and specific scenarios prone to the local minimum problem to evaluate the navigation performance and local minimum resolution ability. The results show that our method significantly increases the success rate when compared to advanced RL-based methods, achieving a maximum improvement of nearly 28%. Our method demonstrates effective improvement in addressing the local minimum issue, especially in cases where the baselines fail completely. Additionally, numerous ablation studies consistently confirm the effectiveness of our proposed reward function and neural network structure.
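The abstract describes a high-level reward that combines an extrinsic term (progress towards the target) with an intrinsic term based on novelty, episodic memory and memory decay, but it does not give the formulas. The following is only a minimal illustrative sketch of that general idea, not the authors' formulation: the grid discretisation, the count-based novelty `1 / sqrt(1 + m)`, the exponential decay factor, the weights, and every name (`DecayingNoveltyReward`, `cell_size`, `decay`, `w_ext`, `w_int`) are assumptions made for illustration.

```python
import math
from collections import defaultdict


class DecayingNoveltyReward:
    """Hypothetical reward: goal progress plus count-based novelty
    computed from an episodic memory whose entries decay over time."""

    def __init__(self, cell_size=0.5, decay=0.95, w_ext=1.0, w_int=0.1):
        self.cell_size = cell_size  # assumed grid resolution in metres
        self.decay = decay          # assumed per-step memory decay factor
        self.w_ext = w_ext          # assumed weight of the extrinsic term
        self.w_int = w_int          # assumed weight of the intrinsic term
        self.memory = defaultdict(float)  # cell -> decayed visit count

    def reset(self):
        """Clear the episodic memory at the start of an episode."""
        self.memory.clear()

    def _cell(self, pos):
        """Map a 2D position to a discrete grid cell."""
        return (math.floor(pos[0] / self.cell_size),
                math.floor(pos[1] / self.cell_size))

    def intrinsic(self, pos):
        """Novelty of the current cell; older visits count for less."""
        for key in self.memory:
            self.memory[key] *= self.decay  # fade every stored visit
        cell = self._cell(pos)
        novelty = 1.0 / math.sqrt(1.0 + self.memory[cell])
        self.memory[cell] += 1.0  # record the current visit
        return novelty

    def extrinsic(self, prev_pos, pos, goal):
        """Reduction in Euclidean distance to the goal over one step."""
        return math.dist(prev_pos, goal) - math.dist(pos, goal)

    def __call__(self, prev_pos, pos, goal):
        """Weighted sum of the extrinsic and intrinsic terms."""
        return (self.w_ext * self.extrinsic(prev_pos, pos, goal)
                + self.w_int * self.intrinsic(pos))
```

In this sketch, a cell visited repeatedly yields a shrinking bonus, which discourages lingering in a dead end, while the decay lets the bonus recover for places not visited recently, so the agent is not permanently penalised for backtracking out of a corridor.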
Pages: 14
Related papers
50 in total
  • [11] Reinforcement Learning with Auxiliary Localization Task for Mapless Navigation
    He, Cong
    Zhang, Wengang
    Wang, Teng
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 3069 - 3073
  • [12] FGRL: Federated Growing Reinforcement Learning for Resilient Mapless Navigation in Unfamiliar Environments
    Tian, Shunyu
    Wei, Changyun
    Li, Yajun
    Ji, Ze
    APPLIED SCIENCES-BASEL, 2024, 14 (23):
  • [13] MSN: Mapless Short-Range Navigation Based on Time Critical Deep Reinforcement Learning
    Li, Bohan
    Huang, Zhelong
    Chen, Tony Weitong
    Dai, Tianlun
    Zang, Yalei
    Xie, Wenbin
    Tian, Bo
    Cai, Ken
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (08) : 8628 - 8637
  • [14] Environment Exploration for Mapless Navigation based on Deep Reinforcement Learning
    Toan, Nguyen Duc
    Gon-Woo, Kim
    2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 17 - 20
  • [15] Sim-to-Real: Mapless Navigation for USVs Using Deep Reinforcement Learning
    Wang, Ning
    Wang, Yabiao
    Zhao, Yuming
    Wang, Yong
    Li, Zhigang
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (07)
  • [16] Deep Reinforcement Learning-Based Mapless Navigation for Mobile Robot in Unknown Environment With Local Optima
    Hu, Yiming
    Wang, Shuting
    Xie, Yuanlong
    Zheng, Shiqi
    Shi, Peng
    Rudas, Imre
    Cheng, Xiang
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (01) : 628 - 635
  • [17] Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics
    Xue, Honghu
    Hein, Benedikt
    Bakr, Mohamed
    Schildbach, Georg
    Abel, Bengt
    Rueckert, Elmar
    APPLIED SCIENCES-BASEL, 2022, 12 (06):
  • [18] An Improvement on Mapless Navigation with Deep Reinforcement Learning: A Reward Shaping Approach
    Alipanah, Arezoo
    Moosavian, S. Ali A.
    2022 10TH RSI INTERNATIONAL CONFERENCE ON ROBOTICS AND MECHATRONICS (ICROM), 2022, : 261 - 266
  • [19] IPAPRec: A Promising Tool for Learning High-Performance Mapless Navigation Skills With Deep Reinforcement Learning
    Zhang, Wei
    Zhang, Yunfeng
    Liu, Ning
    Ren, Kai
    Wang, Pengfei
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2022, 27 (06) : 5451 - 5461
  • [20] An Efficient Deep Reinforcement Learning Algorithm for Mapless Navigation with Gap-Guided Switching Strategy
    Li, Heng
    Qin, Jiahu
    Liu, Qingchen
    Yan, Chengzhen
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2023, 108