Mapless navigation via Hierarchical Reinforcement Learning with memory-decaying novelty

被引：0

作者：

Gao, Yan ^{[1
]}

Lin, Feiqiang ^{[1
]}

Cai, Boliang ^{[1
]}

Wu, Jing ^{[2
]}

Wei, Changyun ^{[3
]}

Grech, Raphael ^{[4
]}

Ji, Ze ^{[1
]}

机构：

[1] Cardiff Univ, Sch Engn, Cardiff CF24 3AA, Wales

[2] Cardiff Univ, Sch Comp Sci & Informat, Cardiff CF24 4AG, Wales

[3] Hohai Univ, Coll Mech & Elect Engn, Changzhou, Peoples R China

[4] Spirent Commun, Paignton TQ4 7QR, England

来源：

ROBOTICS AND AUTONOMOUS SYSTEMS | 2024年 / 182卷

关键词：

Mapless navigation; Deep reinforcement learning; Collision avoidance; Hierarchical Reinforcement Learning; Path planning; DEEP NEURAL-NETWORKS;

D O I：

10.1016/j.robot.2024.104815

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Hierarchical Reinforcement Learning (HRL) has shown superior performance for mapless navigation tasks. However, it remains limited in unstructured environments that might contain terrains like long corridors and dead corners, which can lead to local minima. This is because most HRL-based mapless navigation methods employ a simplified reward setting and exploration strategy. In this work, we propose a novel reward function for training the high-level (HL) policy, which contains two components: extrinsic reward and intrinsic reward. The extrinsic reward encourages the robot to move towards the target location, while the intrinsic reward is computed based on novelty, episode memory and memory decaying, making the agent capable of accomplishing spontaneous exploration. We also design a novel neural network structure that incorporates an LSTM network to augment the agent with memory and reasoning capabilities. We test our method in unknown environments and specific scenarios prone to the local minimum problem to evaluate the navigation performance and local minimum resolution ability. The results show that our method significantly increases the success rate when compared to advanced RL-based methods, achieving a maximum improvement of nearly 28%. Our method demonstrates effective improvement in addressing the local minimum issue, especially in cases where the baselines fail completely. Additionally, numerous ablation studies consistently confirm the effectiveness of our proposed reward function and neural network structure.

引用

页数：14

共 50 条

[11] Reinforcement Learning with Auxiliary Localization Task for Mapless Navigation
He, Cong
Zhang, Wengang
Wang, Teng
2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 3069 - 3073
[12] FGRL: Federated Growing Reinforcement Learning for Resilient Mapless Navigation in Unfamiliar Environments
Tian, Shunyu
Wei, Changyun
Li, Yajun
Ji, Ze
APPLIED SCIENCES-BASEL, 2024, 14 (23):
[13] MSN: Mapless Short-Range Navigation Based on Time Critical Deep Reinforcement Learning
Li, Bohan
Huang, Zhelong
Chen, Tony Weitong
Dai, Tianlun
Zang, Yalei
Xie, Wenbin
Tian, Bo
Cai, Ken
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (08) : 8628 - 8637
[14] Environment Exploration for Mapless Navigation based on Deep Reinforcement Learning
Toan, Nguyen Duc
Gon-Woo, Kim
2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 17 - 20
[15] Sim-to-Real: Mapless Navigation for USVs Using Deep Reinforcement Learning
Wang, Ning
Wang, Yabiao
Zhao, Yuming
Wang, Yong
Li, Zhigang
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (07)
[16] Deep Reinforcement Learning-Based Mapless Navigation for Mobile Robot in Unknown Environment With Local Optima
Hu, Yiming
Wang, Shuting
Xie, Yuanlong
Zheng, Shiqi
Shi, Peng
Rudas, Imre
Cheng, Xiang
IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (01): : 628 - 635
[17] Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics
Xue, Honghu
Hein, Benedikt
Bakr, Mohamed
Schildbach, Georg
Abel, Bengt
Rueckert, Elmar
APPLIED SCIENCES-BASEL, 2022, 12 (06):
[18] An Improvement on Mapless Navigation with Deep Reinforcement Learning: A Reward Shaping Approach
Alipanah, Arezoo
Moosavian, S. Ali A.
2022 10TH RSI INTERNATIONAL CONFERENCE ON ROBOTICS AND MECHATRONICS (ICROM), 2022, : 261 - 266
[19] IPAPRec: A Promising Tool for Learning High-Performance Mapless Navigation Skills With Deep Reinforcement Learning
Zhang, Wei
Zhang, Yunfeng
Liu, Ning
Ren, Kai
Wang, Pengfei
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2022, 27 (06) : 5451 - 5461
[20] An Efficient Deep Reinforcement Learning Algorithm for Mapless Navigation with Gap-Guided Switching Strategy
Heng Li
Jiahu Qin
Qingchen Liu
Chengzhen Yan
Journal of Intelligent & Robotic Systems, 2023, 108

← 1 2 3 4 5 →