Safe Robot Navigation Using Constrained Hierarchical Reinforcement Learning

被引:0
作者
Roza, Felippe Schmoeller [1 ]
Rasheed, Hassan [1 ]
Roscher, Karsten [1 ]
Ning, Xiangyu [1 ]
Guennemann, Stephan [2 ]
机构
[1] Fraunhofer IKS, Munich, Germany
[2] Tech Univ Munich, Dept Comp Sci, Munich, Germany
来源
2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA | 2022年
关键词
Hierarchical Reinforcement Learning; Safety; Robot Navigation; Constrained Reinforcement Learning;
D O I
10.1109/ICMLA55696.2022.00123
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Safe navigation is one of the steps necessary for achieving autonomous control of robots. Among different algorithms that focus on robot navigation, Reinforcement Learning (and more specifically Deep Reinforcement Learning) has shown impressive results for controlling robots with complex and highdimensional state representations. However, when integrating methods to comply with safety requirements by means of constraint satisfaction in flat Reinforcement Learning policies, the system performance can be affected. In this paper, we propose a constrained Hierarchical Reinforcement Learning framework with a safety layer used to modify the low-level policy to achieve a safer operation of the robot. Results obtained in simulation show that the proposed method is better at retaining performance while keeping the system in a safe region when compared to a constrained flat model.
引用
收藏
页码:737 / 742
页数:6
相关论文
共 27 条
  • [1] Achiam J, 2017, PR MACH LEARN RES, V70
  • [2] Alshiekh M, 2018, AAAI CONF ARTIF INTE, P2669
  • [3] Altman E., 1995, PhD dissertation
  • [4] Berner C., 2019, ARXIV
  • [5] Dalal G, 2018, Arxiv, DOI arXiv:1801.08757
  • [6] Fujimoto S, 2018, PR MACH LEARN RES, V80
  • [7] Hierarchical Program-Triggered Reinforcement Learning Agents for Automated Driving
    Gangopadhyay, Briti
    Soora, Harshit
    Dasgupta, Pallab
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 10902 - 10911
  • [8] Gronauer S., 2022, Tech. Rep.
  • [9] Kamran D, 2020, IEEE INT VEH SYM, P1205, DOI 10.1109/IV47402.2020.9304606
  • [10] Levy A., 2017, ARXIV PREPRINT ARXIV, V12