Safe Robot Navigation Using Constrained Hierarchical Reinforcement Learning

被引：0

作者：

Roza, Felippe Schmoeller ^{[1
]}

Rasheed, Hassan ^{[1
]}

Roscher, Karsten ^{[1
]}

Ning, Xiangyu ^{[1
]}

Guennemann, Stephan ^{[2
]}

机构：

[1] Fraunhofer IKS, Munich, Germany

[2] Tech Univ Munich, Dept Comp Sci, Munich, Germany

来源：

2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA | 2022年

关键词：

Hierarchical Reinforcement Learning; Safety; Robot Navigation; Constrained Reinforcement Learning;

D O I：

10.1109/ICMLA55696.2022.00123

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Safe navigation is one of the steps necessary for achieving autonomous control of robots. Among different algorithms that focus on robot navigation, Reinforcement Learning (and more specifically Deep Reinforcement Learning) has shown impressive results for controlling robots with complex and highdimensional state representations. However, when integrating methods to comply with safety requirements by means of constraint satisfaction in flat Reinforcement Learning policies, the system performance can be affected. In this paper, we propose a constrained Hierarchical Reinforcement Learning framework with a safety layer used to modify the low-level policy to achieve a safer operation of the robot. Results obtained in simulation show that the proposed method is better at retaining performance while keeping the system in a safe region when compared to a constrained flat model.

引用

页码：737 / 742

页数：6

共 27 条

[1] Achiam J, 2017, PR MACH LEARN RES, V70
[2] Alshiekh M, 2018, AAAI CONF ARTIF INTE, P2669
[3] Altman E., 1995, PhD dissertation
[4] Berner C., 2019, ARXIV
[5] Dalal G, 2018, Arxiv, DOI arXiv:1801.08757
[6] Fujimoto S, 2018, PR MACH LEARN RES, V80
[7] Hierarchical Program-Triggered Reinforcement Learning Agents for Automated Driving
Gangopadhyay, Briti
Soora, Harshit
Dasgupta, Pallab
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 10902 - 10911
[8] Gronauer S., 2022, Tech. Rep.
[9] Kamran D, 2020, IEEE INT VEH SYM, P1205, DOI 10.1109/IV47402.2020.9304606
[10] Levy A., 2017, ARXIV PREPRINT ARXIV, V12

← 1 2 3 →