Modular hierarchical reinforcement learning for multi-destination navigation in hybrid crowds

被引：5

作者：

Ou, Wen ^{[1
]}

Luo, Biao ^{[1
]}

Wang, Bingchuan ^{[1
]}

Zhao, Yuqian ^{[1
]}

机构：

[1] Cent South Univ, Sch Automat, Changsha 410083, Peoples R China

来源：

NEURAL NETWORKS | 2024年 / 171卷

基金：

中国国家自然科学基金;

关键词：

Crowd navigation; Multi-destination; Deep reinforcement learning; TIME OBSTACLE AVOIDANCE; ATTENTION;

D O I：

10.1016/j.neunet.2023.12.032

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Real-world robot applications usually require navigating agents to face multiple destinations. Besides, the real-world crowded environments usually contain dynamic and static crowds that implicitly interact with each other during navigation. To address this challenging task, a novel modular hierarchical reinforcement learning (MHRL) method is developed in this paper. MHRL is composed of three modules, i.e., destination evaluation, policy switch, and motion network, which are designed exactly according to the three phases of solving the original navigation problem. First, the destination evaluation module rates all destinations and selects the one with the lowest cost. Subsequently, the policy switch module decides which motion network to be used according to the selected destination and the obstacle state. Finally, the selected motion network outputs the robot action. Owing to the complementary strengths of a variety of motion networks and the cooperation of modules in each layer, MHRL is able to deal with hybrid crowds effectively. Extensive simulation experiments demonstrate that MHRL achieves better performance than state-of-the-art methods.

引用

页码：474 / 484

页数：11

共 50 条

[1]

Alonso-Mora J, 2013, SPRINGER TRAC ADV RO, V83, P203

[2] Probabilistically safe motion planning to avoid dynamic obstacles with uncertain motion patterns [J].

Aoude, Georges S. ;

Luders, Brandon D. ;

Joseph, Joshua M. ;

Roy, Nicholas ;

How, Jonathan P. .

AUTONOMOUS ROBOTS, 2013, 35 (01) :51-76

[3]

Bacon PL, 2017, AAAI CONF ARTIF INTE, P1726

[4] REAL-TIME OBSTACLE AVOIDANCE FOR FAST MOBILE ROBOTS [J].

BORENSTEIN, J ;

KOREN, Y .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1989, 19 (05) :1179-1187

[5]

Cao C, 2019, IEEE INT CONF ROBOT, P5551, DOI [10.1109/ICRA.2019.8794192, 10.1109/icra.2019.8794192]

[6] Visual attention: The past 25 years [J].

Carrasco, Marisa .

VISION RESEARCH, 2011, 51 (13) :1484-1525

[7] Relational Graph Learning for Crowd Navigation [J].

Chen, Changan ;

Hu, Sha ;

Nikdel, Payam ;

Mori, Greg ;

Savva, Manolis .

2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, :10007-10013

[8]

Chen CG, 2019, IEEE INT CONF ROBOT, P6015, DOI [10.1109/icra.2019.8794134, 10.1109/ICRA.2019.8794134]

[9]

Chen YF, 2017, IEEE INT C INT ROBOT, P1343, DOI 10.1109/IROS.2017.8202312

[10] Robot Navigation in Crowds by Graph Convolutional Networks With Attention Learned From Human Gaze [J].

Chen, Yuying ;

Liu, Congcong ;

Shi, Bertram E. ;

Liu, Ming .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) :2754-2761

← 1 2 3 4 5 →