HELSA: Hierarchical Reinforcement Learning with Spatiotemporal Abstraction for Large-Scale Multi-Agent Path Finding

被引：1

作者：

Song, Zhaoyi ^{[1
]}

Zhang, Rongqing ^{[1
]}

Cheng, Xiang ^{[2
]}

机构：

[1] Tongji Univ, Sch Software Engn, Shanghai 200092, Peoples R China

[2] Peking Univ, Sch Elect, Beijing 100871, Peoples R China

来源：

2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2023年

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

D O I：

10.1109/IROS55552.2023.10342261

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The Multi-Agent Path Finding (MAPF) problem is a critical challenge in dynamic multi-robot systems. Recent studies have revealed that multi-agent reinforcement learning (MARL) is a promising approach to solving MAPF problems in a fully decentralized manner. However, as the size of the multi-robot system increases, sample inefficiency becomes a major impediment to learning-based methods. This paper presents a hierarchical reinforcement learning (HRL) framework for large-scale multi-agent path finding, featuring applying spatial and temporal abstraction to capture intermediate reward and thus encourage efficient exploration. Specifically, we introduce a meta controller that partitions the map into interconnected regions and optimizes agents' region-wise paths towards globally better solutions. Additionally, we design a lower-level controller that efficiently solves each sub-problem by incorporating heuristic guidance and an inter-agent communication mechanism with RL-based policies. Our empirical results on test instances of various scales demonstrate that our method outperforms existing approaches in terms of both success rate and makespan.

引用

页码：7318 / 7325

页数：8

共 50 条

[41] Learning Selective Communication for Multi-Agent Path Finding
Ma, Ziyuan
Luo, Yudong
Pan, Jia
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 1455 - 1462
[42] Addressing deadlock in large-scale, complex rail networks via multi-agent deep reinforcement learning
Bretas, A. M. C.
Mendes, A.
Chalup, S.
Jackson, M.
Clement, R.
Sanhueza, C.
EXPERT SYSTEMS, 2025, 42 (01)
[43] A multi-agent reinforcement learning method with curriculum transfer for large-scale dynamic traffic signal control
Xuesi Li
Jingchen Li
Haobin Shi
Applied Intelligence, 2023, 53 : 21433 - 21447
[44] Evolution of a Complex Predator-Prey Ecosystem on Large-scale Multi-Agent Deep Reinforcement Learning
Yamada, Jun
Shawe-Taylor, John
Fountas, Zafeirios
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
[45] Distributed Task Offloading for Large-Scale VEC Systems: A Multi-agent Deep Reinforcement Learning Method
Lu, Yanfei
Han, Dengyu
Wang, Xiaoxuan
Gao, Qinghe
2022 14TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2022), 2022, : 161 - 165
[46] A multi-agent reinforcement learning method with curriculum transfer for large-scale dynamic traffic signal control
Li, Xuesi
Li, Jingchen
Shi, Haobin
APPLIED INTELLIGENCE, 2023, 53 (18) : 21433 - 21447
[47] Graph-based multi-agent reinforcement learning for large-scale UAVs swarm system control
Zhao, Bocheng
Huo, Mingying
Li, Zheng
Yu, Ze
Qi, Naiming
AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 150
[48] A Large-Scale Multi-Agent Deep Reinforcement Learning Method for Cooperative Output Voltage Control of PEMFCs
Li, Jiawen
Cui, Haoyang
Jiang, Wei
Yu, Hengwen
IEEE TRANSACTIONS ON TRANSPORTATION ELECTRIFICATION, 2024, 10 (01): : 78 - 94
[49] Digital Twin Enhanced Multi-Agent Reinforcement Learning for Large-Scale Mobile Network Coverage Optimization
Liu, Haoqiang
Su, Weikang
Li, Tong
Huang, Wenzhen
Li, Yong
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2025, 19 (01)
[50] Hierarchical nearly cyclic pursuit for consensus in large-scale multi-agent systems
Iqbal, Muhammad
Leth, John
Trung Dung Ngo
IET CONTROL THEORY AND APPLICATIONS, 2017, 11 (05): : 740 - 746

← 1 2 3 4 5 →