HELSA: Hierarchical Reinforcement Learning with Spatiotemporal Abstraction for Large-Scale Multi-Agent Path Finding

被引:1
|
作者
Song, Zhaoyi [1 ]
Zhang, Rongqing [1 ]
Cheng, Xiang [2 ]
机构
[1] Tongji Univ, Sch Software Engn, Shanghai 200092, Peoples R China
[2] Peking Univ, Sch Elect, Beijing 100871, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
D O I
10.1109/IROS55552.2023.10342261
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Multi-Agent Path Finding (MAPF) problem is a critical challenge in dynamic multi-robot systems. Recent studies have revealed that multi-agent reinforcement learning (MARL) is a promising approach to solving MAPF problems in a fully decentralized manner. However, as the size of the multi-robot system increases, sample inefficiency becomes a major impediment to learning-based methods. This paper presents a hierarchical reinforcement learning (HRL) framework for large-scale multi-agent path finding, featuring applying spatial and temporal abstraction to capture intermediate reward and thus encourage efficient exploration. Specifically, we introduce a meta controller that partitions the map into interconnected regions and optimizes agents' region-wise paths towards globally better solutions. Additionally, we design a lower-level controller that efficiently solves each sub-problem by incorporating heuristic guidance and an inter-agent communication mechanism with RL-based policies. Our empirical results on test instances of various scales demonstrate that our method outperforms existing approaches in terms of both success rate and makespan.
引用
收藏
页码:7318 / 7325
页数:8
相关论文
共 50 条
  • [41] Learning Selective Communication for Multi-Agent Path Finding
    Ma, Ziyuan
    Luo, Yudong
    Pan, Jia
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 1455 - 1462
  • [42] Addressing deadlock in large-scale, complex rail networks via multi-agent deep reinforcement learning
    Bretas, A. M. C.
    Mendes, A.
    Chalup, S.
    Jackson, M.
    Clement, R.
    Sanhueza, C.
    EXPERT SYSTEMS, 2025, 42 (01)
  • [43] A multi-agent reinforcement learning method with curriculum transfer for large-scale dynamic traffic signal control
    Xuesi Li
    Jingchen Li
    Haobin Shi
    Applied Intelligence, 2023, 53 : 21433 - 21447
  • [44] Evolution of a Complex Predator-Prey Ecosystem on Large-scale Multi-Agent Deep Reinforcement Learning
    Yamada, Jun
    Shawe-Taylor, John
    Fountas, Zafeirios
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [45] Distributed Task Offloading for Large-Scale VEC Systems: A Multi-agent Deep Reinforcement Learning Method
    Lu, Yanfei
    Han, Dengyu
    Wang, Xiaoxuan
    Gao, Qinghe
    2022 14TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2022), 2022, : 161 - 165
  • [46] A multi-agent reinforcement learning method with curriculum transfer for large-scale dynamic traffic signal control
    Li, Xuesi
    Li, Jingchen
    Shi, Haobin
    APPLIED INTELLIGENCE, 2023, 53 (18) : 21433 - 21447
  • [47] Graph-based multi-agent reinforcement learning for large-scale UAVs swarm system control
    Zhao, Bocheng
    Huo, Mingying
    Li, Zheng
    Yu, Ze
    Qi, Naiming
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 150
  • [48] A Large-Scale Multi-Agent Deep Reinforcement Learning Method for Cooperative Output Voltage Control of PEMFCs
    Li, Jiawen
    Cui, Haoyang
    Jiang, Wei
    Yu, Hengwen
    IEEE TRANSACTIONS ON TRANSPORTATION ELECTRIFICATION, 2024, 10 (01): : 78 - 94
  • [49] Digital Twin Enhanced Multi-Agent Reinforcement Learning for Large-Scale Mobile Network Coverage Optimization
    Liu, Haoqiang
    Su, Weikang
    Li, Tong
    Huang, Wenzhen
    Li, Yong
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2025, 19 (01)
  • [50] Hierarchical nearly cyclic pursuit for consensus in large-scale multi-agent systems
    Iqbal, Muhammad
    Leth, John
    Trung Dung Ngo
    IET CONTROL THEORY AND APPLICATIONS, 2017, 11 (05): : 740 - 746