HELSA: Hierarchical Reinforcement Learning with Spatiotemporal Abstraction for Large-Scale Multi-Agent Path Finding

被引:1
|
作者
Song, Zhaoyi [1 ]
Zhang, Rongqing [1 ]
Cheng, Xiang [2 ]
机构
[1] Tongji Univ, Sch Software Engn, Shanghai 200092, Peoples R China
[2] Peking Univ, Sch Elect, Beijing 100871, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
D O I
10.1109/IROS55552.2023.10342261
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Multi-Agent Path Finding (MAPF) problem is a critical challenge in dynamic multi-robot systems. Recent studies have revealed that multi-agent reinforcement learning (MARL) is a promising approach to solving MAPF problems in a fully decentralized manner. However, as the size of the multi-robot system increases, sample inefficiency becomes a major impediment to learning-based methods. This paper presents a hierarchical reinforcement learning (HRL) framework for large-scale multi-agent path finding, featuring applying spatial and temporal abstraction to capture intermediate reward and thus encourage efficient exploration. Specifically, we introduce a meta controller that partitions the map into interconnected regions and optimizes agents' region-wise paths towards globally better solutions. Additionally, we design a lower-level controller that efficiently solves each sub-problem by incorporating heuristic guidance and an inter-agent communication mechanism with RL-based policies. Our empirical results on test instances of various scales demonstrate that our method outperforms existing approaches in terms of both success rate and makespan.
引用
收藏
页码:7318 / 7325
页数:8
相关论文
共 50 条
  • [1] Lifelong Multi-Agent Path Finding in Large-Scale Warehouse
    Li, Jiaoyang
    Tinka, Andrew
    Kiesel, Scott
    Durham, Joseph W.
    Kumar, T. K. Satish
    Koenig, Sven
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11272 - 11281
  • [2] Multi-Agent Mix Hierarchical Deep Reinforcement Learning for Large-Scale Fleet Management
    Huang, Xiaohui
    Ling, Jiahao
    Yang, Xiaofei
    Zhang, Xiong
    Yang, Kaiming
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (12) : 14294 - 14305
  • [3] Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems
    Fu, Qingxu
    Qiu, Tenghai
    Yi, Jianqiang
    Pu, Zhiqiang
    Wu, Shiguang
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9341 - 9349
  • [4] CPLNS: Cooperative Parallel Large Neighborhood Search for Large-Scale Multi-Agent Path Finding
    Chen, Kai
    Qu, Qingjun
    Zhu, Feng
    Yi, Zhengming
    Tang, Wenjie
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 35 (11) : 2069 - 2086
  • [5] Tactical reward shaping for large-scale combat by multi-agent reinforcement learning
    DUO Nanxun
    WANG Qinzhao
    LYU Qiang
    WANG Wei
    Journal of Systems Engineering and Electronics, 2024, 35 (06) : 1516 - 1529
  • [6] Tactical Reward Shaping for Large-Scale Combat by Multi-Agent Reinforcement Learning
    Duo, Nanxun
    Wang, Qinzhao
    Lyu, Qiang
    Wang, Wei
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2024, 35 (06) : 1516 - 1529
  • [7] Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control
    Chu, Tianshu
    Wang, Jie
    Codeca, Lara
    Li, Zhaojian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) : 1086 - 1095
  • [8] Hierarchical Coordination Multi-Agent Reinforcement Learning With Spatio-Temporal Abstraction
    Ma, Tinghuai
    Peng, Kexing
    Rong, Huan
    Qian, Yurong
    Al-Nabhan, Najla
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (01): : 533 - 547
  • [9] Hierarchical multi-agent reinforcement learning
    Mohammad Ghavamzadeh
    Sridhar Mahadevan
    Rajbala Makar
    Autonomous Agents and Multi-Agent Systems, 2006, 13 : 197 - 229
  • [10] Hierarchical multi-agent reinforcement learning
    Ghavamzadeh, Mohammad
    Mahadevan, Sridhar
    Makar, Rajbala
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2006, 13 (02) : 197 - 229