Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems

Authors
Fu, Qingxu [1, 2]
Qiu, Tenghai [1]
Yi, Jianqiang [1, 2]
Pu, Zhiqiang [1, 2]
Wu, Shiguang [1, 2]
Affiliations
[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
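Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract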
When dealing with a series of imminent issues, humans can naturally concentrate on a subset of these issues by prioritizing them according to their contributions to motivational indices, e.g., the probability of winning a game. This idea of concentration offers insights into reinforcement learning of sophisticated Large-scale Multi-Agent Systems (LMAS) involving hundreds of agents. In such an LMAS, each agent receives a long series of entity observations at each step, which can overwhelm existing aggregation networks such as graph attention networks and cause inefficiency. In this paper, we propose a concentration network called ConcNet. First, ConcNet scores the observed entities according to several motivational indices, e.g., the expected survival time and state value of the agents, and then ranks, prunes, and aggregates the encodings of the observed entities to extract features. Second, unlike the well-known attention mechanism, ConcNet has a dedicated motivational subnetwork that explicitly considers the motivational indices when scoring the observed entities. Furthermore, we present a concentration policy gradient architecture that can learn effective policies in LMAS from scratch. Extensive experiments demonstrate that the presented architecture has excellent scalability and flexibility, and significantly outperforms existing methods on LMAS benchmarks.
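The score, rank, prune, and aggregate steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `concentrate`, the hand-written scores (standing in for the output of a learned motivational subnetwork), the cutoff `k`, and mean pooling as the aggregator are all assumptions made for illustration.

```python
from typing import List, Sequence


def concentrate(
    encodings: Sequence[Sequence[float]],
    scores: Sequence[float],
    k: int,
) -> List[float]:
    """Rank entities by score, keep the top k, and mean-pool their encodings.

    Hypothetical helper: the scores stand in for a learned motivational
    subnetwork, and mean pooling is an assumed aggregator.
    """
    if len(encodings) != len(scores):
        raise ValueError("one score per entity encoding is required")
    if not 1 <= k <= len(encodings):
        raise ValueError("k must be between 1 and the number of entities")

    # Rank: entity indices sorted by descending score (stable for ties).
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    # Prune: keep only the k highest-scoring entities.
    kept = ranked[:k]
    # Aggregate: element-wise mean over the kept encodings.
    dim = len(encodings[0])
    return [sum(encodings[i][d] for i in kept) / k for d in range(dim)]


# Four observed entities with 2-dimensional encodings and made-up scores.
entity_encodings = [[1.0, 0.0], [0.0, 1.0], [4.0, 4.0], [2.0, 2.0]]
entity_scores = [0.1, 0.9, 0.5, 0.7]
features = concentrate(entity_encodings, entity_scores, k=2)
print(features)  # [1.0, 1.5]
```

With `k=2`, only the entities scored 0.9 and 0.7 survive pruning, so the extracted feature is the mean of `[0.0, 1.0]` and `[2.0, 2.0]`. The pruning step is what keeps the aggregation cost bounded as the number of observed entities grows.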
Pages: 9341-9349
Page count: 9