GAMA: Graph Attention Multi-agent reinforcement learning algorithm for cooperation

Cited by: 0
Authors
Haoqiang Chen
Yadong Liu
Zongtan Zhou
Dewen Hu
Ming Zhang
Affiliation
[1] College of Intelligence Science and Technology, National University of Defense Technology
Source
Applied Intelligence | 2020 / Vol. 50
Keywords
Multi-agent; Reinforcement learning; Graph network; Attention mechanism
DOI
Not available
Abstract
Multi-agent reinforcement learning (MARL) is an important way to realize multi-agent cooperation, but challenges such as scalability and environmental uncertainty still limit its application. In this paper, we address these problems with a graph network and an attention mechanism, extending an existing algorithm to obtain a new algorithm called GAMA. Through the graph network, environment information is shared among agents, while the attention mechanism filters out unimportant information, which improves communication efficiency. As a result, GAMA obtained the highest mean episode rewards among the compared baselines and showed excellent scalability. We chose the graph network because understanding the relationships among agents plays a key role in solving multi-agent problems, and graph networks are well suited to expressing a relational inductive bias. Our experiments showed that, with the attention mechanism integrated, agents could figure out their relationships and focus on the influential environmental factors.
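The abstract does not give GAMA's architecture, so the following is only a generic illustration of the idea it describes: agents share information over a graph, and attention weights decide how much each neighbour's information counts. It is a minimal sketch using plain scaled dot-product attention, not the authors' implementation. The function names (`softmax`, `attention_aggregate`), the adjacency-matrix representation, and the example numbers are all assumptions made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_aggregate(features, adjacency):
    """Aggregate neighbour features for every agent with attention weights.

    features:  list of per-agent feature vectors (hypothetical observation encodings)
    adjacency: square 0/1 matrix; adjacency[i][j] == 1 means agent i can hear agent j
    Returns (weights, messages), where weights[i] maps neighbour index -> weight
    and messages[i] is the weighted sum of that agent's neighbour features.
    """
    dim = len(features[0])
    scale = math.sqrt(dim)  # standard scaling for dot-product attention
    all_weights, messages = [], []
    for i, query in enumerate(features):
        neighbours = [j for j, linked in enumerate(adjacency[i]) if linked]
        if not neighbours:
            # An isolated agent receives an empty (all-zero) message.
            all_weights.append({})
            messages.append([0.0] * dim)
            continue
        # Similarity between this agent and each neighbour.
        scores = [
            sum(q * k for q, k in zip(query, features[j])) / scale
            for j in neighbours
        ]
        weights = softmax(scores)
        # Weighted sum of neighbour features; low-weight neighbours contribute little.
        message = [
            sum(w * features[j][d] for w, j in zip(weights, neighbours))
            for d in range(dim)
        ]
        all_weights.append(dict(zip(neighbours, weights)))
        messages.append(message)
    return all_weights, messages


# Three fully connected agents with made-up 2-D features.
features = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
adjacency = [[0, 1, 1], [1, 0, 1], [1, 1, 0]]
weights, messages = attention_aggregate(features, adjacency)
```

In this toy setup agent 0 assigns a larger weight to agent 1, whose features match its own, than to agent 2, which is the sense in which attention can down-weight less relevant information. A learned model would typically apply trainable query, key, and value projections before this step; they are omitted here to keep the sketch short.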
Pages: 4195 - 4205
Number of pages: 10
Related papers
50 in total
  • [1] GAMA: Graph Attention Multi-agent reinforcement learning algorithm for cooperation
    Chen, Haoqiang
    Liu, Yadong
    Zhou, Zongtan
    Hu, Dewen
    Zhang, Ming
    APPLIED INTELLIGENCE, 2020, 50 (12) : 4195 - 4205
  • [2] A Soft Graph Attention Reinforcement Learning for Multi-Agent Cooperation
    Wang, Huimu
    Pu, Zhiqiang
    Liu, Zhen
    Yi, Jianqiang
    Qiu, Tenghai
    2020 IEEE 16TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2020, : 1257 - 1262
  • [3] Packet Routing with Graph Attention Multi-Agent Reinforcement Learning
    Mai, Xuan
    Fu, Quanzhi
    Chen, Yi
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [4] Research on cooperation and reinforcement learning algorithm in multi-agent systems
    Zheng, Shuli
    Han, Jianghong
    Luo, Xiangfeng
    Jiang, Jianwen
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2002, 15 (04) : 453 - 457
  • [5] Multi-Agent Cognition Difference Reinforcement Learning for Multi-Agent Cooperation
    Wang, Huimu
    Qiu, Tenghai
    Liu, Zhen
    Pu, Zhiqiang
    Yi, Jianqiang
    Yuan, Wanmai
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [6] Multi-Agent Hierarchical Graph Attention Actor-Critic Reinforcement Learning
    Li, Tongyue
    Shi, Dianxi
    Jin, Songchang
    Wang, Zhen
    Yang, Huanhuan
    Chen, Yang
    ENTROPY, 2025, 27 (01)
  • [7] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
    Malysheva, Aleksandra
    Kudenko, Daniel
    Shpilman, Aleksei
    2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019, : 171 - 176
  • [8] Attention Enhanced Reinforcement Learning for Multi agent Cooperation
    Pu, Zhiqiang
    Wang, Huimu
    Liu, Zhen
    Yi, Jianqiang
    Wu, Shiguang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 8235 - 8249
  • [9] Self-Clustering Hierarchical Multi-Agent Reinforcement Learning With Extensible Cooperation Graph
    Fu, Qingxu
    Qiu, Tenghai
    Yi, Jianqiang
    Pu, Zhiqiang
    Ai, Xiaolin
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [10] A cooperation model using reinforcement learning for multi-agent
    Lee, M
    Lee, J
    Jeong, HJ
    Lee, Y
    Choi, S
    Gatton, TM
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2006, PT 5, 2006, 3984 : 675 - 681