Cooperative Multi-Agent Reinforcement Learning with Hierarchical Relation Graph under Partial Observability

被引:8
作者
Li, Yang [1 ]
Wang, Xinzhi [1 ]
Wang, Jianshu [1 ]
Wang, Wei [1 ]
Luo, Xiangfeng [1 ]
Xie, Shaorong [1 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai, Peoples R China
来源
2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI) | 2020年
基金
中国国家自然科学基金;
关键词
Reinforcement Learning; Multi-Agent; Hierarchical Relation Graph;
D O I
10.1109/ICTAI50040.2020.00011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cooperation among agents with partial observation is an important task in multi-agent reinforcement learning (MARL), aiming to maximize a common reward. Most existing cooperative MARL approaches focus on building different model frameworks, such as centralized, decentralized, and centralized training with decentralized execution. These methods employ partial observation of agents as input directly, but rarely consider the local relationship between agents. The local relationship can help agents integrate observation information among different agents in a local range, and then adopt a more effective cooperation policy. In this paper, we propose a MARL method based on spatial relationship called hierarchical relation graph soft actorcritic (HRG-SAC). The method first uses a hierarchical relation graph generation module to represent the spatial relationship between agents in local space. Second, it integrates feature information of the relation graph through the graph convolution network (GCN). Finally, the soft actor-critic (SAC) is used to optimize agents' actions in training for compliance control. We conduct experiments on the Food Collector task and compare HRG-SAC with three baseline methods. The results demonstrate that the hierarchical relation graph can significantly improve MARL performance in the cooperative task.T
引用
收藏
页码:1 / 8
页数:8
相关论文
共 50 条
  • [31] Mention Recommendation in Twitter with Cooperative Multi-Agent Reinforcement Learning
    Gui, Tao
    Liu, Peng
    Zhang, Qi
    Zhu, Liang
    Peng, Minlong
    Zhou, Yunhua
    Huang, Xuanjing
    [J]. PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 535 - 544
  • [32] Multi-hop temporal knowledge graph reasoning with multi-agent reinforcement learning
    Bai, Luyi
    Chen, Mingzhuo
    Xiao, Qianwen
    [J]. APPLIED SOFT COMPUTING, 2024, 160
  • [33] Multi-Agent Reinforcement Learning for Cooperative Adaptive Cruise Control
    Peake, Ashley
    McCalmon, Joe
    Raiford, Benjamin
    Liu, Tongtong
    Alqahtani, Sarra
    [J]. 2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 15 - 22
  • [34] Multi-Agent Evolutionary Reinforcement Learning Based on Cooperative Games
    Yu, Jin
    Zhang, Ya
    Sun, Changyin
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [35] Action Prediction for Cooperative Exploration in Multi-agent Reinforcement Learning
    Zhang, Yanqiang
    Feng, Dawei
    Ding, Bo
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2023, PT II, 2024, 14448 : 358 - 372
  • [36] Cooperative Multi-Agent Deep Reinforcement Learning with Counterfactual Reward
    Shao, Kun
    Zhu, Yuanheng
    Tang, Zhentao
    Zhao, Dongbin
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [37] WRFMR: A Multi-Agent Reinforcement Learning Method for Cooperative Tasks
    Liu, Hui
    Zhang, Zhen
    Wang, Dongqing
    [J]. IEEE ACCESS, 2020, 8 : 216320 - 216331
  • [38] Packet Routing with Graph Attention Multi-Agent Reinforcement Learning
    Mai, Xuan
    Fu, Quanzhi
    Chen, Yi
    [J]. 2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [39] Cooperative Reinforcement Learning Algorithm to Distributed Power System Based on Multi-Agent
    Gao, La-mei
    Zeng, Jun
    Wu, Jie
    Li, Min
    [J]. 2009 3RD INTERNATIONAL CONFERENCE ON POWER ELECTRONICS SYSTEMS AND APPLICATIONS: ELECTRIC VEHICLE AND GREEN ENERGY, 2009, : 53 - 53
  • [40] Reinforcement Learning with Value Function Decomposition for Hierarchical Multi-Agent Consensus Control
    Zhu, Xiaoxia
    [J]. MATHEMATICS, 2024, 12 (19)