Cooperative Multi-Agent Reinforcement Learning with Hierarchical Relation Graph under Partial Observability

被引：8

作者：

Li, Yang ^{[1
]}

Wang, Xinzhi ^{[1
]}

Wang, Jianshu ^{[1
]}

Wang, Wei ^{[1
]}

Luo, Xiangfeng ^{[1
]}

Xie, Shaorong ^{[1
]}

机构：

[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai, Peoples R China

来源：

2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI) | 2020年

基金：

中国国家自然科学基金;

关键词：

Reinforcement Learning; Multi-Agent; Hierarchical Relation Graph;

D O I：

10.1109/ICTAI50040.2020.00011

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Cooperation among agents with partial observation is an important task in multi-agent reinforcement learning (MARL), aiming to maximize a common reward. Most existing cooperative MARL approaches focus on building different model frameworks, such as centralized, decentralized, and centralized training with decentralized execution. These methods employ partial observation of agents as input directly, but rarely consider the local relationship between agents. The local relationship can help agents integrate observation information among different agents in a local range, and then adopt a more effective cooperation policy. In this paper, we propose a MARL method based on spatial relationship called hierarchical relation graph soft actorcritic (HRG-SAC). The method first uses a hierarchical relation graph generation module to represent the spatial relationship between agents in local space. Second, it integrates feature information of the relation graph through the graph convolution network (GCN). Finally, the soft actor-critic (SAC) is used to optimize agents' actions in training for compliance control. We conduct experiments on the Food Collector task and compare HRG-SAC with three baseline methods. The results demonstrate that the hierarchical relation graph can significantly improve MARL performance in the cooperative task.T

引用

页码：1 / 8

页数：8

共 50 条

[31] Mention Recommendation in Twitter with Cooperative Multi-Agent Reinforcement Learning
Gui, Tao
Liu, Peng
Zhang, Qi
Zhu, Liang
Peng, Minlong
Zhou, Yunhua
Huang, Xuanjing
[J]. PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 535 - 544
[32] Multi-hop temporal knowledge graph reasoning with multi-agent reinforcement learning
Bai, Luyi
Chen, Mingzhuo
Xiao, Qianwen
[J]. APPLIED SOFT COMPUTING, 2024, 160
[33] Multi-Agent Reinforcement Learning for Cooperative Adaptive Cruise Control
Peake, Ashley
McCalmon, Joe
Raiford, Benjamin
Liu, Tongtong
Alqahtani, Sarra
[J]. 2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 15 - 22
[34] Multi-Agent Evolutionary Reinforcement Learning Based on Cooperative Games
Yu, Jin
Zhang, Ya
Sun, Changyin
[J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
[35] Action Prediction for Cooperative Exploration in Multi-agent Reinforcement Learning
Zhang, Yanqiang
Feng, Dawei
Ding, Bo
[J]. NEURAL INFORMATION PROCESSING, ICONIP 2023, PT II, 2024, 14448 : 358 - 372
[36] Cooperative Multi-Agent Deep Reinforcement Learning with Counterfactual Reward
Shao, Kun
Zhu, Yuanheng
Tang, Zhentao
Zhao, Dongbin
[J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
[37] WRFMR: A Multi-Agent Reinforcement Learning Method for Cooperative Tasks
Liu, Hui
Zhang, Zhen
Wang, Dongqing
[J]. IEEE ACCESS, 2020, 8 : 216320 - 216331
[38] Packet Routing with Graph Attention Multi-Agent Reinforcement Learning
Mai, Xuan
Fu, Quanzhi
Chen, Yi
[J]. 2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
[39] Cooperative Reinforcement Learning Algorithm to Distributed Power System Based on Multi-Agent
Gao, La-mei
Zeng, Jun
Wu, Jie
Li, Min
[J]. 2009 3RD INTERNATIONAL CONFERENCE ON POWER ELECTRONICS SYSTEMS AND APPLICATIONS: ELECTRIC VEHICLE AND GREEN ENERGY, 2009, : 53 - 53
[40] Reinforcement Learning with Value Function Decomposition for Hierarchical Multi-Agent Consensus Control
Zhu, Xiaoxia
[J]. MATHEMATICS, 2024, 12 (19)

← 1 2 3 4 5 →