Cooperative multi-agent game based on reinforcement learning

Cited by: 1
Authors
Liu, Hongbo [1 ]
Affiliation
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Peoples R China
Source
HIGH-CONFIDENCE COMPUTING | 2024, Vol. 4, No. 01
Keywords
Collaborative multi-agent; Reinforcement learning; Credit distribution; Multi-agent communication; Reward shaping;
DOI
10.1016/j.hcc.2024.100205
CLC Classification Number
TP [Automation Technology, Computer Technology];
Subject Classification Code
0812 ;
Abstract
Multi-agent reinforcement learning holds tremendous potential for revolutionizing intelligent systems across diverse domains. However, it is also accompanied by a set of formidable challenges, including the effective allocation of credit to each agent, real-time collaboration among heterogeneous agents, and the design of an appropriate reward function to guide agent behavior. To handle these issues, we propose an innovative solution named the Graph Attention Counterfactual Multiagent Actor-Critic algorithm (GACMAC). This algorithm encompasses several key components: First, it employs a multi-agent actor-critic framework along with counterfactual baselines to assess the individual actions of each agent. Second, it integrates a graph attention network to enhance real-time collaboration among agents, enabling heterogeneous agents to share information effectively while handling tasks. Third, it incorporates prior human knowledge through a potential-based reward shaping method, thereby improving the convergence speed and stability of the algorithm. We tested our algorithm on the StarCraft Multi-Agent Challenge (SMAC) platform, a recognized benchmark for multi-agent algorithms, where it achieved a win rate of over 95%, comparable to current state-of-the-art multi-agent controllers. (c) 2024 The Author(s). Published by Elsevier B.V. on behalf of Shandong University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
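Two of the building blocks named in the abstract, a counterfactual baseline (as in COMA-style credit assignment) and potential-based reward shaping (Ng et al., 1999), have standard textbook forms. The following Python sketch is not the authors' GACMAC implementation; it only illustrates those two ideas, and the function names (`counterfactual_advantage`, `shaped_reward`) and toy inputs are illustrative assumptions.

```python
# Minimal sketch (not the authors' code) of two components named in the abstract:
# a counterfactual baseline for per-agent credit and potential-based reward shaping.
import numpy as np

def counterfactual_advantage(q_values: np.ndarray, policy: np.ndarray, action: int) -> float:
    """Counterfactual advantage for one agent.

    q_values[a] : joint-action value with this agent's action replaced by a,
                  all other agents' actions held fixed.
    policy[a]   : this agent's probability of choosing action a.
    action      : the action the agent actually took.
    The baseline marginalises out the agent's own action, so the advantage
    isolates that agent's individual contribution to the joint return.
    """
    baseline = float(np.dot(policy, q_values))
    return float(q_values[action] - baseline)

def shaped_reward(r: float, phi_s: float, phi_next: float, gamma: float = 0.99) -> float:
    """Potential-based reward shaping (Ng et al., 1999): r' = r + gamma*Phi(s') - Phi(s).

    Phi encodes prior knowledge about how good a state is; this additive form
    preserves the optimal policy while typically speeding up convergence.
    """
    return r + gamma * phi_next - phi_s

if __name__ == "__main__":
    # Toy example: one agent with 3 discrete actions.
    q = np.array([1.0, 2.5, 0.5])    # joint-action values as the agent's own action varies
    pi = np.array([0.2, 0.5, 0.3])   # the agent's current policy
    print(counterfactual_advantage(q, pi, action=1))   # > 0: better than the agent's average
    print(shaped_reward(r=0.0, phi_s=1.0, phi_next=1.2))
```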
Pages: 11
Related Papers
50 records in total
  • [1] Multi-agent Cooperative Search based on Reinforcement Learning
    Sun, Yinjiang
    Zhang, Rui
    Liang, Wenbao
    Xu, Cheng
    PROCEEDINGS OF 2020 3RD INTERNATIONAL CONFERENCE ON UNMANNED SYSTEMS (ICUS), 2020, : 891 - 896
  • [2] Multi-agent cooperative learning research based on reinforcement learning
    Liu, Fei
    Zeng, Guangzhou
    2006 10TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, PROCEEDINGS, VOLS 1 AND 2, 2006, : 1408 - 1413
  • [3] A Multi-agent Reinforcement Learning Algorithm Based on Stackelberg Game
    Cheng, Chi
    Zhu, Zhangqing
    Xin, Bo
    Chen, Chunlin
    2017 6TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS (DDCLS), 2017, : 727 - 732
  • [4] Cooperative targets assignment based on multi-agent reinforcement learning
    Ma, Y.
    Wu, L.
    Xu, X.
    Systems Engineering and Electronics (Xi Tong Gong Cheng Yu Dian Zi Ji Shu), 2023, 45 (09) : 2793 - 2801
  • [5] A Cooperative Multi-Agent Reinforcement Learning Method Based on Coordination Degree
    Cui, Haoyan
    Zhang, Zhen
    IEEE ACCESS, 2021, 9 : 123805 - 123814
  • [6] Cooperative Learning of Multi-Agent Systems Via Reinforcement Learning
    Wang, Xin
    Zhao, Chen
    Huang, Tingwen
    Chakrabarti, Prasun
    Kurths, Juergen
    IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2023, 9 : 13 - 23
  • [7] A review of cooperative multi-agent deep reinforcement learning
    Oroojlooy, Afshin
    Hajinezhad, Davood
    APPLIED INTELLIGENCE, 2023, 53 (11) : 13677 - 13722
  • [8] Cooperative reinforcement learning in topology-based multi-agent systems
    Xiao, Dan
    Tan, Ah-Hwee
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2013, 26 (01) : 86 - 119
  • [9] A review of cooperative multi-agent deep reinforcement learning
    Afshin Oroojlooy
    Davood Hajinezhad
    Applied Intelligence, 2023, 53 : 13677 - 13722
  • [10] The Cooperative Reinforcement Learning in a Multi-Agent Design System
    Liu, Hong
    Wang, Jihua
    PROCEEDINGS OF THE 2013 IEEE 17TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2013, : 139 - 144