ASN: action semantics network for multiagent reinforcement learning

被引：2

作者：

Yang, Tianpei ^{[1
,2
,3
]}

Wang, Weixun ^{[4
]}

Hao, Jianye ^{[1
,5
]}

Taylor, Matthew E. ^{[2
,3
]}

Liu, Yong ^{[6
]}

Hao, Xiaotian ^{[1
]}

Hu, Yujing ^{[4
]}

Chen, Yingfeng ^{[4
]}

Fan, Changjie ^{[4
]}

Ren, Chunxu ^{[4
]}

Huang, Ye ^{[4
]}

Zhu, Jiangcheng ^{[5
]}

Gao, Yang ^{[6
]}

机构：

[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China

[2] Univ Alberta, Dept Comp Sci, Edmonton, AB, Canada

[3] Alberta Machine Intelligence Inst Amii, Edmonton, AB, Canada

[4] Fuxi AI Lab, NetEase, Hangzhou, Peoples R China

[5] Huawei, Shenzhen, Peoples R China

[6] Nanjing Univ, Nanjing, Peoples R China

来源：

AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS | 2023年 / 37卷 / 02期

基金：

加拿大自然科学与工程研究理事会; 中国国家自然科学基金;

关键词：

Multiagent reinforcement learning; Multiagent coordination; Deep reinforcement learning;

D O I：

10.1007/s10458-023-09628-3

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In multiagent systems (MASs), each agent makes individual decisions but all contribute globally to the system's evolution. Learning in MASs is difficult since each agent's selection of actions must take place in the presence of other co-learning agents. Moreover, the environmental stochasticity and uncertainties increase exponentially with the number of agents. Previous works borrow various multiagent coordination mechanisms for use in deep learning architectures to facilitate multiagent coordination. However, none of them explicitly consider that different actions can have different influence on other agents, which we call the action semantics. In this paper, we propose a novel network architecture, named Action Semantics Network (ASN), that explicitly represents such action semantics between agents. ASN characterizes different actions' influence on other agents using neural networks based on the action semantics between them. ASN can be easily combined with existing deep reinforcement learning (DRL) algorithms to boost their performance. Experimental results on StarCraft II micromanagement and Neural MMO show that ASN significantly improves the performance of state-of-the-art DRL approaches, compared with several other network architectures. We also successfully deploy ASN to a popular online MMORPG game called Justice Online, which indicates a promising future for ASN to be applied in even more complex scenarios.

引用

页数：37

共 50 条

[1] ASN: action semantics network for multiagent reinforcement learning
Tianpei Yang
Weixun Wang
Jianye Hao
Matthew E. Taylor
Yong Liu
Xiaotian Hao
Yujing Hu
Yingfeng Chen
Changjie Fan
Chunxu Ren
Ye Huang
Jiangcheng Zhu
Yang Gao
Autonomous Agents and Multi-Agent Systems, 2023, 37
[2] A survey and critique of multiagent deep reinforcement learning
Hernandez-Leal, Pablo
Kartal, Bilal
Taylor, Matthew E.
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2019, 33 (06) : 750 - 797
[3] A survey and critique of multiagent deep reinforcement learning
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
Autonomous Agents and Multi-Agent Systems, 2019, 33 : 750 - 797
[4] Mean-Field Multiagent Reinforcement Learning: A Decentralized Network Approach
Gu, Haotian
Guo, Xin
Wei, Xiaoli
Xu, Renyuan
MATHEMATICS OF OPERATIONS RESEARCH, 2025, 50 (01) : 506 - 536
[5] Adversarial Attacks on Multiagent Deep Reinforcement Learning Models in Continuous Action Space
Zhou, Ziyuan
Liu, Guanjun
Guo, Weiran
Zhou, MengChu
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (12): : 7633 - 7646
[6] Simultaneously Learning and Advising in Multiagent Reinforcement Learning
da Silva, Felipe Leno
Glatt, Ruben
Reali Costa, Anna Helena
AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1100 - 1108
[7] Learning Cooperative Behaviours in Multiagent Reinforcement Learning
Phon-Amnuaisuk, Somnuk
NEURAL INFORMATION PROCESSING, PT 1, PROCEEDINGS, 2009, 5863 : 570 - 579
[8] Multiagent Adversarial Inverse Reinforcement Learning
Wei, Ermo
Wicke, Drew
Luke, Sean
AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2265 - 2266
[9] Cascaded Attention: Adaptive and Gated Graph Attention Network for Multiagent Reinforcement Learning
Qi, Shuhan
Huang, Xinhao
Peng, Peixi
Huang, Xuzhong
Zhang, Jiajia
Wang, Xuan
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3769 - 3779
[10] Multiagent Reinforcement Social Learning toward Coordination in Cooperative Multiagent Systems
Hao, Jianye
Leung, Ho-Fung
Ming, Zhong
ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2015, 9 (04)

← 1 2 3 4 5 →