ASN: action semantics network for multiagent reinforcement learning

被引：2

作者：

Yang, Tianpei ^{[1
,2
,3
]}

Wang, Weixun ^{[4
]}

Hao, Jianye ^{[1
,5
]}

Taylor, Matthew E. ^{[2
,3
]}

Liu, Yong ^{[6
]}

Hao, Xiaotian ^{[1
]}

Hu, Yujing ^{[4
]}

Chen, Yingfeng ^{[4
]}

Fan, Changjie ^{[4
]}

Ren, Chunxu ^{[4
]}

Huang, Ye ^{[4
]}

Zhu, Jiangcheng ^{[5
]}

Gao, Yang ^{[6
]}

机构：

[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China

[2] Univ Alberta, Dept Comp Sci, Edmonton, AB, Canada

[3] Alberta Machine Intelligence Inst Amii, Edmonton, AB, Canada

[4] Fuxi AI Lab, NetEase, Hangzhou, Peoples R China

[5] Huawei, Shenzhen, Peoples R China

[6] Nanjing Univ, Nanjing, Peoples R China

来源：

AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS | 2023年 / 37卷 / 02期

基金：

加拿大自然科学与工程研究理事会; 中国国家自然科学基金;

关键词：

Multiagent reinforcement learning; Multiagent coordination; Deep reinforcement learning;

D O I：

10.1007/s10458-023-09628-3

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In multiagent systems (MASs), each agent makes individual decisions but all contribute globally to the system's evolution. Learning in MASs is difficult since each agent's selection of actions must take place in the presence of other co-learning agents. Moreover, the environmental stochasticity and uncertainties increase exponentially with the number of agents. Previous works borrow various multiagent coordination mechanisms for use in deep learning architectures to facilitate multiagent coordination. However, none of them explicitly consider that different actions can have different influence on other agents, which we call the action semantics. In this paper, we propose a novel network architecture, named Action Semantics Network (ASN), that explicitly represents such action semantics between agents. ASN characterizes different actions' influence on other agents using neural networks based on the action semantics between them. ASN can be easily combined with existing deep reinforcement learning (DRL) algorithms to boost their performance. Experimental results on StarCraft II micromanagement and Neural MMO show that ASN significantly improves the performance of state-of-the-art DRL approaches, compared with several other network architectures. We also successfully deploy ASN to a popular online MMORPG game called Justice Online, which indicates a promising future for ASN to be applied in even more complex scenarios.

引用

页数：37

共 50 条

[41] Multiagent reinforcement learning for strictly constrained tasks based on Reward Recorder
Ding, Lifu
Yan, Gangfeng
Liu, Jianing
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) : 8387 - 8411
[42] Interpreting Primal-Dual Algorithms for Constrained Multiagent Reinforcement Learning
Tabas, Daniel
Zamzam, Ahmed S.
Zhang, Baosen
LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
[43] Multiagent Reinforcement Learning Methods to Resolve Demand Capacity Balance Problems
Spatharis, Christos
Kravaris, Theocharis
Vouros, George A.
Blekas, Konstantinos
Chalkiadakis, Georgios
Cordero Garcia, Jose Manuel
Calvo Fernandez, Esther
10TH HELLENIC CONFERENCE ON ARTIFICIAL INTELLIGENCE (SETN 2018), 2018,
[44] Policy Evaluation and Seeking for Multiagent Reinforcement Learning via Best Response
Yan, Rui
Duan, Xiaoming
Shi, Zongying
Zhong, Yisheng
Marden, Jason R.
Bullo, Francesco
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (04) : 1898 - 1913
[45] Evolving Equilibrium Policies for a Multiagent Reinforcement Learning Problem with State Attractors
Leon, Florin
COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, PT II: THIRD INTERNATIONAL CONFERENCE, ICCCI 2011, 2011, 6923 : 201 - 210
[46] Hierarchical Method for Cooperative Multiagent Reinforcement Learning in Markov Decision Processes
V. E. Bolshakov
A. N. Alfimtsev
Doklady Mathematics, 2023, 108 : S382 - S392
[47] Centralized Norm Enforcement in Mixed-Motive Multiagent Reinforcement Learning
Cheang, Rafael M.
Brandao, Anarosa A. F.
Sichman, Jaime S.
COORDINATION, ORGANIZATIONS, INSTITUTIONS, NORMS, AND ETHICS FOR GOVERNANCE OF MULTI-AGENT SYSTEMS XV, 2022, 13549 : 121 - 133
[48] Hierarchical Method for Cooperative Multiagent Reinforcement Learning in Markov Decision Processes
Bolshakov, V. E.
Alfimtsev, A. N.
DOKLADY MATHEMATICS, 2023, 108 (SUPPL 2) : S382 - S392
[49] Temporal graph convolutional network for multi-agent reinforcement learning of action detection
Wang, Liangliang
Liu, Jiayao
Wang, Ke
Ge, Lianzheng
Liang, Peidong
APPLIED SOFT COMPUTING, 2024, 163
[50] MULTIAGENT COORDINATION SYSTEMS BASED ON NEURO-FUZZY MODELS WITH REINFORCEMENT LEARNING
Mendoza, Leonardo Forero
Batista, Evelyn
de Mello, Harold Dias
Pacheco, Marco Aurelio
2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 931 - 937

← 1 2 3 4 5 →