ASN: action semantics network for multiagent reinforcement learning

被引：0

作者：

Tianpei Yang

Weixun Wang

Jianye Hao

Matthew E. Taylor

Yong Liu

Xiaotian Hao

Yujing Hu

Yingfeng Chen

Changjie Fan

Chunxu Ren

Ye Huang

Jiangcheng Zhu

Yang Gao

机构：

[1] Tianjin University,College of Intelligence and Computing

[2] University of Alberta,Department of Computing Science

[3] Alberta Machine Intelligence Institute (Amii),Fuxi AI Lab

[4] NetEase,undefined

[5] Huawei,undefined

[6] Nanjing University,undefined

来源：

Autonomous Agents and Multi-Agent Systems | 2023年 / 37卷

关键词：

Multiagent reinforcement learning; Multiagent coordination; Deep reinforcement learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In multiagent systems (MASs), each agent makes individual decisions but all contribute globally to the system’s evolution. Learning in MASs is difficult since each agent’s selection of actions must take place in the presence of other co-learning agents. Moreover, the environmental stochasticity and uncertainties increase exponentially with the number of agents. Previous works borrow various multiagent coordination mechanisms for use in deep learning architectures to facilitate multiagent coordination. However, none of them explicitly consider that different actions can have different influence on other agents, which we call the action semantics. In this paper, we propose a novel network architecture, named Action Semantics Network (ASN), that explicitly represents such action semantics between agents. ASN characterizes different actions’ influence on other agents using neural networks based on the action semantics between them. ASN can be easily combined with existing deep reinforcement learning (DRL) algorithms to boost their performance. Experimental results on StarCraft II micromanagement and Neural MMO show that ASN significantly improves the performance of state-of-the-art DRL approaches, compared with several other network architectures. We also successfully deploy ASN to a popular online MMORPG game called Justice Online, which indicates a promising future for ASN to be applied in even more complex scenarios.

引用

共 50 条

[21] Deep neural network based missing data prediction of electrocardiogram signal using multiagent reinforcement learning
Banerjee, Soumyendu
Singh, Girish Kumar
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 67
[22] Multiagent Learning and Coordination with Clustered Deep Q-Network
Pageaud, Simon
Deslandres, Veronique
Lehoux, Vassilissa
Hassas, Salima
AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2156 - 2158
[23] Exploration strategies in n-Person general-sum multiagent reinforcement learning with sequential action selection
Akramizadeh, Ali
Afshar, Ahmad
Menhaj, Mohammad B.
INTELLIGENT DATA ANALYSIS, 2011, 15 (06) : 913 - 929
[24] Hierarchical multiagent reinforcement learning schemes for air traffic management
Spatharis, Christos
Bastas, Alevizos
Kravaris, Theocharis
Blekas, Konstantinos
Vouros, George A.
Manuel Cordero, Jose
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (01) : 147 - 159
[25] Potential-Based Difference Rewards for Multiagent Reinforcement Learning
Devlin, Sam
Yliniemi, Logan
Kudenko, Daniel
Tumer, Kagan
AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 165 - 172
[26] Hierarchical multiagent reinforcement learning schemes for air traffic management
Christos Spatharis
Alevizos Bastas
Theocharis Kravaris
Konstantinos Blekas
George A. Vouros
Jose Manuel Cordero
Neural Computing and Applications, 2023, 35 : 147 - 159
[27] Implementing Traffic Signal Optimal Control by Multiagent Reinforcement Learning
Song, Jiong
Jin, Zhao
Zhu, WenJun
2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 2578 - 2582
[28] Multiagent reinforcement learning in extensive form games with complete information
Akramizadeh, Ali
Menhaj, Mohammad-B.
Afshar, Ahmad
ADPRL: 2009 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2009, : 205 - 211
[29] The dynamics of reinforcement social learning in networked cooperative multiagent systems
Hao, Jianye
Huang, Dongping
Cai, Yi
Leung, Ho-fung
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 58 : 111 - 122
[30] V-Learning-A Simple, Efficient, Decentralized Algorithm for Multiagent Reinforcement Learning
Jin, Chi
Liu, Qinghua
Wang, Yuanhao
Yu, Tiancheng
MATHEMATICS OF OPERATIONS RESEARCH, 2024, 49 (04) : 2295 - 2322

← 1 2 3 4 5 →