ASN: action semantics network for multiagent reinforcement learning

被引:0
|
作者
Tianpei Yang
Weixun Wang
Jianye Hao
Matthew E. Taylor
Yong Liu
Xiaotian Hao
Yujing Hu
Yingfeng Chen
Changjie Fan
Chunxu Ren
Ye Huang
Jiangcheng Zhu
Yang Gao
机构
[1] Tianjin University,College of Intelligence and Computing
[2] University of Alberta,Department of Computing Science
[3] Alberta Machine Intelligence Institute (Amii),Fuxi AI Lab
[4] NetEase,undefined
[5] Huawei,undefined
[6] Nanjing University,undefined
来源
Autonomous Agents and Multi-Agent Systems | 2023年 / 37卷
关键词
Multiagent reinforcement learning; Multiagent coordination; Deep reinforcement learning;
D O I
暂无
中图分类号
学科分类号
摘要
In multiagent systems (MASs), each agent makes individual decisions but all contribute globally to the system’s evolution. Learning in MASs is difficult since each agent’s selection of actions must take place in the presence of other co-learning agents. Moreover, the environmental stochasticity and uncertainties increase exponentially with the number of agents. Previous works borrow various multiagent coordination mechanisms for use in deep learning architectures to facilitate multiagent coordination. However, none of them explicitly consider that different actions can have different influence on other agents, which we call the action semantics. In this paper, we propose a novel network architecture, named Action Semantics Network (ASN), that explicitly represents such action semantics between agents. ASN characterizes different actions’ influence on other agents using neural networks based on the action semantics between them. ASN can be easily combined with existing deep reinforcement learning (DRL) algorithms to boost their performance. Experimental results on StarCraft II micromanagement and Neural MMO show that ASN significantly improves the performance of state-of-the-art DRL approaches, compared with several other network architectures. We also successfully deploy ASN to a popular online MMORPG game called Justice Online, which indicates a promising future for ASN to be applied in even more complex scenarios.
引用
收藏
相关论文
共 50 条
  • [21] Deep neural network based missing data prediction of electrocardiogram signal using multiagent reinforcement learning
    Banerjee, Soumyendu
    Singh, Girish Kumar
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 67
  • [22] Multiagent Learning and Coordination with Clustered Deep Q-Network
    Pageaud, Simon
    Deslandres, Veronique
    Lehoux, Vassilissa
    Hassas, Salima
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2156 - 2158
  • [23] Exploration strategies in n-Person general-sum multiagent reinforcement learning with sequential action selection
    Akramizadeh, Ali
    Afshar, Ahmad
    Menhaj, Mohammad B.
    INTELLIGENT DATA ANALYSIS, 2011, 15 (06) : 913 - 929
  • [24] Hierarchical multiagent reinforcement learning schemes for air traffic management
    Spatharis, Christos
    Bastas, Alevizos
    Kravaris, Theocharis
    Blekas, Konstantinos
    Vouros, George A.
    Manuel Cordero, Jose
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (01) : 147 - 159
  • [25] Potential-Based Difference Rewards for Multiagent Reinforcement Learning
    Devlin, Sam
    Yliniemi, Logan
    Kudenko, Daniel
    Tumer, Kagan
    AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 165 - 172
  • [26] Hierarchical multiagent reinforcement learning schemes for air traffic management
    Christos Spatharis
    Alevizos Bastas
    Theocharis Kravaris
    Konstantinos Blekas
    George A. Vouros
    Jose Manuel Cordero
    Neural Computing and Applications, 2023, 35 : 147 - 159
  • [27] Implementing Traffic Signal Optimal Control by Multiagent Reinforcement Learning
    Song, Jiong
    Jin, Zhao
    Zhu, WenJun
    2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 2578 - 2582
  • [28] Multiagent reinforcement learning in extensive form games with complete information
    Akramizadeh, Ali
    Menhaj, Mohammad-B.
    Afshar, Ahmad
    ADPRL: 2009 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2009, : 205 - 211
  • [29] The dynamics of reinforcement social learning in networked cooperative multiagent systems
    Hao, Jianye
    Huang, Dongping
    Cai, Yi
    Leung, Ho-fung
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 58 : 111 - 122
  • [30] V-Learning-A Simple, Efficient, Decentralized Algorithm for Multiagent Reinforcement Learning
    Jin, Chi
    Liu, Qinghua
    Wang, Yuanhao
    Yu, Tiancheng
    MATHEMATICS OF OPERATIONS RESEARCH, 2024, 49 (04) : 2295 - 2322