ASN: action semantics network for multiagent reinforcement learning

被引:2
|
作者
Yang, Tianpei [1 ,2 ,3 ]
Wang, Weixun [4 ]
Hao, Jianye [1 ,5 ]
Taylor, Matthew E. [2 ,3 ]
Liu, Yong [6 ]
Hao, Xiaotian [1 ]
Hu, Yujing [4 ]
Chen, Yingfeng [4 ]
Fan, Changjie [4 ]
Ren, Chunxu [4 ]
Huang, Ye [4 ]
Zhu, Jiangcheng [5 ]
Gao, Yang [6 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Univ Alberta, Dept Comp Sci, Edmonton, AB, Canada
[3] Alberta Machine Intelligence Inst Amii, Edmonton, AB, Canada
[4] Fuxi AI Lab, NetEase, Hangzhou, Peoples R China
[5] Huawei, Shenzhen, Peoples R China
[6] Nanjing Univ, Nanjing, Peoples R China
基金
加拿大自然科学与工程研究理事会; 中国国家自然科学基金;
关键词
Multiagent reinforcement learning; Multiagent coordination; Deep reinforcement learning;
D O I
10.1007/s10458-023-09628-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In multiagent systems (MASs), each agent makes individual decisions but all contribute globally to the system's evolution. Learning in MASs is difficult since each agent's selection of actions must take place in the presence of other co-learning agents. Moreover, the environmental stochasticity and uncertainties increase exponentially with the number of agents. Previous works borrow various multiagent coordination mechanisms for use in deep learning architectures to facilitate multiagent coordination. However, none of them explicitly consider that different actions can have different influence on other agents, which we call the action semantics. In this paper, we propose a novel network architecture, named Action Semantics Network (ASN), that explicitly represents such action semantics between agents. ASN characterizes different actions' influence on other agents using neural networks based on the action semantics between them. ASN can be easily combined with existing deep reinforcement learning (DRL) algorithms to boost their performance. Experimental results on StarCraft II micromanagement and Neural MMO show that ASN significantly improves the performance of state-of-the-art DRL approaches, compared with several other network architectures. We also successfully deploy ASN to a popular online MMORPG game called Justice Online, which indicates a promising future for ASN to be applied in even more complex scenarios.
引用
收藏
页数:37
相关论文
共 50 条
  • [41] Multiagent reinforcement learning for strictly constrained tasks based on Reward Recorder
    Ding, Lifu
    Yan, Gangfeng
    Liu, Jianing
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) : 8387 - 8411
  • [42] Interpreting Primal-Dual Algorithms for Constrained Multiagent Reinforcement Learning
    Tabas, Daniel
    Zamzam, Ahmed S.
    Zhang, Baosen
    LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
  • [43] Multiagent Reinforcement Learning Methods to Resolve Demand Capacity Balance Problems
    Spatharis, Christos
    Kravaris, Theocharis
    Vouros, George A.
    Blekas, Konstantinos
    Chalkiadakis, Georgios
    Cordero Garcia, Jose Manuel
    Calvo Fernandez, Esther
    10TH HELLENIC CONFERENCE ON ARTIFICIAL INTELLIGENCE (SETN 2018), 2018,
  • [44] Policy Evaluation and Seeking for Multiagent Reinforcement Learning via Best Response
    Yan, Rui
    Duan, Xiaoming
    Shi, Zongying
    Zhong, Yisheng
    Marden, Jason R.
    Bullo, Francesco
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (04) : 1898 - 1913
  • [45] Evolving Equilibrium Policies for a Multiagent Reinforcement Learning Problem with State Attractors
    Leon, Florin
    COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, PT II: THIRD INTERNATIONAL CONFERENCE, ICCCI 2011, 2011, 6923 : 201 - 210
  • [46] Hierarchical Method for Cooperative Multiagent Reinforcement Learning in Markov Decision Processes
    V. E. Bolshakov
    A. N. Alfimtsev
    Doklady Mathematics, 2023, 108 : S382 - S392
  • [47] Centralized Norm Enforcement in Mixed-Motive Multiagent Reinforcement Learning
    Cheang, Rafael M.
    Brandao, Anarosa A. F.
    Sichman, Jaime S.
    COORDINATION, ORGANIZATIONS, INSTITUTIONS, NORMS, AND ETHICS FOR GOVERNANCE OF MULTI-AGENT SYSTEMS XV, 2022, 13549 : 121 - 133
  • [48] Hierarchical Method for Cooperative Multiagent Reinforcement Learning in Markov Decision Processes
    Bolshakov, V. E.
    Alfimtsev, A. N.
    DOKLADY MATHEMATICS, 2023, 108 (SUPPL 2) : S382 - S392
  • [49] Temporal graph convolutional network for multi-agent reinforcement learning of action detection
    Wang, Liangliang
    Liu, Jiayao
    Wang, Ke
    Ge, Lianzheng
    Liang, Peidong
    APPLIED SOFT COMPUTING, 2024, 163
  • [50] MULTIAGENT COORDINATION SYSTEMS BASED ON NEURO-FUZZY MODELS WITH REINFORCEMENT LEARNING
    Mendoza, Leonardo Forero
    Batista, Evelyn
    de Mello, Harold Dias
    Pacheco, Marco Aurelio
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 931 - 937