ASN: action semantics network for multiagent reinforcement learning

Cited by: 2
Authors
Yang, Tianpei [1 ,2 ,3 ]
Wang, Weixun [4 ]
Hao, Jianye [1 ,5 ]
Taylor, Matthew E. [2 ,3 ]
Liu, Yong [6 ]
Hao, Xiaotian [1 ]
Hu, Yujing [4 ]
Chen, Yingfeng [4 ]
Fan, Changjie [4 ]
Ren, Chunxu [4 ]
Huang, Ye [4 ]
Zhu, Jiangcheng [5 ]
Gao, Yang [6 ]
Affiliations
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Univ Alberta, Dept Comp Sci, Edmonton, AB, Canada
[3] Alberta Machine Intelligence Inst Amii, Edmonton, AB, Canada
[4] Fuxi AI Lab, NetEase, Hangzhou, Peoples R China
[5] Huawei, Shenzhen, Peoples R China
[6] Nanjing Univ, Nanjing, Peoples R China
Funding
Natural Sciences and Engineering Research Council of Canada; National Natural Science Foundation of China;
Keywords
Multiagent reinforcement learning; Multiagent coordination; Deep reinforcement learning;
DOI
10.1007/s10458-023-09628-3
Chinese Library Classification
TP [Automation and computer technology];
Discipline classification code
0812;
Abstract
In multiagent systems (MASs), each agent makes individual decisions but all contribute globally to the system's evolution. Learning in MASs is difficult since each agent's selection of actions must take place in the presence of other co-learning agents. Moreover, the environmental stochasticity and uncertainties increase exponentially with the number of agents. Previous works borrow various multiagent coordination mechanisms for use in deep learning architectures to facilitate multiagent coordination. However, none of them explicitly consider that different actions can have different influence on other agents, which we call the action semantics. In this paper, we propose a novel network architecture, named Action Semantics Network (ASN), that explicitly represents such action semantics between agents. ASN characterizes different actions' influence on other agents using neural networks based on the action semantics between them. ASN can be easily combined with existing deep reinforcement learning (DRL) algorithms to boost their performance. Experimental results on StarCraft II micromanagement and Neural MMO show that ASN significantly improves the performance of state-of-the-art DRL approaches, compared with several other network architectures. We also successfully deploy ASN to a popular online MMORPG game called Justice Online, which indicates a promising future for ASN to be applied in even more complex scenarios.
Pages: 37
Related papers
50 in total
  • [31] Multiagent Reinforcement Learning With Learning Automata for Microgrid Energy Management and Decision Optimization
    Fang, Xiaohan
    Wang, Jinkuan
    Yin, Chunhui
    Han, Yinghua
    Zhao, Qiang
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 779 - 784
  • [32] Satisficing Paths and Independent Multiagent Reinforcement Learning in Stochastic Games
    Yongacoglu, Bora
    Arslan, Gurdal
    Yuksel, Serdar
    SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2023, 5 (03): : 745 - 773
  • [33] Network Slice Reconfiguration by Exploiting Deep Reinforcement Learning With Large Action Space
    Wei, Fengsheng
    Feng, Gang
    Sun, Yao
    Wang, Yatong
    Qin, Shuang
    Liang, Ying-Chang
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2020, 17 (04): : 2197 - 2211
  • [34] Radar Network Time Scheduling for Multi-Target ISAR Task With Game Theory and Multiagent Reinforcement Learning
    Liu, Xiao-Wen
    Zhang, Qun
    Luo, Ying
    Lu, Xiaofei
    Dong, Chen
    IEEE SENSORS JOURNAL, 2021, 21 (04) : 4462 - 4473
  • [35] Automated Design of Complex Analog Circuits with Multiagent based Reinforcement Learning
    Zhang, Jinxin
    Bao, Jiarui
    Huang, Zhangcheng
    Zeng, Xuan
    Lu, Ye
    2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [36] Human attention guided multiagent hierarchical reinforcement learning for heterogeneous agents
    Liu, Dingbang
    Ren, Fenghui
    Yan, Jun
    Su, Guoxin
    Kato, Shohei
    Gu, Wen
    Zhang, Minjie
    KNOWLEDGE-BASED SYSTEMS, 2025, 316
  • [37] Multiagent reinforcement learning for autonomous driving in traffic zones with unsignalized intersections
    Spatharis, Christos
    Blekas, Konstantinos
    JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 28 (01) : 103 - 119
  • [38] DeRL: Coupling Decomposition in Action Space for Reinforcement Learning Task
    He, Ziming
    Li, Jingchen
    Wu, Fan
    Shi, Haobin
    Hwang, Kao-Shing
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (01): : 1030 - 1043
  • [39] Robust Multiagent Reinforcement Learning for UAV Systems: Countering Byzantine Attacks
    Medhi, Jishu K.
    Liu, Rui
    Wang, Qianlong
    Chen, Xuhui
    INFORMATION, 2023, 14 (11)
  • [40] Achieving Socially Optimal Outcomes in Multiagent Systems with Reinforcement Social Learning
    Hao, Jianye
    Leung, Ho-Fung
    ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2013, 8 (03)