ASN: action semantics network for multiagent reinforcement learning

被引:2
|
作者
Yang, Tianpei [1 ,2 ,3 ]
Wang, Weixun [4 ]
Hao, Jianye [1 ,5 ]
Taylor, Matthew E. [2 ,3 ]
Liu, Yong [6 ]
Hao, Xiaotian [1 ]
Hu, Yujing [4 ]
Chen, Yingfeng [4 ]
Fan, Changjie [4 ]
Ren, Chunxu [4 ]
Huang, Ye [4 ]
Zhu, Jiangcheng [5 ]
Gao, Yang [6 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Univ Alberta, Dept Comp Sci, Edmonton, AB, Canada
[3] Alberta Machine Intelligence Inst Amii, Edmonton, AB, Canada
[4] Fuxi AI Lab, NetEase, Hangzhou, Peoples R China
[5] Huawei, Shenzhen, Peoples R China
[6] Nanjing Univ, Nanjing, Peoples R China
基金
加拿大自然科学与工程研究理事会; 中国国家自然科学基金;
关键词
Multiagent reinforcement learning; Multiagent coordination; Deep reinforcement learning;
D O I
10.1007/s10458-023-09628-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In multiagent systems (MASs), each agent makes individual decisions but all contribute globally to the system's evolution. Learning in MASs is difficult since each agent's selection of actions must take place in the presence of other co-learning agents. Moreover, the environmental stochasticity and uncertainties increase exponentially with the number of agents. Previous works borrow various multiagent coordination mechanisms for use in deep learning architectures to facilitate multiagent coordination. However, none of them explicitly consider that different actions can have different influence on other agents, which we call the action semantics. In this paper, we propose a novel network architecture, named Action Semantics Network (ASN), that explicitly represents such action semantics between agents. ASN characterizes different actions' influence on other agents using neural networks based on the action semantics between them. ASN can be easily combined with existing deep reinforcement learning (DRL) algorithms to boost their performance. Experimental results on StarCraft II micromanagement and Neural MMO show that ASN significantly improves the performance of state-of-the-art DRL approaches, compared with several other network architectures. We also successfully deploy ASN to a popular online MMORPG game called Justice Online, which indicates a promising future for ASN to be applied in even more complex scenarios.
引用
收藏
页数:37
相关论文
共 50 条
  • [11] Multiagent Reinforcement Learning With Unshared Value Functions
    Hu, Yujing
    Gao, Yang
    An, Bo
    IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (04) : 647 - 662
  • [12] QFuture: Learning Future Expectation Cognition in Multiagent Reinforcement Learning
    Liu, Boyin
    Pu, Zhiqiang
    Pan, Yi
    Yi, Jianqiang
    Chen, Min
    Wang, Shijie
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (04) : 1302 - 1314
  • [13] CuMARL: Curiosity-Based Learning in Multiagent Reinforcement Learning
    Ningombam, Devarani Devi
    Yoo, Byunghyun
    Kim, Hyun Woo
    Song, Hwa Jeon
    Yi, Sungwon
    IEEE ACCESS, 2022, 10 : 87254 - 87265
  • [14] Attention-Based Intrinsic Reward Mixing Network for Credit Assignment in Multiagent Reinforcement Learning
    Li, Wei
    Liu, Weiyan
    Shao, Shitong
    Huang, Shiyi
    Song, Aiguo
    IEEE TRANSACTIONS ON GAMES, 2024, 16 (02) : 270 - 281
  • [15] Automated design of action advising trigger conditions for multiagent reinforcement A
    Wang, Tonghao
    Peng, Xingguang
    Wang, Tao
    Liu, Tong
    Xu, Demin
    SWARM AND EVOLUTIONARY COMPUTATION, 2024, 85
  • [16] Opponent portrait for multiagent reinforcement learning in competitive environment
    Ma, Yuxi
    Shen, Meng
    Zhao, Yuhang
    Li, Zhao
    Tong, Xiaoyao
    Zhang, Quanxin
    Wang, Zhi
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (12) : 7461 - 7474
  • [17] Constrained Multiagent Reinforcement Learning for Large Agent Population
    Ling, Jiajing
    Singh, Arambam James
    Thien, Nguyen Duc
    Kumar, Akshat
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT IV, 2023, 13716 : 183 - 199
  • [18] Domain-Aware Multiagent Reinforcement Learning in Navigation
    Saeed, Ifrah
    Cullen, Andrew C.
    Erfani, Sarah
    Alpcan, Tansu
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [19] Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning
    Yang, Chen
    Yang, Guangkai
    Chen, Hao
    Zhang, Junge
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [20] Multiagent reinforcement learning with organizational-learning oriented Classifier System
    Takadama, K
    Nakasuka, S
    Terano, T
    1998 IEEE INTERNATIONAL CONFERENCE ON EVOLUTIONARY COMPUTATION - PROCEEDINGS, 1998, : 63 - 68