Attention-Based Intrinsic Reward Mixing Network for Credit Assignment in Multiagent Reinforcement Learning

被引:5
|
作者
Li, Wei [1 ]
Liu, Weiyan [1 ]
Shao, Shitong [1 ]
Huang, Shiyi [1 ]
Song, Aiguo [1 ]
机构
[1] Southeast Univ, Sch Instrument Sci & Engn, Nanjing 210096, Jiangsu, Peoples R China
关键词
Training; Teamwork; Reinforcement learning; Games; Behavioral sciences; Optimization; Task analysis; Attention mechanism; credit assignment; intrinsic reward; mixing network; multiagent reinforcement learning; LEVEL; GAMES;
D O I
10.1109/TG.2023.3263013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Credit assignment is a critical problem in cooperative multiagent reinforcement learning (MARL). To address this problem, current studies mainly rely on the intrinsic reward, which is directly summed with the global reward to generate a total reward. However, such kinds of intrinsic reward functions ignore the dependence among agents and inevitably limit the adaptivity and effectiveness of MARL methods. In this article, we propose a novel method, Attention-based Intrinsic Reward Mixing Network (AIRMN), for credit assignment in MARL. Specifically, we design a new intrinsic reward network on the basis of the attention mechanism, in order to enhance the effectiveness of teamwork. Besides, we devise a new mixing network that combines the intrinsic and extrinsic rewards in a nonlinear and dynamic manner, so as to adapt the total reward to the variation of the environment. Experimental results on the battle games of StarCraft II demonstrate that AIRMN outperforms the state-of-the-art methods in terms of the average test win rate and also validate that AIRMN can dynamically return the precise intrinsic reward to each agent based on their contributions to the team cooperation, thereby better dealing with the credit assignment problem.
引用
收藏
页码:270 / 281
页数:12
相关论文
共 50 条
  • [1] Intelligent Video Streaming at Network Edge: An Attention-Based Multiagent Reinforcement Learning Solution
    Tang, Xiangdong
    Chen, Fei
    He, Yunlong
    FUTURE INTERNET, 2023, 15 (07)
  • [2] VGN: Value Decomposition With Graph Attention Networks for Multiagent Reinforcement Learning
    Wei, Qinglai
    Li, Yugu
    Zhang, Jie
    Wang, Fei-Yue
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 182 - 195
  • [3] Deep Reinforcement Learning Task Assignment Based on Domain Knowledge
    Liu, Jiayi
    Wang, Gang
    Guo, Xiangke
    Wang, Siyuan
    Fu, Qiang
    IEEE ACCESS, 2022, 10 : 114402 - 114413
  • [4] Coalition Game of Radar Network for Multitarget Tracking via Model-Based Multiagent Reinforcement Learning
    Xiong, Kui
    Zhang, Tianxian
    Cui, Guolong
    Wang, Shiyuan
    Kong, Lingjiang
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (03) : 2123 - 2140
  • [5] Multiagent Reinforcement Learning With Heterogeneous Graph Attention Network
    Du, Wei
    Ding, Shifei
    Zhang, Chenglong
    Shi, Zhongzhi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 6851 - 6860
  • [6] Generating individual intrinsic reward for cooperative multiagent reinforcement learning
    Wu, Haolin
    Li, Hui
    Zhang, Jianwei
    Wang, Zhuang
    Zhang, Jianeng
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2021, 18 (05):
  • [7] CuMARL: Curiosity-Based Learning in Multiagent Reinforcement Learning
    Ningombam, Devarani Devi
    Yoo, Byunghyun
    Kim, Hyun Woo
    Song, Hwa Jeon
    Yi, Sungwon
    IEEE ACCESS, 2022, 10 : 87254 - 87265
  • [8] Uncertainty Estimation based Intrinsic Reward For Efficient Reinforcement Learning
    Chen, Chao
    Wan, Tianjiao
    Shi, Peichang
    Ding, Bo
    Gao, Zijian
    Feng, Dawei
    2022 IEEE 13TH INTERNATIONAL CONFERENCE ON JOINT CLOUD COMPUTING (JCC 2022), 2022, : 1 - 8
  • [9] Multi-Task Reinforcement Learning With Attention-Based Mixture of Experts
    Cheng, Guangran
    Dong, Lu
    Cai, Wenzhe
    Sun, Changyin
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (06) : 3811 - 3818
  • [10] AMARL: An Attention-Based Multiagent Reinforcement Learning Approach to the Min-Max Multiple Traveling Salesmen Problem
    Gao, Hao
    Zhou, Xing
    Xu, Xin
    Lan, Yixing
    Xiao, Yongqian
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 9758 - 9772