Attention-Based Intrinsic Reward Mixing Network for Credit Assignment in Multiagent Reinforcement Learning

被引：5

作者：

Li, Wei ^{[1
]}

Liu, Weiyan ^{[1
]}

Shao, Shitong ^{[1
]}

Huang, Shiyi ^{[1
]}

Song, Aiguo ^{[1
]}

机构：

[1] Southeast Univ, Sch Instrument Sci & Engn, Nanjing 210096, Jiangsu, Peoples R China

来源：

IEEE TRANSACTIONS ON GAMES | 2024年 / 16卷 / 02期

关键词：

Training; Teamwork; Reinforcement learning; Games; Behavioral sciences; Optimization; Task analysis; Attention mechanism; credit assignment; intrinsic reward; mixing network; multiagent reinforcement learning; LEVEL; GAMES;

D O I：

10.1109/TG.2023.3263013

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Credit assignment is a critical problem in cooperative multiagent reinforcement learning (MARL). To address this problem, current studies mainly rely on the intrinsic reward, which is directly summed with the global reward to generate a total reward. However, such kinds of intrinsic reward functions ignore the dependence among agents and inevitably limit the adaptivity and effectiveness of MARL methods. In this article, we propose a novel method, Attention-based Intrinsic Reward Mixing Network (AIRMN), for credit assignment in MARL. Specifically, we design a new intrinsic reward network on the basis of the attention mechanism, in order to enhance the effectiveness of teamwork. Besides, we devise a new mixing network that combines the intrinsic and extrinsic rewards in a nonlinear and dynamic manner, so as to adapt the total reward to the variation of the environment. Experimental results on the battle games of StarCraft II demonstrate that AIRMN outperforms the state-of-the-art methods in terms of the average test win rate and also validate that AIRMN can dynamically return the precise intrinsic reward to each agent based on their contributions to the team cooperation, thereby better dealing with the credit assignment problem.

引用

页码：270 / 281

页数：12

共 50 条

[1] Intelligent Video Streaming at Network Edge: An Attention-Based Multiagent Reinforcement Learning Solution
Tang, Xiangdong
Chen, Fei
He, Yunlong
FUTURE INTERNET, 2023, 15 (07)
[2] VGN: Value Decomposition With Graph Attention Networks for Multiagent Reinforcement Learning
Wei, Qinglai
Li, Yugu
Zhang, Jie
Wang, Fei-Yue
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 182 - 195
[3] Deep Reinforcement Learning Task Assignment Based on Domain Knowledge
Liu, Jiayi
Wang, Gang
Guo, Xiangke
Wang, Siyuan
Fu, Qiang
IEEE ACCESS, 2022, 10 : 114402 - 114413
[4] Coalition Game of Radar Network for Multitarget Tracking via Model-Based Multiagent Reinforcement Learning
Xiong, Kui
Zhang, Tianxian
Cui, Guolong
Wang, Shiyuan
Kong, Lingjiang
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (03) : 2123 - 2140
[5] Multiagent Reinforcement Learning With Heterogeneous Graph Attention Network
Du, Wei
Ding, Shifei
Zhang, Chenglong
Shi, Zhongzhi
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 6851 - 6860
[6] Generating individual intrinsic reward for cooperative multiagent reinforcement learning
Wu, Haolin
Li, Hui
Zhang, Jianwei
Wang, Zhuang
Zhang, Jianeng
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2021, 18 (05):
[7] CuMARL: Curiosity-Based Learning in Multiagent Reinforcement Learning
Ningombam, Devarani Devi
Yoo, Byunghyun
Kim, Hyun Woo
Song, Hwa Jeon
Yi, Sungwon
IEEE ACCESS, 2022, 10 : 87254 - 87265
[8] Uncertainty Estimation based Intrinsic Reward For Efficient Reinforcement Learning
Chen, Chao
Wan, Tianjiao
Shi, Peichang
Ding, Bo
Gao, Zijian
Feng, Dawei
2022 IEEE 13TH INTERNATIONAL CONFERENCE ON JOINT CLOUD COMPUTING (JCC 2022), 2022, : 1 - 8
[9] Multi-Task Reinforcement Learning With Attention-Based Mixture of Experts
Cheng, Guangran
Dong, Lu
Cai, Wenzhe
Sun, Changyin
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (06) : 3811 - 3818
[10] AMARL: An Attention-Based Multiagent Reinforcement Learning Approach to the Min-Max Multiple Traveling Salesmen Problem
Gao, Hao
Zhou, Xing
Xu, Xin
Lan, Yixing
Xiao, Yongqian
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 9758 - 9772

← 1 2 3 4 5 →