Cooperative Multi-Agent Deep Reinforcement Learning with Counterfactual Reward

被引:2
|
作者
Shao, Kun [1 ,2 ]
Zhu, Yuanheng [1 ]
Tang, Zhentao [1 ,2 ]
Zhao, Dongbin [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
来源
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2020年
关键词
reinforcement learning; deep reinforcement learning; cooperative games; counterfactual reward; LEVEL; GAME; GO;
D O I
10.1109/ijcnn48605.2020.9207169
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In partially observable fully cooperative games, agents generally tend to maximize global rewards with joint actions, so it is difficult for each agent to deduce their own contribution. To address this credit assignment problem, we propose a multi-agent reinforcement learning algorithm with counterfactual reward mechanism, which is termed as CoRe algorithm. CoRe computes the global reward difference in condition that the agent does not take its actual action but takes other actions, while other agents fix their actual actions. This approach can determine each agent's contribution for the global reward. We evaluate CoRe in a simplified Pig Chase game with a decentralised Deep Q Network (DQN) framework. The proposed method helps agents learn end-to-end collaborative behaviors. Compared with other DQN variants with global reward, CoRe significantly improves learning efficiency and achieves better results. In addition, CoRe shows excellent performances in various size game environments.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Competitive Multi-Agent Deep Reinforcement Learning with Counterfactual Thinking
    Wang, Yue
    Wan, Yao
    Zhang, Chenwei
    Bai, Lu
    Cui, Lixin
    Yu, Philip S.
    2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 1366 - 1371
  • [2] A review of cooperative multi-agent deep reinforcement learning
    Oroojlooy, Afshin
    Hajinezhad, Davood
    APPLIED INTELLIGENCE, 2023, 53 (11) : 13677 - 13722
  • [3] A review of cooperative multi-agent deep reinforcement learning
    Afshin Oroojlooy
    Davood Hajinezhad
    Applied Intelligence, 2023, 53 : 13677 - 13722
  • [4] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
    Chen, Hao
    Yang, Guangkai
    Zhang, Junge
    Yin, Qiyue
    Huang, Kaiqi
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [5] Transform networks for cooperative multi-agent deep reinforcement learning
    Wang, Hongbin
    Xie, Xiaodong
    Zhou, Lianke
    APPLIED INTELLIGENCE, 2023, 53 (08) : 9261 - 9269
  • [6] The Cooperative Multi-agent Learning with Random Reward Values
    张化祥
    黄上腾
    JournalofShanghaiJiaotongUniversity, 2005, (02) : 147 - 150
  • [7] Train timetabling with the general learning environment and multi-agent deep reinforcement learning
    Li, Wenqing
    Ni, Shaoquan
    TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2022, 157 : 230 - 251
  • [8] Cooperative Multi-Agent Deep Reinforcement Learning in Soccer Domains
    Ocana, Jim Martin Catacora
    Riccio, Francesco
    Capobianco, Roberto
    Nardi, Daniele
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1865 - 1867
  • [9] Transform networks for cooperative multi-agent deep reinforcement learning
    Hongbin Wang
    Xiaodong Xie
    Lianke Zhou
    Applied Intelligence, 2023, 53 : 9261 - 9269
  • [10] Shapley Counterfactual Credits for Multi-Agent Reinforcement Learning
    Li, Jiahui
    Kuang, Kun
    Wang, Baoxiang
    Liu, Furui
    Chen, Long
    Wu, Fei
    Xiao, Jun
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 934 - 942