Bottom-up multi-agent reinforcement learning by reward shaping for cooperative-competitive tasks

被引:0
|
作者
Takumi Aotani
Taisuke Kobayashi
Kenji Sugimoto
机构
[1] Nara Institute of Science and Technology,Division of Information Science
来源
Applied Intelligence | 2021年 / 51卷
关键词
Distributed autonomous system; Reinforcement learning; Reward shaping; Interests between agents;
D O I
暂无
中图分类号
学科分类号
摘要
A multi-agent system (MAS) is expected to be applied to various real-world problems where a single agent cannot accomplish given tasks. Due to the inherent complexity in the real-world MAS, however, manual design of group behaviors of agents is intractable. Multi-agent reinforcement learning (MARL), which is a framework for multiple agents in the same environment to learn their policies adaptively by using reinforcement learning, would be a promising methodology for such complexity in the MAS. To acquire the group behaviors by MARL, all the agents are required to understand how to achieve the respective tasks cooperatively. So far, we have proposed “bottom-up MARL”, which is a decentralized system to manage real and large-scale MARL, with a reward shaping algorithm to represent the group behaviors. The reward shaping algorithm, however, assumes that all the agents are in cooperative relationships to some extent. In this paper, therefore, we extend this algorithm to allow the agents not to know the interests between them. The interests are regarded as correlation coefficients derived from the agents’ rewards, which are numerically estimated in an online manner. Actually, in both simulations and real experiments without knowledge of the interests between the agents, they correctly estimated their interests, thereby allowing them to derive their new rewards to represent the feasible group behaviors in the decentralized manner. As a result, our extended algorithm succeeded in acquiring the group behaviors from cooperative tasks to competitive tasks.
引用
收藏
页码:4434 / 4452
页数:18
相关论文
共 50 条
  • [31] Optimal couple-group tracking control for the heterogeneous multi-agent systems with cooperative-competitive interactions via reinforcement learning method
    Li, Jun
    Ji, Lianghao
    Zhang, Cuijuan
    Li, Huaqing
    INFORMATION SCIENCES, 2022, 610 : 401 - 424
  • [32] On the Robustness of Cooperative Multi-Agent Reinforcement Learning
    Lin, Jieyu
    Dzeparoska, Kristina
    Zhang, Sai Qian
    Leon-Garcia, Alberto
    Papernot, Nicolas
    2020 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2020), 2020, : 62 - 68
  • [33] Cooperative Multi-Agent Reinforcement Learning with Dynamic Target Localization: A Reward Sharing Approach
    Wickramaarachchi, Helani
    Kirley, Michael
    Geard, Nicholas
    ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2023, PT II, 2024, 14472 : 310 - 324
  • [34] A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising
    Wen, Chao
    Xu, Miao
    Zhang, Zhilin
    Zheng, Zhenzhe
    Wang, Yuhui
    Liu, Xiangyu
    Rong, Yu
    Xie, Dong
    Tan, Xiaoyang
    Yu, Chuan
    Xu, Jian
    Wu, Fan
    Chen, Guihai
    Zhu, Xiaoqiang
    Zheng, Bo
    WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 1129 - 1139
  • [35] Consensus Learning for Cooperative Multi-Agent Reinforcement Learning
    Xu, Zhiwei
    Zhang, Bin
    Li, Dapeng
    Zhang, Zeren
    Zhou, Guangchong
    Chen, Hao
    Fan, Guoliang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11726 - 11734
  • [36] Underexplored Subspace Mining for Sparse-Reward Cooperative Multi-Agent Reinforcement Learning
    Yu, Yang
    Yin, Qiyue
    Zhang, Junge
    Chen, Hao
    Huang, Kaiqi
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [37] Rationality of reward sharing in multi-agent reinforcement learning
    Kazuteru Miyazaki
    Shigenobu Kobayashi
    New Generation Computing, 2001, 19 : 157 - 172
  • [38] Rationality of reward sharing in multi-agent reinforcement learning
    Miyazaki, K
    Kobayashi, S
    NEW GENERATION COMPUTING, 2001, 19 (02) : 157 - 172
  • [39] Individual Reward Assisted Multi-Agent Reinforcement Learning
    Wang, Li
    Zhang, Yupeng
    Hu, Yujing
    Wang, Weixun
    Zhang, Chongjie
    Gao, Yang
    Hao, Jianye
    Lv, Tangjie
    Fan, Changjie
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [40] The Cooperative Multi-agent Learning with Random Reward Values
    张化祥
    黄上腾
    Journal of Shanghai Jiaotong University, 2005, (02) : 147 - 150