Bottom-up multi-agent reinforcement learning by reward shaping for cooperative-competitive tasks

被引:0
|
作者
Takumi Aotani
Taisuke Kobayashi
Kenji Sugimoto
机构
[1] Nara Institute of Science and Technology,Division of Information Science
来源
Applied Intelligence | 2021年 / 51卷
关键词
Distributed autonomous system; Reinforcement learning; Reward shaping; Interests between agents;
D O I
暂无
中图分类号
学科分类号
摘要
A multi-agent system (MAS) is expected to be applied to various real-world problems where a single agent cannot accomplish given tasks. Due to the inherent complexity in the real-world MAS, however, manual design of group behaviors of agents is intractable. Multi-agent reinforcement learning (MARL), which is a framework for multiple agents in the same environment to learn their policies adaptively by using reinforcement learning, would be a promising methodology for such complexity in the MAS. To acquire the group behaviors by MARL, all the agents are required to understand how to achieve the respective tasks cooperatively. So far, we have proposed “bottom-up MARL”, which is a decentralized system to manage real and large-scale MARL, with a reward shaping algorithm to represent the group behaviors. The reward shaping algorithm, however, assumes that all the agents are in cooperative relationships to some extent. In this paper, therefore, we extend this algorithm to allow the agents not to know the interests between them. The interests are regarded as correlation coefficients derived from the agents’ rewards, which are numerically estimated in an online manner. Actually, in both simulations and real experiments without knowledge of the interests between the agents, they correctly estimated their interests, thereby allowing them to derive their new rewards to represent the feasible group behaviors in the decentralized manner. As a result, our extended algorithm succeeded in acquiring the group behaviors from cooperative tasks to competitive tasks.
引用
收藏
页码:4434 / 4452
页数:18
相关论文
共 50 条
  • [1] Bottom-up multi-agent reinforcement learning by reward shaping for cooperative-competitive tasks
    Aotani, Takumi
    Kobayashi, Taisuke
    Sugimoto, Kenji
    APPLIED INTELLIGENCE, 2021, 51 (07) : 4434 - 4452
  • [2] Mixed Cooperative-Competitive Communication Using Multi-agent Reinforcement Learning
    Vanneste, Astrid
    Van Wijnsberghe, Wesley
    Vanneste, Simon
    Mets, Kevin
    Mercelis, Siegfried
    Latre, Steven
    Hellinckx, Peter
    ADVANCES ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING, 3PGCIC-2021, 2022, 343 : 197 - 206
  • [3] Demand response model: A cooperative-competitive multi-agent reinforcement learning approach
    Salazar, Eduardo J.
    Rosero, Veronica
    Gabrielski, Jawana
    Samper, Mauricio E.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [4] Bottom-up Multi-agent Reinforcement Learning for Selective Cooperation
    Aotani, Takumi
    Kobayashi, Taisuke
    Sugimoto, Kenji
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 3590 - 3595
  • [5] Hierarchical relationship modeling in multi-agent reinforcement learning for mixed cooperative-competitive environments
    Xie, Shaorong
    Li, Yang
    Wang, Xinzhi
    Zhang, Han
    Zhang, Zhenyu
    Luo, Xiangfeng
    Yu, Hang
    INFORMATION FUSION, 2024, 108
  • [6] Bias Estimation Correction in Multi-Agent Reinforcement Learning for Mixed Cooperative-Competitive Environments
    Sarkar T.
    Kalita S.
    SN Computer Science, 5 (1)
  • [7] Learning Reward Machines in Cooperative Multi-agent Tasks
    Ardon, Leo
    Furelos-Blanco, Daniel
    Russo, Alessandra
    AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS. BEST AND VISIONARY PAPERS, AAMAS 2023 WORKSHOPS, 2024, 14456 : 43 - 59
  • [8] A Novel Multi-Agent Parallel-Critic Network Architecture for Cooperative-Competitive Reinforcement Learning
    Sun, Yu
    Lai, Jun
    Cao, Lei
    Chen, Xiliang
    Xu, Zhixiong
    Xu, Yue
    IEEE ACCESS, 2020, 8 : 135605 - 135616
  • [9] Cooperative Multi-Agent Deep Reinforcement Learning with Counterfactual Reward
    Shao, Kun
    Zhu, Yuanheng
    Tang, Zhentao
    Zhao, Dongbin
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [10] NeuronsMAE: A Novel Multi-Agent Reinforcement Learning Environment for Cooperative and Competitive Multi-Robot Tasks
    Hu, Guangzheng
    Li, Haoran
    Liu, Shasha
    Zhu, Yuanheng
    Zhao, Dongbin
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,