Bottom-up multi-agent reinforcement learning by reward shaping for cooperative-competitive tasks

Cited by: 0

Authors:

Takumi Aotani

Taisuke Kobayashi

Kenji Sugimoto

Affiliation:

[1] Nara Institute of Science and Technology, Division of Information Science

Source:

Applied Intelligence | 2021 / Vol. 51

Keywords:

Distributed autonomous system; Reinforcement learning; Reward shaping; Interests between agents

DOI:

Not available

Abstract:

A multi-agent system (MAS) is expected to be applied to various real-world problems that a single agent cannot solve. Because of the inherent complexity of real-world MASs, however, manually designing the group behaviors of agents is intractable. Multi-agent reinforcement learning (MARL), a framework in which multiple agents in the same environment learn their policies adaptively through reinforcement learning, is a promising methodology for handling this complexity. To acquire group behaviors by MARL, all the agents must understand how to achieve their respective tasks cooperatively. We previously proposed “bottom-up MARL”, a decentralized system for managing real, large-scale MARL, together with a reward shaping algorithm that represents the group behaviors. That reward shaping algorithm, however, assumes that all the agents are in cooperative relationships to some extent. In this paper, we therefore extend the algorithm so that the agents need not know the interests between them in advance. The interests are regarded as correlation coefficients derived from the agents’ rewards, which are numerically estimated in an online manner. In both simulations and real experiments, without prior knowledge of the interests between them, the agents correctly estimated their interests, which allowed them to derive new rewards that represent feasible group behaviors in a decentralized manner. As a result, the extended algorithm succeeded in acquiring group behaviors across tasks ranging from cooperative to competitive.
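The abstract describes the interests between agents as correlation coefficients of their rewards, estimated online. The sketch below is only an illustration of that idea, not the authors' algorithm: it keeps a running Pearson correlation between two reward streams using Welford-style incremental updates. The class name `OnlineCorrelation` and its methods are hypothetical, and the paper's actual estimator and reward shaping rule may differ.

```python
import math


class OnlineCorrelation:
    """Running Pearson correlation between two reward streams.

    Hypothetical helper for illustration only; not taken from the paper.
    """

    def __init__(self):
        self.n = 0          # number of reward pairs seen so far
        self.mean_x = 0.0   # running mean of agent i's reward
        self.mean_y = 0.0   # running mean of agent j's reward
        self.m2_x = 0.0     # sum of squared deviations of x
        self.m2_y = 0.0     # sum of squared deviations of y
        self.c_xy = 0.0     # sum of co-deviations of x and y

    def update(self, x, y):
        """Fold one pair of rewards (x from agent i, y from agent j) into the estimate."""
        self.n += 1
        dx = x - self.mean_x            # deviation from the old mean of x
        dy = y - self.mean_y            # deviation from the old mean of y
        self.mean_x += dx / self.n
        self.mean_y += dy / self.n
        self.m2_x += dx * (x - self.mean_x)
        self.m2_y += dy * (y - self.mean_y)
        self.c_xy += dx * (y - self.mean_y)

    def value(self):
        """Return the current correlation, or 0.0 while it is still undefined."""
        denom = math.sqrt(self.m2_x * self.m2_y)
        return self.c_xy / denom if denom > 0.0 else 0.0
```

Under this sketch, a value near +1 would suggest that two agents' rewards rise and fall together (a cooperative relationship), while a value near -1 would suggest opposed interests (a competitive one). How such an estimate is turned into a shaped reward is left to the paper itself.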

Pages: 4434-4452

Number of pages: 18
