Rapid behavior learning in multi-agent environment based on state value estimation of others

被引：0

作者：

Takahashi, Yasutake ^{[1
]}

Noma, Kentaro ^{[1
]}

Asada, Minoru ^{[1
]}

机构：

[1] Osaka Univ, Dept Adapt Machine Syst, Suita, Osaka 5650871, Japan

来源：

2007 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-9 | 2007年

关键词：

D O I：

10.1109/IROS.2007.4399294

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The existing reinforcement learning approaches have been suffering from the curse of dimension problem when they are applied to multiagent dynamic environments. One of the typical examples is a case of RoboCup competitions since other agents and their behaviors easily cause state and action space explosion. This paper presents a method of modular learning in a multiagent environment by which the learning agent can acquire cooperative behaviors with its team mates and competitive ones against its opponents. The key ideas to resolve the issue are as follows. First, a two-layer hierarchical system with multi learning modules is adopted to reduce the size of the sensor and action spaces. The state space of the top layer consists of the state values from the lower level, and the macro actions are used to reduce the size of the physical action space. Second, the state of the other to what extent it is close to its own goal is estimated by observation and used as a state value in the top layer state space to realize the cooperative/competitive behaviors. The method is applied to 4 (defense team) on 5 (offense team) game task, and the learning agent successfully acquired the teamwork plays (pass and shoot) within much shorter learning time (30 times quicker than the earlier work).

引用

页码：76 / 81

页数：6

共 50 条

[21] Modeling Others using Oneself in Multi-Agent Reinforcement Learning
Raileanu, Roberta
Denton, Emily
Szlam, Arthur
Fergus, Rob
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[22] A context model based on multi-agent in U-learning environment
Jun, SooJin
Han, SeonKwan
Kim, SooHwan
Kim, HyeonCheol
Lee, WonGyu
TECHNOLOGIES FOR E-LEARNING AND DIGITAL ENTERTAINMENT, PROCEEDINGS, 2007, 4469 : 274 - +
[23] Case-based student modeling in multi-agent learning environment
González, C
Burguillo, JC
Llamas, M
MULTI-AGENT SYSTEMS AND APPLICATIONS IV, PROCEEDINGS, 2005, 3690 : 72 - 81
[24] Multi-agent Q-learning Based Navigation in an Unknown Environment
Nath, Amar
Niyogi, Rajdeep
Singh, Tajinder
Kumar, Virendra
ADVANCED INFORMATION NETWORKING AND APPLICATIONS, AINA-2022, VOL 1, 2022, 449 : 330 - 340
[25] Constructing adaptive individual learning environment based on multi-agent system
Chen, Peng
Meng, Anbo
Zhao, Chunhua
CIS WORKSHOPS 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY WORKSHOPS, 2007, : 374 - +
[26] Distributed State Estimation for Multi-Agent based Active Distribution Networks
Nguyen, P. H.
Kling, W. L.
IEEE POWER AND ENERGY SOCIETY GENERAL MEETING 2010, 2010,
[27] Cooperative Multi-Agent Learning: The State of the Art
Liviu Panait
Sean Luke
Autonomous Agents and Multi-Agent Systems, 2005, 11 : 387 - 434
[28] Simulation for behavior learning of multi-agent robot
Maeda, Y
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 1998, 6 (01) : 53 - 64
[29] Cooperative multi-agent learning: The state of the art
Panait, L
Luke, S
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2005, 11 (03) : 387 - 434
[30] State-based episodic memory for multi-agent reinforcement learning
Xiao Ma
Wu-Jun Li
Machine Learning, 2023, 112 : 5163 - 5190

← 1 2 3 4 5 →