Modular Q-learning based multi-agent cooperation for robot soccer

被引：77

作者：

Park, KH ^{[1
]}

Kim, YJ ^{[1
]}

Kim, JH ^{[1
]}

机构：

[1] Korea Adv Inst Sci & Technol, Dept Elect Engn & Comp Sci, Yusong Gu, Taejon 305701, South Korea

来源：

ROBOTICS AND AUTONOMOUS SYSTEMS | 2001年 / 35卷 / 02期

关键词：

multi-agent system; robot soccer system; reinforcement learning; modular Q-learning; action selection;

D O I：

10.1016/S0921-8890(01)00114-2

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In a multi-agent system, action selection is important for the cooperation and coordination among agents. As the environment is dynamic and complex, modular Q-learning, which is one of the reinforcement learning schemes, is employed in assigning a proper action to an agent in the multi-agent system. The architecture of modular Q-learning consists of learning modules and a mediator module. The mediator module of the modular Q-learning system selects a proper action for the agent based on the Q-value obtained from each learning module. To obtain better performance, along with the Q-value, the mediator module also considers the state information in the action selection process. A uni-vector field is used for robot navigation. In the robot soccer environment, the effectiveness and applicability of modular Q-learning and the uni-vector field method are verified by real experiments using five micro-robots. (C) 2001 Elsevier Science B.V. All rights reserved.

引用

页码：109 / 122

页数：14

共 17 条

[11]

ONO N, 1996, P 2 INT C MULT SYST, P252

[12] ALLIANCE: An architecture for fault tolerant multirobot cooperation [J].

Parker, LE .

IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1998, 14 (02) :220-240

[13]

Rummery G.A., 1995, THESIS CAMBRIDGE U C

[14] Multiagent reinforcement learning in the Iterated Prisoner's Dilemma [J].

Sandholm, TW ;

Crites, RH .

BIOSYSTEMS, 1996, 37 (1-2) :147-166

[15] Designing distributed control architecture for cooperative multi-agent system and its real-time application to soccer robot [J].

Shim, HS ;

Kim, HS ;

Jung, MJ ;

Choi, IH ;

Kim, JH ;

Kim, JO .

ROBOTICS AND AUTONOMOUS SYSTEMS, 1997, 21 (02) :149-165

[16]

Sutton R. S., 1998, Reinforcement Learning: An Introduction, V22447

[17]

WATKINS CJCH, 1992, MACH LEARN, V8, P279, DOI 10.1007/BF00992698

← 1 2 →