Modular Q-learning based multi-agent cooperation for robot soccer

被引:77
作者
Park, KH [1 ]
Kim, YJ [1 ]
Kim, JH [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Dept Elect Engn & Comp Sci, Yusong Gu, Taejon 305701, South Korea
关键词
multi-agent system; robot soccer system; reinforcement learning; modular Q-learning; action selection;
D O I
10.1016/S0921-8890(01)00114-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In a multi-agent system, action selection is important for the cooperation and coordination among agents. As the environment is dynamic and complex, modular Q-learning, which is one of the reinforcement learning schemes, is employed in assigning a proper action to an agent in the multi-agent system. The architecture of modular Q-learning consists of learning modules and a mediator module. The mediator module of the modular Q-learning system selects a proper action for the agent based on the Q-value obtained from each learning module. To obtain better performance, along with the Q-value, the mediator module also considers the state information in the action selection process. A uni-vector field is used for robot navigation. In the robot soccer environment, the effectiveness and applicability of modular Q-learning and the uni-vector field method are verified by real experiments using five micro-robots. (C) 2001 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:109 / 122
页数:14
相关论文
共 17 条
[11]  
ONO N, 1996, P 2 INT C MULT SYST, P252
[12]   ALLIANCE: An architecture for fault tolerant multirobot cooperation [J].
Parker, LE .
IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1998, 14 (02) :220-240
[13]  
Rummery G.A., 1995, THESIS CAMBRIDGE U C
[14]   Multiagent reinforcement learning in the Iterated Prisoner's Dilemma [J].
Sandholm, TW ;
Crites, RH .
BIOSYSTEMS, 1996, 37 (1-2) :147-166
[15]   Designing distributed control architecture for cooperative multi-agent system and its real-time application to soccer robot [J].
Shim, HS ;
Kim, HS ;
Jung, MJ ;
Choi, IH ;
Kim, JH ;
Kim, JO .
ROBOTICS AND AUTONOMOUS SYSTEMS, 1997, 21 (02) :149-165
[16]  
Sutton R. S., 1998, Reinforcement Learning: An Introduction, V22447
[17]  
WATKINS CJCH, 1992, MACH LEARN, V8, P279, DOI 10.1007/BF00992698