Reinforcement learning in strategy selection for a coordinated multirobot system

Cited by: 22

Authors:

Hwang, Kao-Shing [1]

Chen, Yu-Jen [1]

Lee, Ching-Huang [1]

Affiliations:

[1] Natl Chung Cheng Univ, Dept Elect Engn, Chiayi 621, Taiwan

Source:

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS | 2007 / Vol. 37 / No. 06

Keywords:

multiagent; multiple strategies; soccer robot;

DOI:

10.1109/TSMCA.2007.904823

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

This correspondence presents a multistrategy decision-making system for robot soccer games. Through reinforcement processes, the coordination between robots is learned in the course of a game, and a better action can be obtained after an iterative learning process. The experimental scenario is a five-versus-five soccer game, where the proposed system dynamically assigns each player to a position in a primitive role, such as attacker or goalkeeper. The responsibility of each player varies as its role changes in state transitions. The system therefore uses several strategies, such as an offensive strategy and a defensive strategy, for a variety of scenarios, so that the decision-making mechanism can choose a better strategy according to the circumstances encountered. In each strategy, a robot should behave in coordination with its teammates and resolve conflicts aggressively. The major task assigned to robots in each strategy is simply to occupy good positions. The problem of dispatching robots to good positions in a reasonable manner must therefore be handled effectively. This kind of problem is similar to assignment problems in linear programming research. Using the Hungarian method, each robot can be assigned to its designated spot with minimal cost. Consequently, robots based on the proposed decision-making system can accomplish each situational task in coordination.
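
The assignment step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the cost of sending a robot to a spot is the Euclidean distance between them (the abstract does not define the cost), and the names `hungarian` and `distance_costs`, as well as the coordinates, are hypothetical. The solver is a standard potentials-based form of the Hungarian method.

```python
import math

def hungarian(cost):
    """Solve a square or wide (rows <= cols) assignment problem.

    Returns (assignment, total), where assignment[i] is the column
    chosen for row i and total is the summed cost.
    """
    n, m = len(cost), len(cost[0])
    INF = float("inf")
    u = [0] * (n + 1)      # row potentials
    v = [0] * (m + 1)      # column potentials
    p = [0] * (m + 1)      # p[j] = row currently matched to column j (1-based)
    way = [0] * (m + 1)    # back-pointers along the augmenting path
    for i in range(1, n + 1):
        p[0] = i
        j0 = 0
        minv = [INF] * (m + 1)
        used = [False] * (m + 1)
        while True:
            used[j0] = True
            i0 = p[j0]
            delta, j1 = INF, 0
            for j in range(1, m + 1):
                if not used[j]:
                    cur = cost[i0 - 1][j - 1] - u[i0] - v[j]
                    if cur < minv[j]:
                        minv[j] = cur
                        way[j] = j0
                    if minv[j] < delta:
                        delta, j1 = minv[j], j
            # Shift potentials so at least one new column becomes reachable.
            for j in range(m + 1):
                if used[j]:
                    u[p[j]] += delta
                    v[j] -= delta
                else:
                    minv[j] -= delta
            j0 = j1
            if p[j0] == 0:   # reached a free column: augmenting path found
                break
        # Flip the matching along the augmenting path.
        while True:
            j1 = way[j0]
            p[j0] = p[j1]
            j0 = j1
            if j0 == 0:
                break
    assignment = [0] * n
    for j in range(1, m + 1):
        if p[j] != 0:
            assignment[p[j] - 1] = j - 1
    total = sum(cost[i][assignment[i]] for i in range(n))
    return assignment, total

def distance_costs(robots, spots):
    """Cost matrix under the assumed Euclidean-distance cost."""
    return [[math.hypot(rx - sx, ry - sy) for (sx, sy) in spots]
            for (rx, ry) in robots]

# Hypothetical field coordinates for three robots and three target spots.
robots = [(0, 0), (4, 0), (0, 3)]
spots = [(4, 1), (0, 4), (1, 0)]
assignment, total = hungarian(distance_costs(robots, spots))
print(assignment, total)
```

Here `assignment[i]` is the index of the spot given to robot `i`. Any other cost, such as estimated travel time, could be substituted for the distance matrix without changing the solver.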

Citation

Pages: 1151-1157

Page count: 7

13 entries in total

[1] The RoboCup Challenge - Keynote Introduction [J].

Asada, M ;

Kitano, H .

ROBOTICS AND AUTONOMOUS SYSTEMS, 1999, 29 (01) :3-12

[2] Cooperative, hybrid agent architecture for real-time traffic signal control [J].

Choy, MC ;

Srinivasan, D ;

Cheu, RL .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2003, 33 (05) :597-607

[3]

Chvatal V, 1983, Linear programming

[4]

Hillier F.S., 2001, INTRO OPERATIONS RES, V7th

[5] Strategy-based decision making of a soccer robot system using a real-time self-organizing fuzzy decision tree [J].

Huang, HP ;

Liang, CC .

FUZZY SETS AND SYSTEMS, 2002, 127 (01) :49-64

[6]

Hwang KS, 2006, LECT NOTES COMPUT SC, V3971, P599

[7] Cooperative strategy based on adaptive Q-learning for robot soccer systems [J].

Hwang, KS ;

Tan, SW ;

Chen, CC .

IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2004, 12 (04) :569-576

[8]

IVANCIC F, 2001, REINFORCEMENT LEARNI

[9] Modular fuzzy-reinforcement learning approach with internal model capabilities for multiagent systems [J].

Kaya, M ;

Alhajj, R .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2004, 34 (02) :1210-1223

[10] On actor-critic algorithms [J].

Konda, VR ;

Tsitsiklis, JN .

SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2003, 42 (04) :1143-1166
