Cooperative Reinforcement Learning Based on Zero-Sum Games

被引：0

作者：

Hwang, Kao-Shing ^{[1
]}

Chiou, Jeng-Yih ^{[2
]}

Chen, Tse-Yu ^{[1
]}

机构：

[1] Natl Chung Cheng Univ, Dept Elect Engn X, Chiayi, Taiwan

[2] Kun San Univ Tainan, Dept Informat Engn, Tainan, Taiwan

来源：

2008 PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-7 | 2008年

关键词：

cooperation; zero-sum game theory; Q-learning; robot soccer system; reinforcement learning; strategy system;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The objective of this paper is to develop a strategy system in a robot soccer system with cooperative ability which is improved by self-learning. A reinforcement learning method based on the zero-sum game theory is developed in this paper. It enforces learning systems to choose an appropriate strategy complying with the opponent's actions. In order to achieve the purpose of cooperation, the system consists of two sub systems, one is a role assignment system, and the other is a reinforcement learning system

引用

页码：2857 / +

页数：3

共 10 条

[1] ALBUS JS, 1975, DYNAMIC SYSTEMS MEAS, P220
[2] [Anonymous], P AUSTR C ROB AUT AC
[3] Reinforcement learning: A survey
Kaelbling, LP
Littman, ML
Moore, AW
[J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 : 237 - 285
[4] Vector field based path planning and Petri-net based role selection mechanism with Q-learning for the soccer robot system
Kim, DH
Kim, YJ
Kim, KC
Kim, JH
Vadakkepat, P
[J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2000, 6 (01) : 75 - 87
[5] Multi-agent systems: A survey from the robot-soccer perspective
Kim, JH
Vadakkepat, P
[J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2000, 6 (01) : 3 - 17
[6] Modular Q-learning based multi-agent cooperation for robot soccer
Park, KH
Kim, YJ
Kim, JH
[J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2001, 35 (02) : 109 - 122
[7] SHIM HS, 1999, 4 INT S ART LIF ROB
[8] STONE P, MULTIAGENT SYSTEMS S
[9] Sutton R. S., 1998, Reinforcement Learning: An Introduction, V22447
[10] Reinforcement learning soccer teams with incomplete world models
Wiering, M
Salustowicz, R
Schmidhuber, J
[J]. AUTONOMOUS ROBOTS, 1999, 7 (01) : 77 - 88

← 1 →