Cooperative Reinforcement Learning Based on Zero-Sum Games

Times cited: 0
Authors
Hwang, Kao-Shing [1]
Chiou, Jeng-Yih [2]
Chen, Tse-Yu [1]
Affiliations
[1] Natl Chung Cheng Univ, Dept Elect Engn X, Chiayi, Taiwan
[2] Kun San Univ Tainan, Dept Informat Engn, Tainan, Taiwan
Source
2008 PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-7 | 2008
Keywords
cooperation; zero-sum game theory; Q-learning; robot soccer system; reinforcement learning; strategy system
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper develops a strategy system for a robot soccer system whose cooperative ability improves through self-learning. The proposed reinforcement learning method is based on zero-sum game theory: it drives the learning system to choose an appropriate strategy in response to the opponent's actions. To achieve cooperation, the system consists of two subsystems: a role assignment system and a reinforcement learning system.
Pages: 2857 / +
Page count: 3
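
Illustrative note: the abstract describes a reinforcement learning method that picks a strategy in response to the opponent's actions in a zero-sum setting. The record does not give the update rule, so the following is only a minimal sketch of one common way to combine Q-learning with a zero-sum game, assuming a tabular Q-function indexed by (own action, opponent action) and a pure-strategy maximin choice. It is not the authors' algorithm, it omits the role assignment subsystem, and it simplifies minimax-Q style methods by skipping mixed strategies. All names (`new_q_table`, `maximin_value`, `maximin_action`, `update`, the state and action labels) are hypothetical.

```python
from collections import defaultdict


def new_q_table():
    """Q[state][(own_action, opp_action)] -> value, defaulting to 0.0."""
    return defaultdict(lambda: defaultdict(float))


def maximin_value(q_row, actions, opp_actions):
    """Best worst-case value over own actions (pure strategies only)."""
    return max(min(q_row[(a, o)] for o in opp_actions) for a in actions)


def maximin_action(q_row, actions, opp_actions):
    """Own action whose worst case against the opponent is largest."""
    return max(actions, key=lambda a: min(q_row[(a, o)] for o in opp_actions))


def update(Q, s, a, o, r, s_next, actions, opp_actions, alpha=0.1, gamma=0.9):
    """One Q-learning step whose bootstrap target is the maximin value
    of the next state instead of the plain maximum."""
    target = r + gamma * maximin_value(Q[s_next], actions, opp_actions)
    Q[s][(a, o)] += alpha * (target - Q[s][(a, o)])


# Hypothetical action sets for a single soccer robot and its opponent.
ACTIONS = ["attack", "defend"]
OPP_ACTIONS = ["press", "retreat"]

Q = new_q_table()
Q["s1"][("attack", "press")] = 1.0
Q["s1"][("attack", "retreat")] = -2.0
Q["s1"][("defend", "press")] = 0.5
Q["s1"][("defend", "retreat")] = 0.0

# Worst cases in s1: attack -> -2.0, defend -> 0.0, so "defend" is chosen.
best = maximin_action(Q["s1"], ACTIONS, OPP_ACTIONS)

# Observed transition s0 -> s1 with reward 1.0 after (attack, press).
update(Q, "s0", "attack", "press", 1.0, "s1", ACTIONS, OPP_ACTIONS, alpha=0.5)
```

Using the worst case over opponent actions makes the learned values conservative: the agent prefers strategies that remain acceptable whatever the opponent does, which is the defining feature of a zero-sum formulation.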