Reinforcement learning of competitive skills with soccer agents

被引:0
作者
Leng, Jinsong [1 ]
Fyfe, Colin [2 ]
Jain, Lakhmi [1 ]
机构
[1] Univ S Australia, Sch Elect & Informat Engn, Knowledge Based Intelligent Engn Syst Ctr, Mawson Lakes, SA 5095, Australia
[2] Univ Paisley, Appl Computat Intelligence Res Unit, Paisley, Renfrew, Scotland
来源
KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS: KES 2007 - WIRN 2007, PT I, PROCEEDINGS | 2007年 / 4692卷
关键词
agents; reinforcement learning; decision making;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning plays an important role in Multi-Agent Systems. The reasoning and learning ability of agents is the key for autonomous agents. Autonomous agents are required to be able to adapt and learn in uncertain environments via communication and collaboration (in both competitive and cooperative situations). For real-time, non-deterministic and dynamic systems, it is often extremely complex and difficult to formally verify their properties a priori. In this paper, we adopt the reinforcement learning algorithms to verify goal-oriented agenst competitive and cooperative learning abilities for decision making. In doing so, a simulation testbed is applied to test the learning algorithms in the specified scenarios. In addition, the function approximation technique known as tile coding (TC), is used to generate value functions, which can avoid the value function growing exponentially with the number of the state values.
引用
收藏
页码:572 / +
页数:3
相关论文
共 17 条
[1]  
DAYAN P, 1994, MACH LEARN, V14, P295
[2]  
*INFOGRAMES EP GAM, 2000, TECHN REP UNR TOURN
[3]  
Jenner HA, 1998, Hydroecologie Appliquee, V1-2, P1, DOI [DOI 10.1051/HYDRO:1989101, 10.1051/hydro:1989101]
[4]  
Jennings N. R., 1998, AGENT TECHNOLOGY FDN, P3, DOI [DOI 10.1007/978-3-662-03678-5_1, 10.1007/978-3-662-03678-5_1]
[5]  
KUHLMANN G, 2006, LNCS LNAI, V4020, P30
[6]  
Leng JS, 2006, LECT NOTES ARTIF INT, V4252, P472
[7]  
Riedmiller M., 2001, RoboCup 2000: Robot Soccer World Cup IV (Lecture Notes in Artificial Intelligence Vol.2019), P367
[8]  
Sherstov AA, 2005, LECT NOTES ARTIF INT, V3607, P194
[9]  
Singh SP, 1996, MACH LEARN, V22, P123, DOI 10.1007/BF00114726
[10]  
Stankevich L, 2005, LECT NOTES COMPUT SC, V3505, P289