A layered approach to learning coordination knowledge in multiagent environments

被引:13
作者
Erus, Guray
Polat, Faruk
机构
[1] Univ Paris 05, Lab SIP CRIP5, F-75006 Paris, France
[2] Middle E Tech Univ, TR-06531 Ankara, Turkey
关键词
reinforcement learning; hierarchical reinforcement learning; multiagent learning;
D O I
10.1007/s10489-006-0034-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multiagent learning involves acquisition of cooperative behavior among intelligent agents in order to satisfy the joint goals. Reinforcement Learning (RL) is a promising unsupervised machine learning technique inspired from the earlier studies in animal learning. In this paper, we propose a new RL technique called the Two Level Reinforcement Learning with Communication (2LRL) method to provide cooperative action selection in a multiagent environment. In 2LRL, learning takes place in two hierarchical levels; in the first level agents learn to select their target and then they select the action directed to their target in the second level. The agents communicate their perception to their neighbors and use the communication information in their decision-making. We applied 2LRL method in a hunter-prey environment and observed a satisfactory cooperative behavior.
引用
收藏
页码:249 / 267
页数:19
相关论文
共 50 条
[31]  
Ono N, 1997, LECT NOTES ARTIF INT, V1221, P25
[32]   Modular Q-learning based multi-agent cooperation for robot soccer [J].
Park, KH ;
Kim, YJ ;
Kim, JH .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2001, 35 (02) :109-122
[33]  
Park MS, 2002, 6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XI, PROCEEDINGS, P88
[34]  
Parr R., 1998, ADV NEURAL INFORM PR
[35]  
PICKLETT M, 2002, P INT C MACH LEARN I
[36]   Learning sequences of compatible actions among agents [J].
Polat, F ;
Abul, O .
ARTIFICIAL INTELLIGENCE REVIEW, 2002, 17 (01) :21-37
[37]  
ROVATSOS M, 2004, P 2 EUR WORKSH MULT, P593
[38]  
SEN S, 1994, PROCEEDINGS OF THE TWELFTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, P426
[39]   Learning intelligent behavior in a non-stationary and partially observable environment [J].
Senkul, S ;
Polat, F .
ARTIFICIAL INTELLIGENCE REVIEW, 2002, 18 (02) :97-115
[40]  
Shoham Y., 2003, Multi-agent reinforcement learning:a critical survey