Cooperation between multiple agents based on partially sharing policy

被引:0
作者
Hwang, Kao-Shing [1 ]
Lin, Chia-Ju [1 ]
Wit, Chun-Ju [1 ]
Lo, Chia-Yue [1 ]
机构
[1] Natl Chung Cheng Univ, Dept Elect Engn, 168 Univ Rd, Chiayi, Taiwan
来源
ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF THEORETICAL AND METHODOLOGICAL ISSUES | 2007年 / 4681卷
关键词
multi-agent; cooperation; sharing; reinforcement learning; mobile; robot;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In human society, learning is essential to intelligent behavior. However, people do not need to learn everything from scratch by their own discovery. Instead, they exchange information and knowledge with one another and learn from their peers and teachers. When a task is too complex for an individual to handle, one may cooperate with its partners in order to accomplish it. Like human society, cooperation exists in the other species, such as ants that are known to communicate about the locations of food and move it cooperatively. Using the experience and knowledge of other agents, a learning agent may learn faster, make fewer mistakes, and create rules for unstructured situations. In the proposed learning algorithm, an agent adapts to comply with its peers by learning carefully when it obtains a positive reinforcement feedback signal, but should learn more aggressively if a negative reward follows the action just taken. These two properties are applied to develop the proposed cooperative learning method conceptually. The algorithm is implemented in some cooperative tasks and demonstrates that agents can learn to accomplish a task together efficiently through a repetitive trials.
引用
收藏
页码:422 / +
页数:2
相关论文
共 12 条
[1]   Expertness based cooperative Q-learning [J].
Ahmadabadi, MN ;
Asadpour, M .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2002, 32 (01) :66-76
[2]  
BANERJEE B, 2003, P 2 INT JOINT C AUT, P683
[3]  
BONARINI A, 2005, ECML WORKSH REINF LE
[4]   Multiagent learning using a variable learning rate [J].
Bowling, M ;
Veloso, M .
ARTIFICIAL INTELLIGENCE, 2002, 136 (02) :215-250
[5]  
Bowling M, 2001, P 17 INT JOINT C ART, V17, P1021
[6]  
HAFNER R, 2003, P 2003 IEEE RSJ INT
[7]  
Hwang KS, 2006, LECT NOTES COMPUT SC, V3971, P599
[8]  
KATO S, 1991, ADV ROBOTICS, V2, P1063
[9]  
Sutton R. S., 1998, Reinforcement Learning: An Introduction, V22447
[10]  
Tan M., 1993, P 10 INT C MACH LEAR, P330