A simple interaction model for learner agents: An evolutionary approach

被引:2
作者
Beigi, Akram [1 ]
Mozayani, Nasser [1 ]
机构
[1] Iran Univ Sci & Technol, Sch Comp Engn, Univ Rd,Hengam St, Tehran, Iran
关键词
Interactions between agents; multi task reinforcement learning; evolutionary algorithms; dynamic environment;
D O I
10.3233/IFS-152024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently multi agent systems are used to solve complex problems. In these systems agents can cooperate when a problem is difficult or impossible to solve for an individual agent. Via learning, the agents attempt to maximize some of their utilities. In multi agent learning an agent learns to interact with other agents and considering their behaviors. By multi task learning, the agent simultaneously learns a set of related problems and with reinforcement learning, an agent learns a proper policy to achieve its goal. In learning process, using the experience of teammate agents by simple interactions among them is very beneficial. In this paper we have presented a simple model of agents' interactions using operators of an evolutionary algorithm. Applying the proposed model has improved significantly the performance of multi task learning in a nondeterministic and dynamic environment, specifically for the dynamic maze problem. The experimental results indicate our claim.
引用
收藏
页码:2713 / 2726
页数:14
相关论文
共 31 条
[1]  
[Anonymous], 2009, An introduction to multiagent systems
[2]  
[Anonymous], 2012, COMPUTER SCI P INT C
[3]  
[Anonymous], IEEE SICE ICASE INT
[4]  
Bagnall AJ, 2005, STUD FUZZ SOFT COMP, V183, P307
[5]  
Beigi A, 2011, LECT NOTES COMPUT SC, V6594, P245, DOI 10.1007/978-3-642-20267-4_26
[6]  
Beigi A, 2010, LECT NOTES COMPUT SC, V6335, P330, DOI 10.1007/978-3-642-15470-6_34
[7]  
Chen J., 2011, P 17 ACM SIGKDD INT, P42
[8]  
Cuayahuitl, 2009, THESIS U EDINBURGH
[9]  
GOH K, 2009, EVOLUTIONARY MULTIOB
[10]   Evolutionary computation on multitask reinforcement learning problems [J].
Handa, Hisashi .
2007 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING, AND CONTROL, VOLS 1 AND 2, 2007, :685-688