An Evolutionary Transfer Reinforcement Learning Framework for Multiagent Systems

被引:61
作者
Hou, Yaqing [1 ]
Ong, Yew-Soon [2 ]
Feng, Liang [3 ]
Zurada, Jacek M. [4 ,5 ]
机构
[1] Nanyang Technol Univ, Interdisciplinary Grad Sch, Singapore 639798, Singapore
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[3] Chongqing Univ, Coll Comp Sci, Chongqing 400044, Peoples R China
[4] Univ Louisville, Dept Elect & Comp Engn, Louisville, KY 40292 USA
[5] Univ Social Sci, PL-90011 Lodz, Poland
基金
新加坡国家研究基金会;
关键词
Memetic automaton; multiagent systems (MASs); natural selection; reinforcement learning (RL); transfer learning (TL); ORGANIZING NEURAL-NETWORKS; MEMETIC ALGORITHMS; COMPUTATION;
D O I
10.1109/TEVC.2017.2664665
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present an evolutionary transfer reinforcement learning framework (eTL) for developing intelligent agents capable of adapting to the dynamic environment of multiagent systems (MASs). Specifically, we take inspiration from Darwin's theory of natural selection and Universal Darwinism as the principal driving forces that govern the evolutionary knowledge transfer process. The essential backbone of our proposed eTL comprises several meme-inspired evolutionary mechanisms, namely meme representation, meme expression, meme assimilation, meme internal evolution, and meme external evolution. Our proposed approach constructs social selection mechanisms that are modeled after the principles of human learning to identify appropriate interacting partners. eTL also models the intrinsic parallelism of natural evolution and errors that are introduced due to the physiological limits of the agents' ability to perceive differences, so as to generate "growth" and "variation" of knowledge that agents have of the world, thus exhibiting higher adaptivity capabilities on solving complex problems. To verify the efficacy of the proposed paradigm, comprehensive investigations of the proposed eTL against existing state-of-the-art TL methods in MAS, are conducted on the "minefield navigation tasks" platform and the "Unreal Tournament 2004" first person shooter computer game, in which homogeneous and heterogeneous learning machines are considered.
引用
收藏
页码:601 / 615
页数:15
相关论文
共 47 条
[1]  
Adobbati R., 2001, P 2 WORKSH INFR AG, V45, P47
[2]  
[Anonymous], 1988, LEARNING REPRESENTAT
[3]  
[Anonymous], 1989, THESIS
[4]  
[Anonymous], 1997, JoM: Evolutionary Models of Information Transmission
[5]  
[Anonymous], IEEE T COMPUTATIONAL
[6]  
[Anonymous], 2000, IEEE C EVOL COMPUTAT
[7]  
Back T., 1997, IEEE Transactions on Evolutionary Computation, V1, P3, DOI 10.1109/4235.585888
[8]  
Bagnell JA, 2001, IEEE INT CONF ROBOT, P1615, DOI 10.1109/ROBOT.2001.932842
[9]  
Banerjee B, 2007, 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P672
[10]  
Blackmore S., 2000, MEME MACHINE, V25