Evolutionary computation on multitask reinforcement learning problems

被引:5
作者
Handa, Hisashi [1 ]
机构
[1] Okayama Univ, Grad Sch Nat Sci & Technol, Okayama 7008530, Japan
来源
2007 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING, AND CONTROL, VOLS 1 AND 2 | 2007年
关键词
multitask reinforcement learning problems; evolutionary algorithms; dynamic environments;
D O I
10.1109/ICNSC.2007.372862
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, Multitask learning, which can cope with several tasks, has attracted much attention. Multitask Reinforcement Learning introduced by Tanaka et al is a problem class where number of problem instances of Markov Decision Processes sampled from the same probability distributions is sequentially given to reinforcement learning agents. The purpose of solving this problem is to realize adaptive agents for newly given environments by using knowledge acquired from past experience. Evolutionary Algorithms are often used to solve reinforcement learning problems if problem classes are quite different with Markov Decision Processes or state-action space is quite huge. From the viewpoint of Evolutionary Algorithms studies, the Multitask Reinforcement Learning problems are regarded as dynamic problems whose fitness landscape has changed temporally. In this paper, a memory-based Evolutionary Programming which is suitable for Multitask Reinforcement Learning problems is proposed.
引用
收藏
页码:685 / 688
页数:4
相关论文
共 11 条
[1]   An Overview of Evolutionary Algorithms for Parameter Optimization [J].
Baeck, Thomas ;
Schwefel, Hans-Paul .
EVOLUTIONARY COMPUTATION, 1993, 1 (01) :1-23
[2]  
Branke J., 1999, Proceedings of the IEEE Congress on Evolutionary Computation, Washington, DC, USA, DOI DOI 10.1109/CEC.1999.785502
[3]  
FOGEL D, 1999, EVOLUTIONARY COMPUTA
[4]  
Goldberg D.E, 1989, GENETIC ALGORITHMS S
[5]  
Handa H, 2006, GECCO 2006: Genetic and Evolutionary Computation Conference, Vol 1 and 2, P1195
[6]   Evolutionary optimization in uncertain environments - A survey [J].
Jin, Y ;
Branke, H .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2005, 9 (03) :303-317
[7]  
MORI N, 1997, ICGA, P299
[8]  
Sutton R. S., 1998, Reinforcement Learning: An Introduction, V22447
[9]  
TANAKA F, P 2003 IEEE INT S CO, V3, P1108
[10]   Classifier Fitness Based on Accuracy [J].
Wilson, Stewart W. .
EVOLUTIONARY COMPUTATION, 1995, 3 (02) :149-175