Evolutionary computation on multitask reinforcement learning problems

被引：5

作者：

Handa, Hisashi ^{[1
]}

机构：

[1] Okayama Univ, Grad Sch Nat Sci & Technol, Okayama 7008530, Japan

来源：

2007 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING, AND CONTROL, VOLS 1 AND 2 | 2007年

关键词：

multitask reinforcement learning problems; evolutionary algorithms; dynamic environments;

D O I：

10.1109/ICNSC.2007.372862

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recently, Multitask learning, which can cope with several tasks, has attracted much attention. Multitask Reinforcement Learning introduced by Tanaka et al is a problem class where number of problem instances of Markov Decision Processes sampled from the same probability distributions is sequentially given to reinforcement learning agents. The purpose of solving this problem is to realize adaptive agents for newly given environments by using knowledge acquired from past experience. Evolutionary Algorithms are often used to solve reinforcement learning problems if problem classes are quite different with Markov Decision Processes or state-action space is quite huge. From the viewpoint of Evolutionary Algorithms studies, the Multitask Reinforcement Learning problems are regarded as dynamic problems whose fitness landscape has changed temporally. In this paper, a memory-based Evolutionary Programming which is suitable for Multitask Reinforcement Learning problems is proposed.

引用

页码：685 / 688

页数：4

共 11 条

[1] An Overview of Evolutionary Algorithms for Parameter Optimization [J].

Baeck, Thomas ;

Schwefel, Hans-Paul .

EVOLUTIONARY COMPUTATION, 1993, 1 (01) :1-23

[2]

Branke J., 1999, Proceedings of the IEEE Congress on Evolutionary Computation, Washington, DC, USA, DOI DOI 10.1109/CEC.1999.785502

[3]

FOGEL D, 1999, EVOLUTIONARY COMPUTA

[4]

Goldberg D.E, 1989, GENETIC ALGORITHMS S

[5]

Handa H, 2006, GECCO 2006: Genetic and Evolutionary Computation Conference, Vol 1 and 2, P1195

[6] Evolutionary optimization in uncertain environments - A survey [J].

Jin, Y ;

Branke, H .

IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2005, 9 (03) :303-317

[7]

MORI N, 1997, ICGA, P299

[8]

Sutton R. S., 1998, Reinforcement Learning: An Introduction, V22447

[9]

TANAKA F, P 2003 IEEE INT S CO, V3, P1108

[10] Classifier Fitness Based on Accuracy [J].

Wilson, Stewart W. .

EVOLUTIONARY COMPUTATION, 1995, 3 (02) :149-175

← 1 2 →