Reinforcement learning with approximation spaces

被引：0

作者：

Peters, James F. ^{[1
]}

Henry, Christopher ^{[1
]}

机构：

[1] Univ Manitoba, Dept Elect & Comp Engn, Winnipeg, MB R3T 5V6, Canada

来源：

FUNDAMENTA INFORMATICAE | 2006年 / 71卷 / 2-3期

关键词：

approximation space; ecosystem; ethology; Monte Carlo method; reinforcement learning; rough sets; swarm;

D O I：

暂无

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

This paper introduces a rough set approach to reinforcement learning by swarms of cooperating agents. The problem considered in this paper is how to guide reinforcement learning based on knowledge of acceptable behavior patterns. This is made possible by considering behavior patterns of swarms in the context of approximation spaces. Rough set theory introduced by Zdzislaw Pawlak in the early 1980s provides a ground for deriving pattern-based rewards within approximation spaces. Both conventional and approximation space-based forms of reinforcement comparison and the actor-critic method as well as two forms of the off-policy Monte Carlo learning control method are investigated in this article. The study of swarm behavior by collections of biologically-inspired bots is carried out in the context of an artificial ecosystem testbed. This ecosystem has an ethological basis that makes it possible to observe and explain the behavior of biological organisms that carries over into the study of reinforcement learning by interacting robotic devices. The results of ecosystem experiments with six forms of reinforcement learning are given. The contribution of this article is the presentation of several viable alternatives to conventional reinforcement learning methods defined in the context of approximation spaces.

引用

页码：323 / 349

页数：27

共 58 条

[1] [Anonymous], 1968, OUTLINE GEN TOPOLOGY
[2] [Anonymous], 1979, Monte Carlo Methods, DOI DOI 10.1007/978-94-009-5819-7
[3] [Anonymous], 1999, Swarm Intelligence
[4] [Anonymous], 1657, EXCERCITATIONUM MATH
[5] Bazan JG, 2004, LECT NOTES ARTIF INT, V3066, P346
[6] Efremovic VA., 1951, Mat. Sb., V31, P189
[7] GASKETT C, 2002, THESIS AUSTR NATL U
[8] Gomolinska A, 2005, LECT NOTES COMPUT SC, V3400, P57
[9] Reinforcement learning: A survey
Kaelbling, LP
Littman, ML
Moore, AW
[J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 : 237 - 285
[10] Komorowski J., 1999, ROUGH SETS TUTORIAL, P3

← 1 2 3 4 5 6 →