Reinforcement learning with pattern-based rewards

Cited by: 0

Authors:

Peters, JF [1]

Henry, C [1]

Ramanna, S [1]

Affiliation:

[1] Univ Manitoba, Dept Elect & Comp Engn, Winnipeg, MB R3T 5V6, Canada

Source:

PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE | 2005

Keywords:

approximation space; ecosystem; intelligent systems; reinforcement learning; rough sets; swarm

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

This paper introduces an approach to deriving pattern-based rewards during reinforcement learning by cooperating agents. Rough set theory, introduced by Zdzislaw Pawlak in 1982, provides a basis for deriving pattern-based rewards in the context of approximation spaces. The framework provided by an approximation space makes it possible to derive pattern-based reference rewards, which are used to compute action rewards as well as action preferences. Approximation spaces are used to derive action-based reference rewards at the swarm intelligence level. Two different forms of reinforcement comparison are considered as part of a study of real-time learning by a swarm. In addition, this paper introduces an artificial ecosystem test-bed that makes it possible to study learning by collections of biologically inspired bots. The contribution of this paper is an approach to rewarding swarm behavior in the context of approximation spaces.
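The abstract refers to reinforcement comparison, where a reference reward serves as a baseline for updating action preferences. The sketch below shows the standard textbook form of that scheme, together with one possible rough-set quantity (coverage of a lower approximation) that could feed a pattern-based reference. It is an illustration under stated assumptions, not the authors' method: the function names, the step sizes `alpha` and `beta`, and the `rough_coverage` measure are all hypothetical and are not taken from this record.

```python
import math
import random


def softmax(preferences):
    """Convert action preferences into selection probabilities."""
    m = max(preferences)  # subtract the max for numerical stability
    exps = [math.exp(p - m) for p in preferences]
    total = sum(exps)
    return [e / total for e in exps]


def reinforcement_comparison(reward_fn, n_actions, steps=2000,
                             alpha=0.1, beta=0.1, seed=0):
    """Textbook reinforcement comparison on a stateless task.

    Assumption: the reference reward is a running average of past rewards.
    A pattern-based variant would replace that average with a value
    derived from an approximation space.
    """
    rng = random.Random(seed)
    preferences = [0.0] * n_actions
    reference = 0.0
    for _ in range(steps):
        probs = softmax(preferences)
        action = rng.choices(range(n_actions), weights=probs)[0]
        reward = reward_fn(action, rng)
        # Raise the preference if the reward beat the reference, else lower it.
        preferences[action] += beta * (reward - reference)
        # Move the reference reward toward the latest reward.
        reference += alpha * (reward - reference)
    return preferences, reference


def lower_approximation(blocks, target):
    """Union of the equivalence blocks that lie entirely inside target."""
    result = set()
    for block in blocks:
        if block <= target:
            result |= block
    return result


def rough_coverage(block, lower):
    """Hypothetical measure: fraction of the lower approximation covered by block."""
    return len(block & lower) / len(lower) if lower else 0.0
```

Here `reward_fn(action, rng)` is any callable that returns a numeric reward, and `blocks` is a partition of observed behaviors into sets of indiscernible objects. Averaging `rough_coverage` over the blocks would be one way to obtain a pattern-based reference value in place of the running average.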

Citation

Pages: 267-272

Page count: 6

References: 19 in total

[1] [Anonymous], SOFT COMPUTING SCI

[2] Bonabeau Eric, 1999, Swarm intelligence: from natural to artificial systems

[3] Kaelbling, LP; Littman, ML; Moore, AW. Reinforcement learning: A survey [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4: 237-285

[4] Komorowski J., 1999, Rough Fuzzy Hybridization: A New Trend in Decision Making, P3

[5] PAWLAK, Z. ROUGH SETS [J]. INTERNATIONAL JOURNAL OF COMPUTER & INFORMATION SCIENCES, 1982, 11 (05): 341-356

[6] Pawlak Z, 1991, Theory and decision library: series D

[7] Peters J. F., 2004, ENG APPL ARTIF INTEL, V17, P1

[8] Peters, JF. Approximation spaces for hierarchical intelligent behavioral system models [J]. MONITORING, SECURITY, AND RESCUE TECHNIQUES IN MULTIAGENT SYSTEMS, 2005: 13-30

[9] Peters JF, 2004, LECT NOTES COMPUT SC, V3213, P764

[10] Peters JF, 2003, STUD FUZZ SOFT COMP, V116, P141
