Reinforcement learning with pattern-based rewards

被引:0
作者
Peters, JF [1 ]
Henry, C [1 ]
Ramanna, S [1 ]
机构
[1] Univ Manitoba, Dept Elect & Comp Engn, Winnipeg, MB R3T 5V6, Canada
来源
PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE | 2005年
关键词
approximation space; ecosystem; intelligent systems; reinforcement learning; rough sets; swarm;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces an approach to deriving pattern-based rewards during reinforcement learning by cooperating agents. Rough set theory introduced by Zdzislaw Pawlak in 1982 provides a ground for deriving pattern-based rewards in the context of approximation spaces. The framework provided by an approximation space makes it possible to derive pattern-based reference rewards used to compute action rewards as well as action preferences. Approximation spaces are used to derive action-based reference rewards at the swarm intelligence level. Two different forms of reinforcement comparison are considered as a part of a study of learning in real-time by a swarm. In addition, this article introduces an artificial ecosystem test-bed that makes it possible to study learning by collections of biologically-inspired bots. The contribution of this article is the introduction of an approach to rewarding swarm behavior in the context of approximation spaces.
引用
收藏
页码:267 / 272
页数:6
相关论文
共 19 条
[1]  
[Anonymous], SOFT COMPUTING SCI
[2]  
Bonabeau Eric, 1999, Swarm intelligence: from natural to artificial systems
[3]   Reinforcement learning: A survey [J].
Kaelbling, LP ;
Littman, ML ;
Moore, AW .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 :237-285
[4]  
Komorowski J., 1999, Rough Fuzzy Hybridization: A New Trend in Decision Making, P3
[5]   ROUGH SETS [J].
PAWLAK, Z .
INTERNATIONAL JOURNAL OF COMPUTER & INFORMATION SCIENCES, 1982, 11 (05) :341-356
[6]  
Pawlak Z, 1991, Theory and decision library: series D
[7]  
Peters J. F., 2004, ENG APPL ARTIF INTEL, V17, P1
[8]   Approximation spaces for hierarchical intelligent behavioral system models [J].
Peters, JF .
MONITORING, SECURITY, AND RESCUE TECHNIQUES IN MULTIAGENT SYSTEMS, 2005, :13-30
[9]  
Peters JF, 2004, LECT NOTES COMPUT SC, V3213, P764
[10]  
Peters JF, 2003, STUD FUZZ SOFT COMP, V116, P141