Reinforcement learning with approximation spaces

Cited by: 0
Authors
Peters, James F. [1 ]
Henry, Christopher [1 ]
Affiliation
[1] Univ Manitoba, Dept Elect & Comp Engn, Winnipeg, MB R3T 5V6, Canada
Keywords
approximation space; ecosystem; ethology; Monte Carlo method; reinforcement learning; rough sets; swarm;
DOI
Not available
Chinese Library Classification
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
This paper introduces a rough set approach to reinforcement learning by swarms of cooperating agents. The problem considered in this paper is how to guide reinforcement learning based on knowledge of acceptable behavior patterns. This is made possible by considering behavior patterns of swarms in the context of approximation spaces. Rough set theory, introduced by Zdzislaw Pawlak in the early 1980s, provides a basis for deriving pattern-based rewards within approximation spaces. Both conventional and approximation space-based forms of reinforcement comparison and the actor-critic method, as well as two forms of the off-policy Monte Carlo learning control method, are investigated in this article. The study of swarm behavior by collections of biologically inspired bots is carried out in the context of an artificial ecosystem testbed. This ecosystem has an ethological basis that makes it possible to observe and explain the behavior of biological organisms in a way that carries over into the study of reinforcement learning by interacting robotic devices. The results of ecosystem experiments with six forms of reinforcement learning are given. The contribution of this article is the presentation of several viable alternatives to conventional reinforcement learning methods defined in the context of approximation spaces.
Pages: 323-349
Page count: 27
Related papers
58 in total
  • [21] Pawlak, 1981, 429 POL AC SCI
  • [22] ROUGH SETS
    PAWLAK, Z
    [J]. INTERNATIONAL JOURNAL OF COMPUTER & INFORMATION SCIENCES, 1982, 11 (05): : 341 - 356
  • [23] PAWLAK Z, 1981, 431 POL AC SCI I COM
  • [24] PAWLAK Z, 1991, KNOWLEDGE ENG PROBLE, V9
  • [25] Peters JE, 2002, FUND INFORM, V51, P157
  • [26] Peters JF, 2005, PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, P267
  • [27] Rough ethograms: Study of intelligent system behavior
    Peters, JF
    Henry, C
    Ramanna, S
    [J]. INTELLIGENT INFORMATION PROCESSING AND WEB MINING, PROCEEDINGS, 2005, : 117 - 126
  • [28] Approximation spaces for hierarchical intelligent behavioral system models
    Peters, JF
    [J]. MONITORING, SECURITY, AND RESCUE TECHNIQUES IN MULTIAGENT SYSTEMS, 2005, : 13 - 30
  • [29] Peters JF, 2005, LECT NOTES COMPUT SC, V3400, P153
  • [30] Peters JF, 2004, LECT NOTES COMPUT SC, V3213, P764