Generalized state values in an anticipatory learning classifier system

Cited by: 0
Authors
Butz, MV [1]
Goldberg, DE
Affiliations
[1] Univ Illinois, Illinois Genet Algorithms Lab, Urbana, IL 61801 USA
[2] Univ Wurzburg, Dept Cognit Psychol, Wurzburg, Germany
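Source
ANTICIPATORY BEHAVIOR IN ADAPTIVE LEARNING SYSTEMS: FOUNDATIONS, THEORIES, AND SYSTEMS | 2003 / Vol. 2684
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract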
This paper introduces generalized state values to the anticipatory learning classifier system ACS2. Previous studies showed that the evolving generalized state value in ACS2 might be overgeneral for a proper policy representation. Thus, the policy representation is separated from the model representation. A function approximation module is added that approximates state values. Actual action choice then depends on the learned generalized state values predicted by means of the predictive model, yielding anticipatory behavior. It is shown that the function approximation module accurately generalizes the state value function in the investigated MDP. Improvement of the approach by means of further anticipatory interaction between the predictive model learner and the state value learner is suggested. We also propose the implementation of task-dependent anticipatory attentional mechanisms exploiting the representation of the generalized state-value function. Finally, the anticipatory framework may be extended to support multiple motivations integrated in a motivational module, which could be influenced by emotional biases.
Pages: 282-301
Page count: 20