STATE AGGREGATION FOR REINFORCEMENT LEARNING USING NEUROEVOLUTION

被引:0
作者
Wright, Robert [1 ]
Gemelli, Nathaniel [1 ]
机构
[1] AF Res Lab, Informat Directorate, 525 Brooks Rd, Rome, NY 13441 USA
来源
ICAART 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE | 2009年
关键词
Reinforcement learning; NeuroEvolution; Evolutionary algorithms; State aggregation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a new machine learning algorithm, RL-SANE, which uses a combination of neuroevolution (NE) and traditional reinforcement learning (RL) techniques to improve learning performance. RL-SANE is an innovative combination of the neuroevolutionary algorithm NEAT(Stanley, 2004) and the RL algorithm Sarsa(lambda)(Sutton and Barto, 1998). It uses the special ability of NEAT to generate and train customized neural networks that provide a means for reducing the size of the state space through state aggregation. Reducing the size of the state space through aggregation enables Sarsa(lambda) to be applied to much more difficult problems than standard tabular based approaches. Previous similar work in this area, such as in Whiteson and Stone (Whiteson and Stone, 2006) and Stanley and Miikkulainen (Stanley and Miikkulainen, 2001), have shown positive and promising results. This paper gives a brief overview of neuroevolutionary methods, introduces the RL-SANE algorithm, presents a comparative analysis of RL-SANE to other neuroevolutionary algorithms, and concludes with a discussion of enhancements that need to be made to RL-SANE.
引用
收藏
页码:45 / +
页数:2
相关论文
共 15 条
  • [1] Boyan J. A., 1995, Advances in Neural Information Processing Systems 7, P369
  • [2] CARRERAS M, 2002, IEEE TTTC INT C AUT
  • [3] Gomez FJ, 1999, IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2, P1356
  • [4] JAMES D, 2004, P 2004 C GEN EV COMP
  • [5] Forming Neural Networks Through Efficient and Adaptive Coevolution
    Moriarty, David E.
    Miikkulainen, Risto
    [J]. EVOLUTIONARY COMPUTATION, 1997, 5 (04) : 373 - 399
  • [6] Rumelhart D. E., 1988, Learning Representations by Back-Propagating Errors, P696
  • [7] SIEBEL NT, 2007, EFFICIENT LEARNING N, V4713
  • [8] Singh S. P., 1995, Advances in Neural Information Processing Systems 7, P361
  • [9] STANLEY K, 2001, EVOLVING NEURAL NETW
  • [10] Stanley K O, 2002, Proceedings of the 4th Annual Conference on Genetic and Evolutionary Computation, P569