Efficient implementation of dynamic fuzzy Q-learning

Cited by: 0
Authors
Deng, C [1]
Er, MJ [1]
Affiliations
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore
Source
ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS | 2003
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a Dynamic Fuzzy Q-Learning (DFQL) method capable of tuning Fuzzy Inference Systems (FIS) online. Online self-organizing learning is developed so that structure and parameter identification are accomplished automatically and simultaneously. Self-organizing fuzzy inference is introduced to calculate actions and Q-functions, which makes it possible to handle continuous-valued states and actions. We provide conditions for the convergence of the algorithm. Furthermore, learning methods based on a bias component and eligibility traces for rapid reinforcement learning are discussed.
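To illustrate how fuzzy inference can produce continuous actions and Q-values, the following is a minimal Python sketch of one step of generic fixed-structure fuzzy Q-learning with eligibility traces. It is not the paper's DFQL: the online rule generation and the bias component are omitted, and the Gaussian membership functions, the function names, and the parameters `alpha`, `gamma`, `lam`, and `epsilon` are all assumptions made for illustration.

```python
import numpy as np

def firing_strengths(x, centers, widths):
    """Normalized Gaussian firing strength of each fuzzy rule for state x."""
    x = np.asarray(x, dtype=float)
    d2 = (((x - centers) / widths) ** 2).sum(axis=1)
    phi = np.exp(-d2)
    return phi / phi.sum()

def select_local_actions(q, epsilon, rng):
    """Epsilon-greedy choice of one candidate action index per rule."""
    n_rules, n_actions = q.shape
    greedy = q.argmax(axis=1)
    random_choice = rng.integers(n_actions, size=n_rules)
    explore = rng.random(n_rules) < epsilon
    return np.where(explore, random_choice, greedy)

def global_action_and_q(phi, q, chosen, actions):
    """Continuous action and Q-value as firing-strength-weighted sums."""
    rows = np.arange(len(phi))
    action = float(phi @ actions[chosen])
    q_value = float(phi @ q[rows, chosen])
    return action, q_value

def td_update(q, traces, phi, chosen, reward, q_value, phi_next,
              alpha=0.1, gamma=0.9, lam=0.8):
    """One TD update with accumulating eligibility traces (modifies q, traces)."""
    v_next = float(phi_next @ q.max(axis=1))  # greedy value of the next state
    td_error = reward + gamma * v_next - q_value
    traces *= gamma * lam                     # decay all traces
    traces[np.arange(len(phi)), chosen] += phi  # reinforce the chosen actions
    q += alpha * td_error * traces
    return td_error
```

Each rule holds one q-value per candidate action. Blending the locally chosen actions by firing strength is what lets a discrete candidate set yield a continuous action.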
Pages: 1854-1858
Page count: 5
Related papers
6 in total
  • [1] Fuzzy inference system learning by reinforcement methods
    Jouffe, L
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 1998, 28 (03) : 338 - 355
  • [3] Continuous-action Q-learning
    Millán, JDR
    Posenato, D
    Dedieu, E
    [J]. MACHINE LEARNING, 2002, 49 (2-3) : 247 - 265
  • [4] Incremental multi-step Q-learning
    Peng, J
    Williams, RJ
    [J]. MACHINE LEARNING, 1996, 22 (1-3) : 283 - 290
  • [5] Feature-based methods for large scale dynamic programming
    Tsitsiklis, JN
    VanRoy, B
    [J]. MACHINE LEARNING, 1996, 22 (1-3) : 59 - 94
  • [6] A fast approach for automatic generation of fuzzy rules by generalized dynamic fuzzy neural networks
    Wu, SQ
    Er, MJ
    Gao, Y
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2001, 9 (04) : 578 - 594