Adding memory condition to learning classifier systems to solve partially observable environments

被引:0
|
作者
Zang, Zhao Xiang [1 ]
Li, De Hua [1 ]
Wang, Jun Ying [2 ]
机构
[1] Huazhong Univ Sci & Technol, Inst Pattern Recognit & Artificial Intelligence, Wuhan 430074, Hubei, Peoples R China
[2] China Three Gorges Univ, Coll Comp & Informat Technol, Yichang 443000, Hubei, Peoples R China
关键词
learning classifier systems; LCSs; extended classifier system; XCS; internal memory; partially observable environments; aliasing state;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Within the paradigm of learning classifier systems, extended classifier system (XCS) is outstanding. However, the original XCS has no memory mechanism and can only learn optimal policy in Markovian environments, where the optimal action is determined solely by the state of current sensory input. But in practice, most environments are partially observable environments with respect to agent's sensation, and they form the most general class of environments: non-Markov environments. In these environments, XCS either fails completely, or only develops a suboptimal policy, since it is memoryless. In this paper, we develop a new learning classifier system based on XCS, named 'XCSMM', which adds an internal message to XCS as an internal memory, and then extends the classifier with a memory condition that is used to sense the internal memory. XCSMM holds a simple and clear memory mechanism, which is easy to understand and implement. Besides, four sets of different complex maze problems have been employed to test XCSMM. Experimental results show that XCSMM is able to evolve optimal or suboptimal solutions in most non- Markovian environments.
引用
收藏
页码:345 / 352
页数:8
相关论文
共 11 条
  • [1] Adding memory condition to learning classifier systems to solve partially observable environments
    Zang, Z.X. (zxzang@gmail.com), 1600, Inderscience Enterprises Ltd., 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (46): : 345 - 352
  • [2] A Recursive Classifier System for Partially Observable Environments
    Hamzeh, Ali
    Hashemi, Sattar
    Sami, Ashkan
    Rahmani, Adel
    FUNDAMENTA INFORMATICAE, 2009, 97 (1-2) : 15 - 40
  • [3] Learning classifier systems with memory condition to solve non-Markov problems
    Zhaoxiang Zang
    Dehua Li
    Junying Wang
    Soft Computing, 2015, 19 : 1679 - 1699
  • [4] Learning classifier systems with memory condition to solve non-Markov problems
    Zang, Zhaoxiang
    Li, Dehua
    Wang, Junying
    SOFT COMPUTING, 2015, 19 (06) : 1679 - 1699
  • [5] A new architecture for learning classifier systems to solve POMDP problems
    Hamzeh, Ali
    Rahmani, Adel
    FUNDAMENTA INFORMATICAE, 2008, 84 (3-4) : 329 - 351
  • [6] A Bayesian approach to learning classifier systems in uncertain environments
    Aliprandi, Davide
    Mancastroppa, Alex
    Matteucci, Matteo
    GECCO 2006: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOL 1 AND 2, 2006, : 1537 - +
  • [7] Memory Exploitation in Learning Classifier Systems
    Smith, Robert E.
    EVOLUTIONARY COMPUTATION, 1994, 2 (03) : 199 - 220
  • [8] Learning classifier systems to evolve classification rules for systems of memory constrained components
    Scheidler A.
    Middendorf M.
    Evolutionary Intelligence, 2011, 4 (03) : 127 - 143
  • [9] Model-free reinforcement learning for motion planning of autonomous agents with complex tasks in partially observable environments
    Li, Junchao
    Cai, Mingyu
    Kan, Zhen
    Xiao, Shaoping
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2024, 38 (01)
  • [10] Analysis of Online Signature Based Learning Classifier Systems for Noisy Environments: A Feedback Control Theoretic Approach
    Shafi, Kamran
    Abbass, Hussein A.
    SIMULATED EVOLUTION AND LEARNING (SEAL 2014), 2014, 8886 : 395 - 406