Adding memory condition to learning classifier systems to solve partially observable environments

被引：0

作者：

Zang, Zhao Xiang ^{[1
]}

Li, De Hua ^{[1
]}

Wang, Jun Ying ^{[2
]}

机构：

[1] Huazhong Univ Sci & Technol, Inst Pattern Recognit & Artificial Intelligence, Wuhan 430074, Hubei, Peoples R China

[2] China Three Gorges Univ, Coll Comp & Informat Technol, Yichang 443000, Hubei, Peoples R China

来源：

INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY | 2013年 / 46卷 / 04期

关键词：

learning classifier systems; LCSs; extended classifier system; XCS; internal memory; partially observable environments; aliasing state;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Within the paradigm of learning classifier systems, extended classifier system (XCS) is outstanding. However, the original XCS has no memory mechanism and can only learn optimal policy in Markovian environments, where the optimal action is determined solely by the state of current sensory input. But in practice, most environments are partially observable environments with respect to agent's sensation, and they form the most general class of environments: non-Markov environments. In these environments, XCS either fails completely, or only develops a suboptimal policy, since it is memoryless. In this paper, we develop a new learning classifier system based on XCS, named 'XCSMM', which adds an internal message to XCS as an internal memory, and then extends the classifier with a memory condition that is used to sense the internal memory. XCSMM holds a simple and clear memory mechanism, which is easy to understand and implement. Besides, four sets of different complex maze problems have been employed to test XCSMM. Experimental results show that XCSMM is able to evolve optimal or suboptimal solutions in most non- Markovian environments.

引用

页码：345 / 352

页数：8

共 11 条

[1] Adding memory condition to learning classifier systems to solve partially observable environments
Zang, Z.X. (zxzang@gmail.com), 1600, Inderscience Enterprises Ltd., 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (46): : 345 - 352
[2] A Recursive Classifier System for Partially Observable Environments
Hamzeh, Ali
Hashemi, Sattar
Sami, Ashkan
Rahmani, Adel
FUNDAMENTA INFORMATICAE, 2009, 97 (1-2) : 15 - 40
[3] Learning classifier systems with memory condition to solve non-Markov problems
Zhaoxiang Zang
Dehua Li
Junying Wang
Soft Computing, 2015, 19 : 1679 - 1699
[4] Learning classifier systems with memory condition to solve non-Markov problems
Zang, Zhaoxiang
Li, Dehua
Wang, Junying
SOFT COMPUTING, 2015, 19 (06) : 1679 - 1699
[5] A new architecture for learning classifier systems to solve POMDP problems
Hamzeh, Ali
Rahmani, Adel
FUNDAMENTA INFORMATICAE, 2008, 84 (3-4) : 329 - 351
[6] A Bayesian approach to learning classifier systems in uncertain environments
Aliprandi, Davide
Mancastroppa, Alex
Matteucci, Matteo
GECCO 2006: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOL 1 AND 2, 2006, : 1537 - +
[7] Memory Exploitation in Learning Classifier Systems
Smith, Robert E.
EVOLUTIONARY COMPUTATION, 1994, 2 (03) : 199 - 220
[8] Learning classifier systems to evolve classification rules for systems of memory constrained components
Scheidler A.
Middendorf M.
Evolutionary Intelligence, 2011, 4 (03) : 127 - 143
[9] Model-free reinforcement learning for motion planning of autonomous agents with complex tasks in partially observable environments
Li, Junchao
Cai, Mingyu
Kan, Zhen
Xiao, Shaoping
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2024, 38 (01)
[10] Analysis of Online Signature Based Learning Classifier Systems for Noisy Environments: A Feedback Control Theoretic Approach
Shafi, Kamran
Abbass, Hussein A.
SIMULATED EVOLUTION AND LEARNING (SEAL 2014), 2014, 8886 : 395 - 406

← 1 2 →