Robot reinforcement learning accuracy-based learning classifier systems with Fuzzy Policy Gradient descent(XCS-FPGRL)

被引：0

作者：

Shao, Jie ^{[1
]}

Yu, Jingru ^{[1
]}

机构：

[1] Zhengzhou Chenggong Univ Finance & Econ, Dept Informat Engn, Zhengzhou 451200, Peoples R China

来源：

PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCES IN MECHANICAL ENGINEERING AND INDUSTRIAL INFORMATICS | 2015年 / 15卷

关键词：

Convergence; Rrobot; Reinforcement learning; Accuracy-based learning classifier system with Gradient descent (XCS-FPGRL); XCS (Accuracy-based learning classifier system);

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presented a novel approach XCS-FPGRL to research on robot reinforcement learning. XCS-FPGRL combines covering operator and genetic algorithm. The systems is responsible for adjusting precision and reducing search space according to some reward obtained from the environment, acts as an innovation discovery component which is responsible for discovering new better reinforcement learning rules. The experiment and simulation showed that robot reinforcement learning can achieved convergence very quickly.

引用

页码：1013 / 1018

页数：6

共 14 条

[1]

Baneamoon S. M., 2007, INT J INTELLIGENT TE, V2, P172

[2]

Baneamoon SM, 2008, 2008 INT C EL DES, P930

[3]

Bay S. J., 1995, MOBILE ROBOTS, P88

[4]

bull L., 2004, APPL LEARNING CLASSI, P276

[5]

Bull L., 2003, A Simple Accuracy-based Learning Classifier System

[6]

Bull L., 2005, FDN LEARNING CLASSIF, P1

[7] Learning classifier system ensembles with rule-sharing [J].

Bull, Larry ;

Studley, Matthew ;

Bagnall, Anthony ;

Whittley, Ian .

IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2007, 11 (04) :496-502

[8]

Dixon PW, 2002, LECT NOTES ARTIF INT, V2321, P133

[9]

Gemeinder M., 2003, Applied Soft Computing, V3, P149, DOI DOI 10.1016/S1568-4946(03)00010-3

[10]

Lan Ting, 2007, Robot, V29, P298

← 1 2 →