HIERARCHICAL NEURO-FUZZY MODELS BASED ON REINFORCEMENT LEARNING FOR AUTONOMOUS AGENTS

被引：0

作者：

Figueiredo, Karla ^{[1
,2
]}

Vellasco, Marley ^{[1
]}

Pacheco, Marco ^{[1
]}

de Souza, Flavio Joaquim ^{[3
]}

机构：

[1] Pontif Catholic Univ Rio de Janeiro, Dept Elect Engn, Rua Marques de Sao Vicente 225, BR-224:3190 Rio De Janeiro, Brazil

[2] Univ Estadual Zona Oeste, Dept Appl Math & Computat Sci, BR-23070200 Rio De Janeiro, Brazil

[3] Univ Estado Rio de Janeiro, Dept Syst & Comp Engn, Rio De Janeiro, Brazil

来源：

INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL | 2014年 / 10卷 / 04期

关键词：

Reinforcement learning; Autonomous agents; Hybrid neuro-fuzzy; Hierarchical partitioning; Robotics;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This work introduces a new class of neuro-uzzy systems for intelligent agents, called ReinfolrenteUt Learning - Hierarchical Neuro-Puzzy Systent. This new class combines a hierarchical partitioning method of the input space with a Reinforcement Learning algorithm to achieve the following important characteristics: automatic creation of the model's structure; self-adjustment of the pammeters; autonomous learning of the actions; capacity to deal with a greater number of inputs; and automatic generation of linguistic fuzzy rules. The proposed model was devised to overcome limitations of traditional reinforcement learning methods based on lookup tables, particularly in applications involving continuous environments and/or environments considered to he high dimensional. The paper details the hierarchical neuro-fuzzy architecture, its basic cell, and the learning algorithm. The performance of the proposed system was evaluated in four benchmark applications the Mountain Car Problem, the Cart-Centering Problem. the Inverted Pendulum and the Khepera Robot Control. The results obtained demonstrate the capacity of the novel hierarchical neuro-fuzzy system to automatically extract knowledge from the agent's direct interaction with large and/or continuous ClItliVOTIMCIttS. This knowledge is in the form of fuzzy linguistic rules, with no prior definition of the number and position of the fuzzy sets.

引用

页码：1471 / 1494

页数：24

共 44 条

[1]

Abraham A., 2005, FUZZY SYSTEM ENG THE

[2]

ALBUS J S, 1971, Mathematical Biosciences, V10, P25, DOI 10.1016/0025-5564(71)90051-4

[3]

ALBUS JS, 1975, ASME, V97, P220, DOI DOI 10.1115/1.3426922

[4]

Berenji HR, 1998, 1998 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AT THE IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE - PROCEEDINGS, VOL 1-2, P622, DOI 10.1109/FUZZY.1998.687560

[5]

Berenji HR, 1996, FUZZ-IEEE '96 - PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, P2208, DOI 10.1109/FUZZY.1996.553542

[6] Reinforcement distribution in fuzzy Q-learning [J].

Bonarini, Andrea ;

Lazaric, Alessandro ;

Montrone, Francesco ;

Restelli, Marcello .

FUZZY SETS AND SYSTEMS, 2009, 160 (10) :1420-1443

[7]

Bowling Michael, 2001, P IJCAI 2001, V17, P1021

[8]

Boyars A., 1995, ADV NEURAL INFORM PR

[9]

BROWN M, 1995, PROCEEDINGS OF 1995 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS I-IV, P2139, DOI 10.1109/FUZZY.1995.409976

[10]

Busoniu L, 2008, LECT NOTES ARTIF INT, V4865, P27, DOI 10.1007/978-3-540-77949-0_3

← 1 2 3 4 5 →