Robot learning with GA-based fuzzy reinforcement learning agents

Cited by: 47

Authors:

Zhou, CJ [1]

Affiliations:

[1] Singapore Polytech, Sch Elect & Elect Engn, Singapore 139651, Singapore

Source:

INFORMATION SCIENCES | 2002 / Vol. 145 / Issue 1-2

Keywords:

robot learning; reinforcement learning; genetic algorithms; neural fuzzy systems; learning agents; biped robot

DOI:

10.1016/S0020-0255(02)00223-2

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

How a robot can learn from both expert knowledge and measurement-based information to acquire perception and motor skills is a challenging research topic in the field of autonomous robotic systems. To address it, a general GA (genetic algorithm)-based fuzzy reinforcement learning (GAFRL) agent is proposed in this paper. We first characterize the robot learning problem and point out some major issues that need to be addressed in conjunction with reinforcement learning. Based on a neural fuzzy network architecture of the GAFRL agent, we then discuss how different kinds of expert knowledge and measurement-based information can be incorporated in the GAFRL agent so as to accelerate its learning. By making use of the global optimization capability of GAs, the GAFRL agent can solve the local minima problem in traditional actor-critic reinforcement learning. Conversely, with the prediction capability of the critic network, GAs can evaluate the candidate solutions regularly, even during periods without external feedback from the environment. This can guide GAs to perform a more effective global search. Finally, different types of GAFRL agents are constructed and verified using the simulation model of a physical biped robot. © 2002 Elsevier Science Inc. All rights reserved.
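The interplay described in the abstract, where a critic's prediction stands in for external feedback when a GA scores its candidates, can be illustrated with a small sketch. Everything in it is an assumption made for illustration and is not taken from the paper: `critic_predict` is a hypothetical toy function in place of a trained critic network, candidates are plain two-element parameter lists rather than neural fuzzy network parameters, and the GA uses simple truncation selection, uniform crossover, and Gaussian mutation.

```python
import random


def critic_predict(params):
    """Hypothetical critic: predicted value of a candidate.

    A toy quadratic with its peak at (1.0, -2.0); a real critic would be a
    trained network, which this sketch does not model.
    """
    x, y = params
    return -((x - 1.0) ** 2 + (y + 2.0) ** 2)


def evolve(fitness, pop_size=20, generations=40, sigma=0.3, seed=0):
    """Toy elitist GA that scores candidates with `fitness` only."""
    rng = random.Random(seed)
    population = [
        [rng.uniform(-5.0, 5.0), rng.uniform(-5.0, 5.0)] for _ in range(pop_size)
    ]
    history = []  # best predicted value seen at the start of each generation
    for _ in range(generations):
        # Rank candidates by the critic's prediction; no external reward is used.
        population.sort(key=fitness, reverse=True)
        history.append(fitness(population[0]))
        # Truncation selection: the better half survives unchanged (elitism).
        parents = population[: pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            mother, father = rng.sample(parents, 2)
            # Uniform crossover per gene, followed by Gaussian mutation.
            child = [
                rng.choice(genes) + rng.gauss(0.0, sigma)
                for genes in zip(mother, father)
            ]
            children.append(child)
        population = parents + children
    best = max(population, key=fitness)
    return best, history


best, history = evolve(critic_predict)
print(best, history[0], history[-1])
```

Because the surviving parents are carried over unchanged, the best predicted value never decreases from one generation to the next. In this sketch the search is only as good as the stand-in critic, which is why a critic trained on real reinforcement signals matters in the setting the abstract describes.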


Pages: 45-68

Page count: 24
