H∞ reinforcement learning control of robot manipulators using fuzzy wavelet networks

Cited by: 50

Authors:

Lin, Chuan-Kai [1]

Affiliations:

[1] Naval Acad, Dept Elect Engn, Kaohsiung 813, Taiwan

Source:

FUZZY SETS AND SYSTEMS | 2009 / Vol. 160 / No. 12

Keywords:

Fuzzy wavelet network (FWN); Reinforcement learning; H-infinity control; Robot manipulators; NEURAL-NETWORK; NONLINEAR-SYSTEMS; AUTOPILOT-DESIGN; ADAPTIVE-CONTROL; TRACKING CONTROL; ARCHITECTURE; AGENTS;

DOI:

10.1016/j.fss.2008.09.010

Chinese Library Classification (CLC) number:

TP301 [Theory and methods];

Discipline classification code:

081202;

Abstract:

In this paper, an H-infinity reinforcement learning controller based on a fuzzy wavelet network (FWN) is proposed to perform a position-tracking task for a robot manipulator. The proposed controller adopts the actor-critic reinforcement learning control scheme. The primary reinforcement is generated by a performance measurement unit. The learning unit of the controller consists of an associative search network (ASN) and an adaptive critic network (ACN). The ASN is employed to approximate unknown nonlinear functions in the robot dynamics and the ACN is utilized to construct a more informative signal than the primary reinforcement alone to tune the ASN. Since the FWN can provide accurate function approximation, both the ASN and ACN are implemented by the FWN. In addition, the proposed controller requires no prior knowledge about the dynamics of the robot manipulators and no off-line learning phase. Moreover, by employing the H-infinity control theory, it is possible to attenuate the effects of the approximation errors of the FWNs and external disturbances to a prescribed level. In contrast to the general H-infinity problem, only simple equations, rather than Riccati equations, should be solved. Computer simulations on a SCARA robot with 3 degrees-of-freedom confirm the effectiveness of the FWN-based controller with H-infinity stabilization. (C) 2008 Elsevier B.V. All rights reserved.

Pages: 1765-1786

Page count: 22
