Continuous action reinforcement learning automata and their application to adaptive digital filter design

被引：31

作者：

Howell, MN ^{[1
]}

Gordon, TJ ^{[1
]}

机构：

[1] Loughborough Univ Technol, Dept Aeronaut & Automot Engn, Loughborough LE11 3TU, Leics, England

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2001年 / 14卷 / 05期

关键词：

reinforcement learning; adaptive signal processing; system identification;

D O I：

10.1016/S0952-1976(01)00034-3

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In the design of adaptive IIR filters, the multi-modal nature of the error surfaces can limit the use of gradient-based and other iterative search methods. Stochastic learning automata have previously been shown to have global optimisation properties making them suitable for the optimisation of filter coefficients. Continuous action reinforcement learning automata are presented as an extension to the standard automata which operate over discrete parameter sets. Global convergence is claimed, and demonstrations are carried out via a number of computer simulations. (C) 2002 Elsevier Science Ltd. All rights reserved.

引用

页码：549 / 561

页数：13

共 13 条

[1]

[Anonymous], 1994, LEARNING AUTOMATA TH

[2]

Baba N., 1984, LECT NOTES CONTROL I

[3]

FAN H, 1986, IEEE T CIRCUITS SYST, V33, P939

[4] Moderated reinforcement learning of active and semi-active vehicle suspension control laws [J].

Frost, GP ;

Gordon, TJ ;

Howell, MN ;

Wu, QH .

PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 1996, 210 (04) :249-257

[5]

HOLLAND JH, 1975, ADAPTATION NATURAL A

[6] Continuous action reinforcement learning applied to vehicle suspension control [J].

Howell, MN ;

Frost, GP ;

Gordon, TJ ;

Wu, QH .

MECHATRONICS, 1997, 7 (03) :263-276

[7]

JOHNSON CR, 1977, P IEEE, V65, P1399, DOI 10.1109/PROC.1977.10722

[8]

NARENDRA K, 1989, LEARNING AUTOMATA IN

[9]

SHYNK J, 1989, IEEE ASSP MAG APR, P4

[10] STOCHASTIC LEARNING AUTOMATA AND ADAPTIVE IIR FILTERS [J].

TANG, CKK ;

MARS, P .

IEE PROCEEDINGS-F RADAR AND SIGNAL PROCESSING, 1991, 138 (04) :331-341

← 1 2 →