Nonmonotone BFGS-trained recurrent neural networks for temporal sequence processing

被引：12

作者：

Peng, Chun-Cheng ^{[1
]}

Magoulas, George D. ^{[1
]}

机构：

[1] Univ London Birkbeck Coll, Dept Comp Sci & Informat Syst, London WC1E 7HX, England

来源：

APPLIED MATHEMATICS AND COMPUTATION | 2011年 / 217卷 / 12期

关键词：

Recurrent neural networks; Quasi-Newton methods; BFGS updates; Nonmonotone methods; Second-order training algorithms; Temporal sequence; LINE SEARCH TECHNIQUE; ALGORITHMS; OPTIMIZATION; DESCENT; CONSTRUCTION;

D O I：

10.1016/j.amc.2010.12.012

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

self-In this paper we propose a nonmonotone approach to recurrent neural networks training for temporal sequence processing applications. This approach allows learning performance to deteriorate in some iterations, nevertheless the network's performance is improved over time. A self-scaling BFGS is equipped with an adaptive nonmonotone technique that employs approximations of the Lipschitz constant and is tested on a set of sequence processing problems. Simulation results show that the proposed algorithm outperforms the BFGS as well as other methods previously applied to these sequences, providing an effective modification that is capable of training recurrent networks of various architectures. (C) 2010 Elsevier Inc. All rights reserved.

引用

页码：5421 / 5441

页数：21

共 66 条

[1] Meta learning evolutionary artificial neural networks
Abraham, A
[J]. NEUROCOMPUTING, 2004, 56 (1-4) : 1 - 38
[2] Numerical experience with a class of self-scaling quasi-Newton algorithms
Al-Baali, M
[J]. JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1998, 96 (03) : 533 - 553
[3] [Anonymous], 1998, Introduction to connectionist modelling of cognitive processes
[4] [Anonymous], 1986, Proceedings of the 8th Annual Conference of the Cognitive Science Society
[5] [Anonymous], 1996, Neural fuzzy systems
[6] Asirvadam V. S., 2004, Proceedings of the 2004 IEEE International Conference on Control Applications (IEEE Cat. No.04CH37596), P586, DOI 10.1109/CCA.2004.1387275
[7] ASSAAD M, 2005, P 15 INT C ART NEUR, P169
[8] Bajramovic F, 2004, IEEE IJCNN, P837
[9] GRADIENT DESCENT LEARNING ALGORITHM OVERVIEW - A GENERAL DYNAMICAL-SYSTEMS PERSPECTIVE
BALDI, P
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1995, 6 (01): : 182 - 195
[10] Trajectory priming with dynamic fuzzy networks in nonlinear optimal control
Becerikli, Y
Oysal, Y
Konar, AF
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2004, 15 (02): : 383 - 394

← 1 2 3 4 5 6 7 →