Nonmonotone BFGS-trained recurrent neural networks for temporal sequence processing

被引:12
作者
Peng, Chun-Cheng [1 ]
Magoulas, George D. [1 ]
机构
[1] Univ London Birkbeck Coll, Dept Comp Sci & Informat Syst, London WC1E 7HX, England
关键词
Recurrent neural networks; Quasi-Newton methods; BFGS updates; Nonmonotone methods; Second-order training algorithms; Temporal sequence; LINE SEARCH TECHNIQUE; ALGORITHMS; OPTIMIZATION; DESCENT; CONSTRUCTION;
D O I
10.1016/j.amc.2010.12.012
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
self-In this paper we propose a nonmonotone approach to recurrent neural networks training for temporal sequence processing applications. This approach allows learning performance to deteriorate in some iterations, nevertheless the network's performance is improved over time. A self-scaling BFGS is equipped with an adaptive nonmonotone technique that employs approximations of the Lipschitz constant and is tested on a set of sequence processing problems. Simulation results show that the proposed algorithm outperforms the BFGS as well as other methods previously applied to these sequences, providing an effective modification that is capable of training recurrent networks of various architectures. (C) 2010 Elsevier Inc. All rights reserved.
引用
收藏
页码:5421 / 5441
页数:21
相关论文
共 66 条
  • [1] Meta learning evolutionary artificial neural networks
    Abraham, A
    [J]. NEUROCOMPUTING, 2004, 56 (1-4) : 1 - 38
  • [2] Numerical experience with a class of self-scaling quasi-Newton algorithms
    Al-Baali, M
    [J]. JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1998, 96 (03) : 533 - 553
  • [3] [Anonymous], 1998, Introduction to connectionist modelling of cognitive processes
  • [4] [Anonymous], 1986, Proceedings of the 8th Annual Conference of the Cognitive Science Society
  • [5] [Anonymous], 1996, Neural fuzzy systems
  • [6] Asirvadam V. S., 2004, Proceedings of the 2004 IEEE International Conference on Control Applications (IEEE Cat. No.04CH37596), P586, DOI 10.1109/CCA.2004.1387275
  • [7] ASSAAD M, 2005, P 15 INT C ART NEUR, P169
  • [8] Bajramovic F, 2004, IEEE IJCNN, P837
  • [9] GRADIENT DESCENT LEARNING ALGORITHM OVERVIEW - A GENERAL DYNAMICAL-SYSTEMS PERSPECTIVE
    BALDI, P
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1995, 6 (01): : 182 - 195
  • [10] Trajectory priming with dynamic fuzzy networks in nonlinear optimal control
    Becerikli, Y
    Oysal, Y
    Konar, AF
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2004, 15 (02): : 383 - 394