Nonmonotone Levenberg-Marquardt training of recurrent neural architectures for processing symbolic sequences

Cited by: 7
Authors
Peng, Chun-Cheng [1]
Magoulas, George D. [1]
Affiliations
[1] Univ London, Birkbeck Coll, Dept Comp Sci & Informat Syst, London WC1E 7HX, England
Source
NEURAL COMPUTING & APPLICATIONS | 2011 / Vol. 20 / No. 06
Keywords
Levenberg-Marquardt methods; Nonmonotone learning; Recurrent neural networks; ALGORITHM; CONVERGENCE; NETWORKS;
DOI
10.1007/s00521-010-0493-2
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we present nonmonotone variants of the Levenberg-Marquardt (LM) method for training recurrent neural networks (RNNs). These methods inherit the benefits of previously developed LM-with-momentum algorithms and are equipped with nonmonotone criteria, which allow temporary increases in the training error, and with an adaptive scheme for tuning the size of the nonmonotone sliding window. The proposed algorithms are applied to training RNNs of various sizes and architectures on symbolic sequence-processing problems. Experiments show that the proposed nonmonotone learning algorithms train RNNs for sequence processing more effectively than the original monotone methods.
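The nonmonotone idea in the abstract can be illustrated with a small sketch. It is not the authors' algorithm: it assumes the common max-over-window acceptance rule, in which a candidate step is accepted when its error does not exceed the largest error among the most recent accepted iterates. The function names, the fixed window size, and the example error values are hypothetical, and the adaptive window tuning and the LM update itself are left out.

```python
from collections import deque


def nonmonotone_accept(new_error, recent_errors):
    """Assumed rule: accept if the new error is no worse than the
    largest error in the window of recently accepted iterates."""
    return new_error <= max(recent_errors)


def filter_steps(candidate_errors, window):
    """Apply the acceptance rule to a sequence of candidate errors.

    The first value is the starting error. Returns the list of errors
    of the accepted iterates. With window=1 this reduces to the usual
    monotone test (error must not increase).
    """
    history = deque([candidate_errors[0]], maxlen=window)
    accepted = [candidate_errors[0]]
    for err in candidate_errors[1:]:
        if nonmonotone_accept(err, history):
            history.append(err)  # oldest entry drops out automatically
            accepted.append(err)
        # A rejected step would normally trigger a larger LM damping
        # parameter and a retry; that part is omitted here.
    return accepted


if __name__ == "__main__":
    errors = [10.0, 8.0, 9.0, 12.0, 7.0]  # hypothetical training errors
    print(filter_steps(errors, window=3))
    print(filter_steps(errors, window=1))
```

With a window of 3 the step that raises the error from 8.0 to 9.0 is tolerated, whereas the monotone case (window of 1) rejects it; both reject the jump to 12.0.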
Pages: 897 - 908
Page count: 12
Related papers
50 in total
  • [1] Nonmonotone Levenberg–Marquardt training of recurrent neural architectures for processing symbolic sequences
    Chun-Cheng Peng
    George D. Magoulas
    Neural Computing and Applications, 2011, 20 : 897 - 908
  • [2] Recursive Bayesian Levenberg-Marquardt training of recurrent neural networks
    Mirikitani, Derrick
    Nikolaev, Nikolay
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 282 - 287
  • [3] Nonmonotone Levenberg-Marquardt algorithms and their convergence analysis
    Zhang, JZ
    Chen, LH
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1997, 92 (02) : 393 - 418
  • [4] Modified Levenberg-Marquardt Method for Neural Networks Training
    Suratgar, Amir Abolfazl
    Tavakoli, Mohammad Bagher
    Hoseinabadi, Abbas
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 6, 2005, : 46 - 48
  • [5] Levenberg-Marquardt Training Algorithms for Random Neural Networks
    Basterrech, Sebastian
    Mohammed, Samir
    Rubino, Gerardo
    Soliman, Mostafa
    COMPUTER JOURNAL, 2011, 54 (01): : 125 - 135
  • [6] Adaptive Levenberg-Marquardt Algorithm: A New Optimization Strategy for Levenberg-Marquardt Neural Networks
    Yan, Zhiqi
    Zhong, Shisheng
    Lin, Lin
    Cui, Zhiquan
    MATHEMATICS, 2021, 9 (17)
  • [7] Application of the Levenberg-Marquardt method to the training of spiking neural networks
    Silva, Sergio M.
    Ruano, Antonio E.
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 3978 - +
  • [8] Application of Levenberg-Marquardt method to the training of spiking neural networks
    Silva, SM
    Ruano, AE
    PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : 1354 - 1358
  • [9] Neural Network Training With Levenberg-Marquardt and Adaptable Weight Compression
    Smith, James S.
    Wu, Bo
    Wilamowski, Bogdan M.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (02) : 580 - 587
  • [10] Neighborhood based Levenberg-Marquardt algorithm for neural network training
    Lera, G
    Pinzolas, M
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (05): : 1200 - 1203