Nonmonotone Levenberg-Marquardt training of recurrent neural architectures for processing symbolic sequences

被引:7
|
作者
Peng, Chun-Cheng [1 ]
Magoulas, George D. [1 ]
机构
[1] Univ London, Birkbeck Coll, Dept Comp Sci & Informat Syst, London WC1E 7HX, England
关键词
Levenberg-Marquardt methods; Nonmonotone learning; Recurrent neural networks; ALGORITHM; CONVERGENCE; NETWORKS;
D O I
10.1007/s00521-010-0493-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present nonmonotone variants of the Levenberg-Marquardt (LM) method for training recurrent neural networks (RNNs). These methods inherit the benefits of previously developed LM with momentum algorithms and are equipped with nonmonotone criteria, allowing temporal increase in training errors, and an adaptive scheme for tuning the size of the nonmonotone slide window. The proposed algorithms are applied to training RNNs of various sizes and architectures in symbolic sequence-processing problems. Experiments show that the proposed nonmonotone learning algorithms train more effectively RNNs for sequence processing than the original monotone methods.
引用
收藏
页码:897 / 908
页数:12
相关论文
共 28 条
  • [21] Levenberg-Marquardt deep neural watermarking for 3D mesh using nearest centroid salient point learning
    Narendra, Modigari
    Valarmathi, M. L.
    Anbarasi, L. Jani
    Gandomi, Amir H.
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [22] Enhanced seagull optimization for enhanced accuracy in CUDA-accelerated Levenberg-Marquardt backpropagation neural networks for earthquake forecasting
    Kollam, Manoj
    Joshi, Ajay
    FRONTIERS IN BUILT ENVIRONMENT, 2024, 10
  • [23] ADVANCED ADAPTIVE NONMONOTONE CONJUGATE GRADIENT TRAINING ALGORITHM FOR RECURRENT NEURAL NETWORKS
    Peng, Chun-Cheng
    Magoulas, George D.
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2008, 17 (05) : 963 - 984
  • [24] Nonmonotone BFGS-trained recurrent neural networks for temporal sequence processing
    Peng, Chun-Cheng
    Magoulas, George D.
    APPLIED MATHEMATICS AND COMPUTATION, 2011, 217 (12) : 5421 - 5441
  • [25] Chemical reaction impact on Buongiorno model for trihybrid nanofluid blood flow in a squeezed porous channel using the Levenberg-Marquardt neural network algorithm
    Jawad, Muhammad
    Khan, Waris
    Fu, Zhuojia
    Ali, Mehboob
    Khan, Waqar Azeem
    Birkea, Fathea M. O.
    Oroud, Yazan
    RESULTS IN ENGINEERING, 2025, 25
  • [26] Heat transfer enhancement using ternary hybrid nanofluid for cross-viscosity model with intelligent Levenberg-Marquardt neural networks approach incorporating entropy generation
    Akbar, Noreen Sher
    Zamir, Tayyab
    Noor, Tayyaba
    Muhammad, Taseer
    Ali, Mohamed R.
    CASE STUDIES IN THERMAL ENGINEERING, 2024, 63
  • [27] Riemannian metrics for neural networks II: recurrent networks and learning symbolic data sequences
    Ollivier, Yann
    INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2015, 4 (02) : 154 - 193
  • [28] Digital financial asset price fluctuation forecasting in digital economy era using blockchain information: A reconstructed dynamic-bound Levenberg-Marquardt neural-network approach
    Shang, Dawei
    Yan, Zhiqi
    Zhang, Lei
    Cui, Zhiquan
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 228