Nonmonotone Levenberg-Marquardt training of recurrent neural architectures for processing symbolic sequences

被引：7

作者：

Peng, Chun-Cheng ^{[1
]}

Magoulas, George D. ^{[1
]}

机构：

[1] Univ London, Birkbeck Coll, Dept Comp Sci & Informat Syst, London WC1E 7HX, England

来源：

NEURAL COMPUTING & APPLICATIONS | 2011年 / 20卷 / 06期

关键词：

Levenberg-Marquardt methods; Nonmonotone learning; Recurrent neural networks; ALGORITHM; CONVERGENCE; NETWORKS;

D O I：

10.1007/s00521-010-0493-2

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present nonmonotone variants of the Levenberg-Marquardt (LM) method for training recurrent neural networks (RNNs). These methods inherit the benefits of previously developed LM with momentum algorithms and are equipped with nonmonotone criteria, allowing temporal increase in training errors, and an adaptive scheme for tuning the size of the nonmonotone slide window. The proposed algorithms are applied to training RNNs of various sizes and architectures in symbolic sequence-processing problems. Experiments show that the proposed nonmonotone learning algorithms train more effectively RNNs for sequence processing than the original monotone methods.

引用

页码：897 / 908

页数：12

共 28 条

[21] Levenberg-Marquardt deep neural watermarking for 3D mesh using nearest centroid salient point learning
Narendra, Modigari
Valarmathi, M. L.
Anbarasi, L. Jani
Gandomi, Amir H.
SCIENTIFIC REPORTS, 2024, 14 (01)
[22] Enhanced seagull optimization for enhanced accuracy in CUDA-accelerated Levenberg-Marquardt backpropagation neural networks for earthquake forecasting
Kollam, Manoj
Joshi, Ajay
FRONTIERS IN BUILT ENVIRONMENT, 2024, 10
[23] ADVANCED ADAPTIVE NONMONOTONE CONJUGATE GRADIENT TRAINING ALGORITHM FOR RECURRENT NEURAL NETWORKS
Peng, Chun-Cheng
Magoulas, George D.
INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2008, 17 (05) : 963 - 984
[24] Nonmonotone BFGS-trained recurrent neural networks for temporal sequence processing
Peng, Chun-Cheng
Magoulas, George D.
APPLIED MATHEMATICS AND COMPUTATION, 2011, 217 (12) : 5421 - 5441
[25] Chemical reaction impact on Buongiorno model for trihybrid nanofluid blood flow in a squeezed porous channel using the Levenberg-Marquardt neural network algorithm
Jawad, Muhammad
Khan, Waris
Fu, Zhuojia
Ali, Mehboob
Khan, Waqar Azeem
Birkea, Fathea M. O.
Oroud, Yazan
RESULTS IN ENGINEERING, 2025, 25
[26] Heat transfer enhancement using ternary hybrid nanofluid for cross-viscosity model with intelligent Levenberg-Marquardt neural networks approach incorporating entropy generation
Akbar, Noreen Sher
Zamir, Tayyab
Noor, Tayyaba
Muhammad, Taseer
Ali, Mohamed R.
CASE STUDIES IN THERMAL ENGINEERING, 2024, 63
[27] Riemannian metrics for neural networks II: recurrent networks and learning symbolic data sequences
Ollivier, Yann
INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2015, 4 (02) : 154 - 193
[28] Digital financial asset price fluctuation forecasting in digital economy era using blockchain information: A reconstructed dynamic-bound Levenberg-Marquardt neural-network approach
Shang, Dawei
Yan, Zhiqi
Zhang, Lei
Cui, Zhiquan
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 228

← 1 2 3 →