Two highly efficient second-order algorithms for training feedforward networks

被引:118
作者
Ampazis, N [1 ]
Perantonis, SJ [1 ]
机构
[1] Natl Ctr Sci Res Demokritos, Inst Informat & Telecommun, GR-15310 Athens, Greece
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2002年 / 13卷 / 05期
关键词
algorithms; multilayer feedforward neural networks; nonlinear least-squares; optimization; supervised learning;
D O I
10.1109/TNN.2002.1031939
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present two highly efficient second-order algorithms for the training of multilayer feedforward neural networks. The algorithms are based on iterations of the form employed in the Levenberg-Marquardt (LM) method for nonlinear least squares problems with the inclusion of an additional adaptive momentum term arising from the formulation of the training task as a constrained optimization problem. Their implementation requires minimal additional computations compared to a standard LM iteration which are compensated, however, from their excellent convergence properties. Simulations to large scale classical neural-network benchmarks are presented which reveal the power of the two methods to obtain solutions in difficult problems, whereas other standard second-order techniques (including LM) fail to converge.
引用
收藏
页码:1064 / 1074
页数:11
相关论文
共 29 条
  • [1] Dynamics of multilayer networks in the vicinity of temporary minima
    Ampazis, N
    Perantonis, SJ
    Taylor, JG
    [J]. NEURAL NETWORKS, 1999, 12 (01) : 43 - 58
  • [2] [Anonymous], 1989, Complex Syst
  • [3] [Anonymous], 2005, NEURAL NETWORKS PATT
  • [4] 1ST-ORDER AND 2ND-ORDER METHODS FOR LEARNING - BETWEEN STEEPEST DESCENT AND NEWTON METHOD
    BATTITI, R
    [J]. NEURAL COMPUTATION, 1992, 4 (02) : 141 - 166
  • [5] Dennis J.E., 1996, NUMERICAL METHODS UN
  • [6] QUASI-NEWTON METHODS, MOTIVATION AND THEORY
    DENNIS, JE
    MORE, JJ
    [J]. SIAM REVIEW, 1977, 19 (01) : 46 - 89
  • [7] FAHLMAN S, P 1988 CONN MOD SUMM, P38
  • [8] Fahlman S. E., 1990, ADV NEURAL INFORMATI, P524, DOI DOI 10.1190/1.1821929
  • [9] COMPUTING OPTIMAL LOCALLY CONSTRAINED STEPS
    GAY, DM
    [J]. SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING, 1981, 2 (02): : 186 - 197
  • [10] GILBRET JC, 1992, SIAM J OPTIMIZ, V2