Efficient computation of the Levenberg-Marquardt algorithm for feedforward networks with linear outputs

被引:0
作者
de Chazal, Philip [1 ]
McDonnell, Mark D. [2 ]
机构
[1] Univ Sydney, Sch Elect & Informat Engn, Fac Engn & Informat Technol, Charles Perkins Ctr, Sydney, NSW 2006, Australia
[2] Univ South Australia, Computat & Theoret Neurosci Lab, Adv Comp Res Ctr, Sch Informat Technol & Math Sci, Mawson Lakes, SA 5095, Australia
来源
2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2016年
基金
澳大利亚研究理事会;
关键词
Levenberg-Marquardt algorithm; feedforward neural networks; Gauss-Newton method; approximate Hessian calculation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An efficient algorithm for the calculation of the approximate Hessian matrix for the Levenberg-Marquardt (LM) optimization algorithm for training a single-hidden-layer feedforward network with linear outputs is presented. The algorithm avoids explicit calculation of the Jacobian matrix and computes the gradient vector and approximate Hessian matrix directly. It requires approximately 1/N the floating point operations of other published algorithms, where N is the number of network outputs. The required memory for the algorithm is also less than 1/N of the memory required for algorithms explicitly computing the Jacobian matrix. We applied our algorithm to two large-scale classification problems - the MNIST and the Forest Cover Type databases. Our results were within 0.5% of the best performance of systems using pixel values as inputs to a feedforward network for the MNIST database. Our results were achieved with a much smaller network than other published results. We achieved state-of-the-art performance for the Forest Cover Type database.
引用
收藏
页码:68 / 75
页数:8
相关论文
共 17 条
  • [1] Bagnall AJ, 2003, IEEE IJCNN, P1802
  • [2] Comparative accuracies of artificial neural networks and discriminant analysis in predicting forest cover types from cartographic variables
    Blackard, JA
    Dean, DJ
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 1999, 24 (03) : 131 - 151
  • [3] de Chazal P, 2015, INT CONF ACOUST SPEE, P2165, DOI 10.1109/ICASSP.2015.7178354
  • [4] Feng SJ, 2012, PROCEEDINGS OF 2012 IEEE 14TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY, P966, DOI 10.1109/ICCT.2012.6511338
  • [5] TRAINING FEEDFORWARD NETWORKS WITH THE MARQUARDT ALGORITHM
    HAGAN, MT
    MENHAJ, MB
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (06): : 989 - 993
  • [6] Extreme Learning Machine for Regression and Multiclass Classification
    Huang, Guang-Bin
    Zhou, Hongming
    Ding, Xiaojian
    Zhang, Rui
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (02): : 513 - 529
  • [7] Jain R., 2008, J INDIAN SOC AGR STA, V62, P75
  • [8] Gradient-based learning applied to document recognition
    Lecun, Y
    Bottou, L
    Bengio, Y
    Haffner, P
    [J]. PROCEEDINGS OF THE IEEE, 1998, 86 (11) : 2278 - 2324
  • [9] Lecun Y., The MNIST Database of Hand- written Digits
  • [10] Levenberg K., 1944, Quarterly of Applied Mathematics, V2, P164, DOI [DOI 10.1090/QAM/10666, 10.1090/QAM/10666]