Adaptive Levenberg-Marquardt Algorithm: A New Optimization Strategy for Levenberg-Marquardt Neural Networks

被引:31
|
作者
Yan, Zhiqi [1 ]
Zhong, Shisheng [1 ]
Lin, Lin [1 ]
Cui, Zhiquan [1 ]
机构
[1] Harbin Inst Technol, Dept Mech Engn, Harbin 150000, Peoples R China
基金
中国国家自然科学基金;
关键词
Levenberg-Marquardt algorithm; convergence; neural networks; local minima; optimization; CONVERGENCE; SYSTEMS; NEURONS;
D O I
10.3390/math9172176
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Engineering data are often highly nonlinear and contain high-frequency noise, so the Levenberg-Marquardt (LM) algorithm may not converge when a neural network optimized by the algorithm is trained with engineering data. In this work, we analyzed the reasons for the LM neural network's poor convergence commonly associated with the LM algorithm. Specifically, the effects of different activation functions such as Sigmoid, Tanh, Rectified Linear Unit (RELU) and Parametric Rectified Linear Unit (PRLU) were evaluated on the general performance of LM neural networks, and special values of LM neural network parameters were found that could make the LM algorithm converge poorly. We proposed an adaptive LM (AdaLM) algorithm to solve the problem of the LM algorithm. The algorithm coordinates the descent direction and the descent step by the iteration number, which can prevent falling into the local minimum value and avoid the influence of the parameter state of LM neural networks. We compared the AdaLM algorithm with the traditional LM algorithm and its variants in terms of accuracy and speed in the context of testing common datasets and aero-engine data, and the results verified the effectiveness of the AdaLM algorithm.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] A New Levenberg-Marquardt Algorithm for feedforward neural networks
    Li, Yanlai
    Wang, Kuanquan
    Li, Tao
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 3516 - 3519
  • [2] A Parallel Levenberg-Marquardt Algorithm
    Cao, Jun
    Novstrup, Krista A.
    Goyal, Ayush
    Midkiff, Samuel R.
    Caruthers, James M.
    ICS'09: PROCEEDINGS OF THE 2009 ACM SIGARCH INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, 2009, : 450 - 459
  • [3] A Levenberg-Marquardt algorithm for unconstrained multicriteria optimization
    Fischer, Andreas
    Shukla, Pradyumn K.
    OPERATIONS RESEARCH LETTERS, 2008, 36 (05) : 643 - 646
  • [4] Optimisation Using Levenberg-Marquardt Algorithm of Neural Networks for Iris
    Sayed, Asim
    Sardeshmukh, M.
    Limkar, Suresh
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2013, 2014, 247 : 91 - 98
  • [5] LOCAL LEVENBERG-MARQUARDT ALGORITHM FOR LEARNING FEEDFORWAD NEURAL NETWORKS
    Bilski, Jaroslaw
    Kowalczyk, Bartosz
    Marchlewska, Alina
    Zurada, Jacek M.
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2020, 10 (04) : 299 - 316
  • [6] Porosity inversion by Caianiello neural networks with Levenberg-Marquardt optimization
    Boateng, Cyril D.
    Fu, Li-Yun
    Yu, Wu
    Guan Xizhu
    INTERPRETATION-A JOURNAL OF SUBSURFACE CHARACTERIZATION, 2017, 5 (03): : SL33 - SL42
  • [7] The Parallel Modification to the Levenberg-Marquardt Algorithm
    Bilski, Jaroslaw
    Kowalczyk, Bartosz
    Grzanek, Konrad
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2018, PT I, 2018, 10841 : 15 - 24
  • [8] Levenberg-Marquardt training for modular networks
    Fun, MH
    Hagan, MT
    ICNN - 1996 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS. 1-4, 1996, : 468 - 473
  • [9] A NEW DAMPING STRATEGY OF LEVENBERG-MARQUARDT ALGORITHM FOR MULTILAYER PERCEPTRONS
    Kwak, Young-tae
    Hwang, Ji-won
    Yoo, Cheol-jung
    NEURAL NETWORK WORLD, 2011, 21 (04) : 327 - 340
  • [10] The application and modeling of the Levenberg-Marquardt algorithm
    Li, Jian-rong
    2010 2ND INTERNATIONAL CONFERENCE ON E-BUSINESS AND INFORMATION SYSTEM SECURITY (EBISS 2010), 2010, : 278 - 280