Adaptive Levenberg-Marquardt Algorithm: A New Optimization Strategy for Levenberg-Marquardt Neural Networks

被引:32
|
作者
Yan, Zhiqi [1 ]
Zhong, Shisheng [1 ]
Lin, Lin [1 ]
Cui, Zhiquan [1 ]
机构
[1] Harbin Inst Technol, Dept Mech Engn, Harbin 150000, Peoples R China
基金
中国国家自然科学基金;
关键词
Levenberg-Marquardt algorithm; convergence; neural networks; local minima; optimization; CONVERGENCE; SYSTEMS; NEURONS;
D O I
10.3390/math9172176
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Engineering data are often highly nonlinear and contain high-frequency noise, so the Levenberg-Marquardt (LM) algorithm may not converge when a neural network optimized by the algorithm is trained with engineering data. In this work, we analyzed the reasons for the LM neural network's poor convergence commonly associated with the LM algorithm. Specifically, the effects of different activation functions such as Sigmoid, Tanh, Rectified Linear Unit (RELU) and Parametric Rectified Linear Unit (PRLU) were evaluated on the general performance of LM neural networks, and special values of LM neural network parameters were found that could make the LM algorithm converge poorly. We proposed an adaptive LM (AdaLM) algorithm to solve the problem of the LM algorithm. The algorithm coordinates the descent direction and the descent step by the iteration number, which can prevent falling into the local minimum value and avoid the influence of the parameter state of LM neural networks. We compared the AdaLM algorithm with the traditional LM algorithm and its variants in terms of accuracy and speed in the context of testing common datasets and aero-engine data, and the results verified the effectiveness of the AdaLM algorithm.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] An echo state network based on Levenberg-Marquardt algorithm
    Wang, Lei
    Yang, Cuili
    Qiao, Junfei
    Wang, Gongming
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 3899 - 3904
  • [32] A quasi-local Levenberg-Marquardt algorithm for neural network training
    Lera, G
    Pinzolas, M
    IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE, 1998, : 2242 - 2246
  • [33] A LEVENBERG-MARQUARDT METHOD FOR NONSMOOTH REGULARIZED LEAST SQUARES
    Aravkin, Aleksandr y.
    Baraldi, Robert
    Orban, Dominique
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2024, 46 (04) : A2557 - A2581
  • [34] THE REGULARIZING LEVENBERG-MARQUARDT SCHEME IS OF OPTIMAL ORDER
    Hanke, Martin
    JOURNAL OF INTEGRAL EQUATIONS AND APPLICATIONS, 2010, 22 (02) : 259 - 283
  • [35] Optimization of projection matrix between cameras based on Levenberg-Marquardt algorithm
    Zhang, Lei
    Zheng, Shubin
    Chai, Xiaodong
    Xu, Qingxia
    Zi, Anqi
    Journal of Information and Computational Science, 2015, 12 (04): : 1607 - 1614
  • [36] Backpropagation and Levenberg-Marquardt Algorithm for Training Finite Element Neural Network
    Reynaldi, Arnold
    Lukas, Samuel
    Margaretha, Helena
    2012 SIXTH UKSIM/AMSS EUROPEAN SYMPOSIUM ON COMPUTER MODELLING AND SIMULATION (EMS), 2012, : 89 - 94
  • [37] Efficient computation of the Levenberg-Marquardt algorithm for feedforward networks with linear outputs
    de Chazal, Philip
    McDonnell, Mark D.
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 68 - 75
  • [38] A Novel Modification on the Levenberg-Marquardt Algorithm for Avoiding Overfitting in Neural Network Training
    Iplikci, Serdar
    Bilgi, Batuhan
    Menemen, Ali
    Bahtiyar, Bedri
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: DEEP LEARNING, PT II, 2019, 11728 : 201 - 207
  • [39] ???????A New Modified Levenberg-Marquardt Method for Systems of Nonlinear Equations
    Chen, Liang
    Ma, Yanfang
    JOURNAL OF MATHEMATICS, 2023, 2023
  • [40] Nonmonotone Levenberg-Marquardt algorithms and their convergence analysis
    Zhang, JZ
    Chen, LH
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1997, 92 (02) : 393 - 418