Adaptive Levenberg-Marquardt Algorithm: A New Optimization Strategy for Levenberg-Marquardt Neural Networks

被引:31
|
作者
Yan, Zhiqi [1 ]
Zhong, Shisheng [1 ]
Lin, Lin [1 ]
Cui, Zhiquan [1 ]
机构
[1] Harbin Inst Technol, Dept Mech Engn, Harbin 150000, Peoples R China
基金
中国国家自然科学基金;
关键词
Levenberg-Marquardt algorithm; convergence; neural networks; local minima; optimization; CONVERGENCE; SYSTEMS; NEURONS;
D O I
10.3390/math9172176
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Engineering data are often highly nonlinear and contain high-frequency noise, so the Levenberg-Marquardt (LM) algorithm may not converge when a neural network optimized by the algorithm is trained with engineering data. In this work, we analyzed the reasons for the LM neural network's poor convergence commonly associated with the LM algorithm. Specifically, the effects of different activation functions such as Sigmoid, Tanh, Rectified Linear Unit (RELU) and Parametric Rectified Linear Unit (PRLU) were evaluated on the general performance of LM neural networks, and special values of LM neural network parameters were found that could make the LM algorithm converge poorly. We proposed an adaptive LM (AdaLM) algorithm to solve the problem of the LM algorithm. The algorithm coordinates the descent direction and the descent step by the iteration number, which can prevent falling into the local minimum value and avoid the influence of the parameter state of LM neural networks. We compared the AdaLM algorithm with the traditional LM algorithm and its variants in terms of accuracy and speed in the context of testing common datasets and aero-engine data, and the results verified the effectiveness of the AdaLM algorithm.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Automatic Image Parameter Optimization Based on Levenberg-Marquardt Algorithm
    Zheng Jinxin
    Du Junping
    ISIE: 2009 IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS, 2009, : 719 - 723
  • [32] Application of the Levenberg-Marquardt method to the training of spiking neural networks
    Silva, Sergio M.
    Ruano, Antonio E.
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 3978 - +
  • [33] Application of Levenberg-Marquardt method to the training of spiking neural networks
    Silva, SM
    Ruano, AE
    PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : 1354 - 1358
  • [34] Neighborhood based Levenberg-Marquardt algorithm for neural network training
    Lera, G
    Pinzolas, M
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (05): : 1200 - 1203
  • [35] Recursive Bayesian Levenberg-Marquardt training of recurrent neural networks
    Mirikitani, Derrick
    Nikolaev, Nikolay
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 282 - 287
  • [36] Adaptive Predistortions Based on Neural Networks Associated with Levenberg-Marquardt Algorithm for Satellite Down Links
    Rafik Zayani
    Ridha Bouallegue
    Daniel Roviras
    EURASIP Journal on Wireless Communications and Networking, 2008
  • [37] On a New Updating Rule of the Levenberg-Marquardt Parameter
    Zhao, Ruixue
    Fan, Jinyan
    JOURNAL OF SCIENTIFIC COMPUTING, 2018, 74 (02) : 1146 - 1162
  • [38] Solar Generation Forecasting by Recurrent Neural Networks optimized by Levenberg-Marquardt Algorithm
    Awan, Shahid M.
    Khan, Zubair A.
    Aslam, Muhammad
    IECON 2018 - 44TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2018, : 276 - 281
  • [39] Fast Computational Approach to the Levenberg-Marquardt Algorithm for Training Feedforward Neural Networks
    Bilski, Jaroslaw
    Smolag, Jacek
    Kowalczyk, Bartosz
    Grzanek, Konrad
    Izonin, Ivan
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2023, 13 (02) : 45 - 61
  • [40] NEURAL NETWORKS PREDICTION FOR SEISMIC RESPONSE OF STRUCTURE UNDER THE LEVENBERG-MARQUARDT ALGORITHM
    徐赵东
    沈亚鹏
    李爱群
    Academic Journal of Xi'an Jiaotong University, 2003, (01) : 15 - 19