Adaptive Levenberg-Marquardt Algorithm: A New Optimization Strategy for Levenberg-Marquardt Neural Networks

被引:31
|
作者
Yan, Zhiqi [1 ]
Zhong, Shisheng [1 ]
Lin, Lin [1 ]
Cui, Zhiquan [1 ]
机构
[1] Harbin Inst Technol, Dept Mech Engn, Harbin 150000, Peoples R China
基金
中国国家自然科学基金;
关键词
Levenberg-Marquardt algorithm; convergence; neural networks; local minima; optimization; CONVERGENCE; SYSTEMS; NEURONS;
D O I
10.3390/math9172176
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Engineering data are often highly nonlinear and contain high-frequency noise, so the Levenberg-Marquardt (LM) algorithm may not converge when a neural network optimized by the algorithm is trained with engineering data. In this work, we analyzed the reasons for the LM neural network's poor convergence commonly associated with the LM algorithm. Specifically, the effects of different activation functions such as Sigmoid, Tanh, Rectified Linear Unit (RELU) and Parametric Rectified Linear Unit (PRLU) were evaluated on the general performance of LM neural networks, and special values of LM neural network parameters were found that could make the LM algorithm converge poorly. We proposed an adaptive LM (AdaLM) algorithm to solve the problem of the LM algorithm. The algorithm coordinates the descent direction and the descent step by the iteration number, which can prevent falling into the local minimum value and avoid the influence of the parameter state of LM neural networks. We compared the AdaLM algorithm with the traditional LM algorithm and its variants in terms of accuracy and speed in the context of testing common datasets and aero-engine data, and the results verified the effectiveness of the AdaLM algorithm.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] A New Computational Approach to the Levenberg-Marquardt Learning Algorithm
    Bilski, Jaroslaw
    Kowalczyk, Barosz
    Smolag, Jacek
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2022, PT I, 2023, 13588 : 16 - 26
  • [22] A New Robust Correntropy Based Levenberg-Marquardt Algorithm
    Heravi, Ahmad Reza
    Hodtani, Ghosheh Abed
    2016 IRAN WORKSHOP ON COMMUNICATION AND INFORMATION THEORY (IWCIT), 2016,
  • [23] Distributed localization using Levenberg-Marquardt algorithm
    Shervin Parvini Ahmadi
    Anders Hansson
    Sina Khoshfetrat Pakazad
    EURASIP Journal on Advances in Signal Processing, 2021
  • [24] Damage localization using Levenberg-Marquardt optimization
    Parker, Danny L.
    Frazier, William G.
    Gray, Mathew A.
    DAMAGE ASSESSMENT OF STRUCTURES VII, 2007, 347 : 95 - +
  • [25] Convergence analysis of a subsampled Levenberg-Marquardt algorithm
    Xing, Ganchen
    Gu, Jian
    Xiao, Xiantao
    OPERATIONS RESEARCH LETTERS, 2023, 51 (04) : 379 - 384
  • [26] Panorama Stitching Based on SIFT Algorithm and Levenberg-Marquardt Optimization
    Zhong Min
    Zeng Jiguo
    Xie Xusheng
    2012 INTERNATIONAL CONFERENCE ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING (ICMPBE2012), 2012, 33 : 811 - 818
  • [27] Panorama Stitching Based on SIFT Algorithm and Levenberg-Marquardt Optimization
    Zhong Min
    Zeng Jiguo
    Xie Xusheng
    2010 INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT (CCCM2010), VOL IV, 2010, : 142 - 145
  • [28] An Adaptive inverse-QR Recursive Levenberg-Marquardt Algorithm
    Nakornphanom, Kodchakorn Na
    Sitjongsataporn, Suchada
    2013 13TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT): COMMUNICATION AND INFORMATION TECHNOLOGY FOR NEW LIFE STYLE BEYOND THE CLOUD, 2013, : 535 - 539
  • [29] An Improved Levenberg-Marquardt Algorithm with Adaptive Learning Rate for RBF Neural Network
    An Ru
    Li Wen Jing
    Han Hong Gui
    Qiao Jun Fei
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 3630 - 3635
  • [30] Distributed localization using Levenberg-Marquardt algorithm
    Ahmadi, Shervin Parvini
    Hansson, Anders
    Pakazad, Sina Khoshfetrat
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2021, 2021 (01)