MAMGD: Gradient-Based Optimization Method Using Exponential Decay

Cited by: 1
Authors
Sakovich, Nikita [1]
Aksenov, Dmitry [1]
Pleshakova, Ekaterina [2]
Gataullin, Sergey [2]
Affiliations
[1] Financial Univ Govt Russian Federat, Moscow 109456, Russia
[2] Russian Technol Univ, MIREA, 78 Vernadsky Ave, Moscow 119454, Russia
Keywords
optimization; adaptive gradient methods; deep learning; neural networks; machine learning algorithms; gradient descent; comparative analysis
DOI
10.3390/technologies12090154
Chinese Library Classification (CLC) number
T [Industrial Technology]
Subject classification code
08
Abstract
Optimization methods, in particular gradient-based methods, are a key part of neural network training. In this paper, we propose a new gradient optimization method that uses exponential decay and an adaptive learning rate computed from a discrete second-order derivative of the gradients. The MAMGD optimizer combines an adaptive learning step, exponential smoothing and accumulation of gradients, parameter correction, and discrete analogies from classical mechanics. The experiments covered minimization of multivariate real functions, function approximation with multilayer neural networks, and training of neural networks on popular classification and regression datasets. In these experiments, the new optimizer showed high convergence speed, stability to fluctuations, and an accumulation of gradient accumulators. The research methodology is based on quantitative performance analysis of the algorithm through computational experiments on various optimization problems and comparison with existing methods.
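The abstract names two generic building blocks: exponential smoothing of gradients with a correction term, and a discrete second-order difference of gradients. The sketch below illustrates only those two ideas on a one-dimensional quadratic. It is not the MAMGD update rule, which this record does not state. The function names `ema_gradient_descent` and `second_difference`, the test function f(x) = x^2, and the values of `lr`, `beta`, and `steps` are all assumptions chosen for illustration.

```python
def second_difference(g_prev2, g_prev1, g_curr):
    """Discrete second-order difference of three consecutive gradient values."""
    return g_curr - 2.0 * g_prev1 + g_prev2


def ema_gradient_descent(grad, x0, lr=0.1, beta=0.9, steps=300):
    """Scalar gradient descent driven by a bias-corrected exponential
    moving average (EMA) of the gradients.

    Generic illustration only; hyperparameters are assumed values.
    """
    x = x0
    m = 0.0          # EMA accumulator of gradients
    history = []     # raw gradients, kept so differences can be inspected
    for t in range(1, steps + 1):
        g = grad(x)
        history.append(g)
        m = beta * m + (1.0 - beta) * g      # exponential smoothing
        m_hat = m / (1.0 - beta ** t)        # correct the zero-initialization bias
        x -= lr * m_hat
    return x, history


# Assumed test problem: f(x) = x^2, so grad f(x) = 2x, with minimum at x = 0.
x_final, grads = ema_gradient_descent(lambda x: 2.0 * x, x0=5.0)
print("final x:", x_final)
print("second difference of first three gradients:", second_difference(*grads[:3]))
```

How an optimizer turns the second difference into a step size is a design decision specific to each method, so the sketch only computes the quantity and leaves the learning rate fixed.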
Pages: 20