MAMGD: Gradient-Based Optimization Method Using Exponential Decay

被引:1
作者
Sakovich, Nikita [1 ]
Aksenov, Dmitry [1 ]
Pleshakova, Ekaterina [2 ]
Gataullin, Sergey [2 ]
机构
[1] Financial Univ Govt Russian Federat, Moscow 109456, Russia
[2] Russian Technol Univ, MIREA, 78 Vernadsky Ave, Moscow 119454, Russia
关键词
optimization; adaptive gradient methods; deep learning; neural networks; machine learning algorithms; gradient descent; comparative analysis;
D O I
10.3390/technologies12090154
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Optimization methods, namely, gradient optimization methods, are a key part of neural network training. In this paper, we propose a new gradient optimization method using exponential decay and the adaptive learning rate using a discrete second-order derivative of gradients. The MAMGD optimizer uses an adaptive learning step, exponential smoothing and gradient accumulation, parameter correction, and some discrete analogies from classical mechanics. The experiments included minimization of multivariate real functions, function approximation using multilayer neural networks, and training neural networks on popular classification and regression datasets. The experimental results of the new optimization technology showed a high convergence speed, stability to fluctuations, and an accumulation of gradient accumulators. The research methodology is based on the quantitative performance analysis of the algorithm by conducting computational experiments on various optimization problems and comparing it with existing methods.
引用
收藏
页数:20
相关论文
共 42 条
  • [11] Deep learning-based object detection in augmented reality: A systematic review
    Ghasemi, Yalda
    Jeong, Heejin
    Choi, Sung Ho
    Park, Kyeong-Beom
    Lee, Jae Yeol
    [J]. COMPUTERS IN INDUSTRY, 2022, 139
  • [12] Hinton G., 2012, CITED, V14, P2
  • [13] Deep Learning-Based Subsurface Target Detection From GPR Scans
    Hou, Feifei
    Lei, Wentai
    Li, Shuai
    Xi, Jingchun
    [J]. IEEE SENSORS JOURNAL, 2021, 21 (06) : 8161 - 8171
  • [14] Adversarial Patch Attacks on Deep-Learning-Based Face Recognition Systems Using Generative Adversarial Networks
    Hwang, Ren-Hung
    Lin, Jia-You
    Hsieh, Sun-Ying
    Lin, Hsuan-Yu
    Lin, Chia-Liang
    [J]. SENSORS, 2023, 23 (02)
  • [15] Forecasting of digital financial crimes in Russia based on machine learning methods
    Ivanyuk, Vera
    [J]. JOURNAL OF COMPUTER VIROLOGY AND HACKING TECHNIQUES, 2024, 20 (03) : 349 - 362
  • [16] Small Pests Detection in Field Crops Using Deep Learning Object Detection
    Khalid, Saim
    Oqaibi, Hadi Mohsen
    Aqib, Muhammad
    Hafeez, Yaser
    [J]. SUSTAINABILITY, 2023, 15 (08)
  • [17] Khan W., 2023, Natural Language Processing Journal, V4, DOI [DOI 10.1016/J.NLP.2023.100026, 10.1016/j.nlp.2023.100026]
  • [18] Kim S, 2023, APPL THERM ENG, V223, DOI [10.1016/j.applthermaleng.2023.120038, 10.1021/acsenergylett.3c02307]
  • [19] Kingma D. P., 2014, arXiv, DOI DOI 10.48550/ARXIV.1412.6980
  • [20] Primal-dual gradient methods for searching network equilibria in combined models with nested choice structure and capacity constraints
    Kubentayeva, Meruza
    Yarmoshik, Demyan
    Persiianov, Mikhail
    Kroshnin, Alexey
    Kotliarova, Ekaterina
    Tupitsa, Nazarii
    Pasechnyuk, Dmitry
    Gasnikov, Alexander
    Shvetsov, Vladimir
    Baryshev, Leonid
    Shurupov, Alexey
    [J]. COMPUTATIONAL MANAGEMENT SCIENCE, 2024, 21 (01)