The Improved Stochastic Fractional Order Gradient Descent Algorithm

被引:2
|
作者
Yang, Yang [1 ]
Mo, Lipo [1 ,2 ]
Hu, Yusen [1 ]
Long, Fei [3 ]
机构
[1] Beijing Technol & Business Univ, Sch Math & Stat, Beijing 100048, Peoples R China
[2] Beijing Technol & Business Univ, Sch Future Technol, Beijing 100048, Peoples R China
[3] Guizhou Inst Technol, Sch Artificial Intelligence & Elect Engn, Special Key Lab Artificial Intelligence & Intellig, Guiyang 550003, Peoples R China
关键词
machine learning; fractional calculus; stochastic gradient descent; convex optimization; online optimization; NEURAL-NETWORKS;
D O I
10.3390/fractalfract7080631
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
This paper mainly proposes some improved stochastic gradient descent (SGD) algorithms with a fractional order gradient for the online optimization problem. For three scenarios, including standard learning rate, adaptive gradient learning rate, and momentum learning rate, three new SGD algorithms are designed combining a fractional order gradient and it is shown that the corresponding regret functions are convergent at a sub-linear rate. Then we discuss the impact of the fractional order on the convergence and monotonicity and prove that the better performance can be obtained by adjusting the order of the fractional gradient. Finally, several practical examples are given to verify the superiority and validity of the proposed algorithm.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Learning Stochastic Optimal Policies via Gradient Descent
    Massaroli, Stefano
    Poli, Michael
    Peluchetti, Stefano
    Park, Jinkyoo
    Yamashita, Atsushi
    Asama, Hajime
    IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 1094 - 1099
  • [42] On Projected Stochastic Gradient Descent Algorithm with Weighted Averaging for Least Squares Regression
    Cohen, Kobi
    Nedic, Angelia
    Srikant, R.
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2314 - 2318
  • [43] GPUSGD: A GPU-accelerated stochastic gradient descent algorithm for matrix factorization
    Jin, Jing
    Lai, Siyan
    Hu, Su
    Lin, Jing
    Lin, Xiaola
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (14) : 3844 - 3865
  • [44] On the discrepancy principle for stochastic gradient descent
    Jahn, Tim
    Jin, Bangti
    INVERSE PROBLEMS, 2020, 36 (09)
  • [45] On the Generalization of Stochastic Gradient Descent with Momentum
    Ramezani-Kebrya, Ali
    Antonakopoulos, Kimon
    Cevher, Volkan
    Khisti, Ashish
    Liang, Ben
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25 : 1 - 56
  • [46] Graph Drawing by Stochastic Gradient Descent
    Zheng, Jonathan X.
    Pawar, Samraat
    Goodman, Dan F. M.
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2019, 25 (09) : 2738 - 2748
  • [47] On the different regimes of stochastic gradient descent
    Sclocchi, Antonio
    Wyart, Matthieu
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 121 (09)
  • [48] A Stochastic Gradient Descent Approach for Stochastic Optimal Control
    Archibald, Richard
    Bao, Feng
    Yong, Jiongmin
    EAST ASIAN JOURNAL ON APPLIED MATHEMATICS, 2020, 10 (04) : 635 - 658
  • [49] Fractional-order global optimal backpropagation machine trained by an improved fractional-order steepest descent method
    Yi-fei Pu
    Jian Wang
    Frontiers of Information Technology & Electronic Engineering, 2020, 21 : 809 - 833
  • [50] The effective noise of stochastic gradient descent
    Mignacco, Francesca
    Urbani, Pierfrancesco
    JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2022, 2022 (08):