Adaptive Powerball Stochastic Conjugate Gradient for Large-Scale Learning

Cited by: 3
Author
Yang, Zhuang [1]
Affiliation
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
Keywords
Machine learning algorithms; Sensitivity; Machine learning; Ordinary differential equations; Information retrieval; Robustness; Computational complexity; Adaptive learning rate; Conjugate gradient; Large-scale learning; Powerball function; Stochastic optimization; Quasi-Newton method
DOI
10.1109/TBDATA.2023.3300546
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The extreme success of stochastic optimization (SO) in large-scale machine learning problems, information retrieval, bioinformatics, etc., has been widely reported, especially in recent years. As an effective tactic, conjugate gradient (CG) has been gaining its popularity in accelerating SO algorithms. This paper develops a novel type of stochastic conjugate gradient descent (SCG) algorithms from the perspective of the Powerball strategy and the hypergradient descent (HD) technique. The crucial idea behind the resulting methods is inspired by pursuing the equilibrium of ordinary differential equations (ODEs). We elucidate the effect of the Powerball strategy in SCG algorithms. The introduction of HD, on the other side, makes the resulting methods work with an online learning rate. Meanwhile, we provide a comprehension of the theoretical results for the resulting algorithms under non-convex assumptions. As a byproduct, we bridge the gap between the learning rate and powered stochastic optimization (PSO) algorithms, which is still an open problem. Resorting to numerical experiments on numerous benchmark datasets, we test the parameter sensitivity of the proposed methods and demonstrate the superior performance of our new algorithms over state-of-the-art algorithms.
Pages: 1598-1606
Page count: 9