Adaptive Powerball Stochastic Conjugate Gradient for Large-Scale Learning

Cited by: 3

Authors:

Yang, Zhuang [1]

Affiliations:

[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China

Source:

IEEE TRANSACTIONS ON BIG DATA | 2023, Vol. 9, No. 06

Keywords:

Machine learning algorithms; Sensitivity; Machine learning; Ordinary differential equations; Information retrieval; Robustness; Computational complexity; Adaptive learning rate; conjugate gradient; large-scale learning; powerball function; stochastic optimization; QUASI-NEWTON METHOD;

DOI:

10.1109/TBDATA.2023.3300546

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Stochastic optimization (SO) has been widely reported to be highly successful in large-scale machine learning, information retrieval, bioinformatics, and related fields, especially in recent years. Conjugate gradient (CG) has been gaining popularity as an effective tactic for accelerating SO algorithms. This paper develops a novel class of stochastic conjugate gradient descent (SCG) algorithms from the perspective of the Powerball strategy and the hypergradient descent (HD) technique. The crucial idea behind the resulting methods is inspired by pursuing the equilibrium of ordinary differential equations (ODEs). We elucidate the effect of the Powerball strategy in SCG algorithms. The introduction of HD, on the other hand, lets the resulting methods work with an online learning rate. We also provide a theoretical analysis of the resulting algorithms under non-convex assumptions. As a byproduct, we bridge the gap between the learning rate and powered stochastic optimization (PSO) algorithms, which is still an open problem. Through numerical experiments on numerous benchmark datasets, we test the parameter sensitivity of the proposed methods and demonstrate the superior performance of our new algorithms over state-of-the-art algorithms.
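
The two ingredients named in the abstract can be illustrated with a small sketch. It is not the paper's algorithm: it omits the conjugate gradient direction entirely and only combines the commonly used Powerball transform, sign(z)|z|^gamma applied elementwise to a stochastic gradient, with a hypergradient-style online update of the learning rate. All names (`powerball`, `powered_sgd_hd`, `grad_fn`, `hyper_lr`) and all default values are assumptions made for illustration.

```python
import math
import random


def powerball(z, gamma):
    """Elementwise Powerball transform: sign(v) * |v|**gamma for each v in z."""
    return [math.copysign(abs(v) ** gamma, v) for v in z]


def dot(a, b):
    """Inner product of two equal-length lists."""
    return sum(x * y for x, y in zip(a, b))


def powered_sgd_hd(grad_fn, w, n_samples, lr=0.05, hyper_lr=1e-4,
                   gamma=0.6, steps=500, seed=0):
    """Illustrative powered SGD with a hypergradient learning-rate update.

    grad_fn(w, i) is a hypothetical callback returning the gradient of the
    loss on sample i at the point w, as a list of floats.
    """
    rng = random.Random(seed)
    prev_dir = [0.0] * len(w)  # direction used in the previous update
    for _ in range(steps):
        i = rng.randrange(n_samples)   # draw one sample uniformly at random
        g = grad_fn(w, i)              # stochastic gradient at the current point
        # The previous step was w <- w - lr * prev_dir, so the derivative of
        # the current loss with respect to lr is -dot(g, prev_dir).
        # A gradient step on lr therefore adds hyper_lr * dot(g, prev_dir).
        lr = lr + hyper_lr * dot(g, prev_dir)
        p = powerball(g, gamma)        # powered gradient
        w = [wi - lr * pi for wi, pi in zip(w, p)]
        prev_dir = p
    return w, lr
```

With `gamma = 1` the transform is the identity and the loop reduces to plain SGD with a hypergradient learning rate; values of `gamma` below 1 enlarge small gradient components and shrink large ones. The function returns the final iterate together with the adapted learning rate, so the drift of the learning rate can be inspected.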

Pages: 1598-1606

Number of pages: 9

