Constrained Stochastic Gradient Descent for Large-scale Least Squares Problem

被引:0
|
作者
Mu, Yang [1 ]
Ding, Wei [1 ]
Zhou, Tianyi [2 ]
Tao, Dacheng [2 ]
机构
[1] Univ Massachusetts, 100 Morrissey Blvd, Boston, MA 02125 USA
[2] Univ Technol Sydney, Ultimo, NSW 2007, Australia
关键词
Stochastic optimization; Large-scale least squares; online learning; APPROXIMATION; ALGORITHMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The least squares problem is one of the most important regression problems in statistics, machine learning and data mining. In this paper, we present the Constrained Stochastic Gradient Descent (CSGD) algorithm to solve the large-scale least squares problem. CSGD improves the Stochastic Gradient Descent (SGD) by imposing a provable constraint that the linear regression line passes through the mean point of all the data points. It results in the best regret bound o(logT), and fastest convergence speed among all first order approaches. Empirical studies justify the effectiveness of CSGD by comparing it with SGD and other state-of-the-art approaches. An example is also given to show how to use CSGD to optimize SGD based least squares problems to achieve a better performance.
引用
收藏
页码:883 / 891
页数:9
相关论文
共 50 条
  • [1] Gradient Projection Iterative Sketch for Large-Scale Constrained Least-Squares
    Tang, Junqi
    Golbabaee, Mohammad
    Davies, Mike E.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [2] Large-Scale Machine Learning with Stochastic Gradient Descent
    Bottou, Leon
    COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, : 177 - 186
  • [3] Solving large-scale constrained least-squares problems
    Abdel-Aziz, MR
    El-Alem, MM
    APPLIED MATHEMATICS AND COMPUTATION, 2003, 137 (2-3) : 571 - 587
  • [4] A large-scale stochastic gradient descent algorithm over a graphon
    Chen, Yan
    Li, Tao
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 4806 - 4811
  • [5] Stochastic Gradient Descent for Large-scale Linear Nonparallel SVM
    Tang, Jingjing
    Tian, Yingjie
    Wu, Guoqiang
    Li, Dewei
    2017 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2017), 2017, : 980 - 983
  • [6] On Asymptotic Linear Convergence of Projected Gradient Descent for Constrained Least Squares
    Vu, Trung
    Raich, Raviv
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2022, 70 : 4061 - 4076
  • [7] Large-scale support vector regression with budgeted stochastic gradient descent
    Zongxia Xie
    Yingda Li
    International Journal of Machine Learning and Cybernetics, 2019, 10 : 1529 - 1541
  • [8] Large-scale support vector regression with budgeted stochastic gradient descent
    Xie, Zongxia
    Li, Yingda
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (06) : 1529 - 1541
  • [9] ON THE REGULARIZATION EFFECT OF STOCHASTIC GRADIENT DESCENT APPLIED TO LEAST-SQUARES
    Steinerberger, Stefan
    ELECTRONIC TRANSACTIONS ON NUMERICAL ANALYSIS, 2021, 54 : 610 - 619
  • [10] A unifying analysis of projected gradient descent for lp-constrained least squares
    Bahmani, S.
    Raj, B.
    APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2013, 34 (03) : 366 - 378