Stochastic Methods for l1-regularized Loss Minimization

被引:0
作者
Shalev-Shwartz, Shai [1 ]
Tewari, Ambuj [2 ]
机构
[1] Hebrew Univ Jerusalem, Sch Comp Sci & Engn, IL-91904 Jerusalem, Israel
[2] Univ Texas Austin, Dept Comp Sci, Austin, TX 78701 USA
关键词
L1; regularization; optimization; coordinate descent; mirror descent; sparsity; GRADIENT DESCENT; COORDINATE; CONVERGENCE; REGRESSION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We describe and analyze two stochastic methods for l(1) regularized loss minimization problems, such as the Lasso. The first method updates the weight of a single feature at each iteration while the second method updates the entire weight vector but only uses a single training example at each iteration. In both methods, the choice of feature or example is uniformly at random. Our theoretical runtime analysis suggests that the stochastic methods should outperform state-of-the-art deterministic approaches, including their deterministic counterparts, when the size of the problem is large. We demonstrate the advantage of stochastic methods by experimenting with synthetic and natural data sets.(1)
引用
收藏
页码:1865 / 1892
页数:28
相关论文
共 35 条
  • [1] [Anonymous], 2008, P 25 INT C MACHINE L, DOI [10.1145/1390156.1390273, DOI 10.1145/1390156.1390273]
  • [2] [Anonymous], 2008, Advances in Neural Information Processing Systems, DOI DOI 10.7751/mitpress/8996.003.0015
  • [3] [Anonymous], 2009, ADV NEURAL INFORM PR
  • [4] Mirror descent and nonlinear projected subgradient methods for convex optimization
    Beck, A
    Teboulle, M
    [J]. OPERATIONS RESEARCH LETTERS, 2003, 31 (03) : 167 - 175
  • [5] Bottou L., STOCHASTIC GRADIENT
  • [6] On-line learning for very large data sets
    Bottou, U
    Le Cun, Y
    [J]. APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2005, 21 (02) : 137 - 151
  • [7] Christianini N., 2000, INTRO SUPPORT VECTOR, P189
  • [8] Clarkson KL, 2008, PROCEEDINGS OF THE NINETEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P922
  • [9] Duchi J., 2008, P 25 INT C MACH LEAR, P272, DOI DOI 10.1145/1390156.1390191
  • [10] Duchi JC., 2010, COLT, P14