Generalized Conditional Gradient for Sparse Estimation

被引:0
作者
Yu, Yaoliang [1 ]
Zhang, Xinhua [2 ]
Schuurmans, Dale [3 ]
机构
[1] Univ Waterloo, Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
[2] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
[3] Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
generalized conditional gradient; frank-wolfe; dictionary learning; matrix completion; multi-view learning; sparse estimation; ALGORITHMS; WOLFE; MINIMIZATION; CONVERGENCE; RANK; OPTIMIZATION; SUBGRADIENT; SHRINKAGE; SELECTION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sparsity is an important modeling tool that expands the applicability of convex formulations for data analysis, however it also creates significant challenges for efficient algorithm design. In this paper we investigate the generalized conditional gradient (GCG) algorithm for solving sparse optimization problems-demonstrating that, with some enhancements, it can provide a more efficient alternative to current state of the art approaches. After studying the convergence properties of GCG for general convex composite problems, we develop efficient methods for evaluating polar operators, a subroutine that is required in each GCG iteration. In particular, we show how the polar operator can be efficiently evaluated in learning low-rank matrices, instantiated with detailed examples on matrix completion and dictionary learning. A further improvement is achieved by interleaving GCG with fixed-rank local subspace optimization. A series of experiments on matrix completion, multi-class classification, and multi-view dictionary learning shows that the proposed method can significantly reduce the training cost of current alternatives.
引用
收藏
页数:46
相关论文
共 72 条
  • [1] Good Practice in Large-Scale Learning for Image Classification
    Akata, Zeynep
    Perronnin, Florent
    Harchaoui, Zaid
    Schmid, Cordelia
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (03) : 507 - 520
  • [2] [Anonymous], 2009, P INT C MACH LEARN
  • [3] [Anonymous], 2012, P KDD CUP 2011
  • [4] [Anonymous], 2006, Journal of the Royal Statistical Society, Series B
  • [5] [Anonymous], 00694765 HAL
  • [6] [Anonymous], P INT C MACH LEARN
  • [7] [Anonymous], 2013, PROC ICML
  • [8] Convex multi-task feature learning
    Argyriou, Andreas
    Evgeniou, Theodoros
    Pontil, Massimiliano
    [J]. MACHINE LEARNING, 2008, 73 (03) : 243 - 272
  • [9] Bach F., 2008, Convex sparse matrix factorizations
  • [10] BACH FR, 2006, 688 U CAL DEP STAT