Adaptive Restart for Accelerated Gradient Schemes

被引:0
作者
Brendan O’Donoghue
Emmanuel Candès
机构
[1] Stanford University,
来源
Foundations of Computational Mathematics | 2015年 / 15卷
关键词
Convex optimization; First order methods; Accelerated gradient schemes; 80M50; 90C06; 90C25;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper we introduce a simple heuristic adaptive restart technique that can dramatically improve the convergence rate of accelerated gradient schemes. The analysis of the technique relies on the observation that these schemes exhibit two modes of behavior depending on how much momentum is applied at each iteration. In what we refer to as the ‘high momentum’ regime the iterates generated by an accelerated gradient scheme exhibit a periodic behavior, where the period is proportional to the square root of the local condition number of the objective function. Separately, it is known that the optimal restart interval is proportional to this same quantity. This suggests a restart technique whereby we reset the momentum whenever we observe periodic behavior. We provide a heuristic analysis that suggests that in many cases adaptively restarting allows us to recover the optimal rate of convergence with no prior knowledge of function parameters.
引用
收藏
页码:715 / 732
页数:17
相关论文
共 24 条
  • [1] Auslender A.(2006)Interior gradient and proximal methods for convex and conic optimization SIAM J. Optim. 16 697-725
  • [2] Teboulle M.(2009)A fast iterative shrinkage-thresholding algorithm for linear inverse problems SIAM J. Imaging Sci. 2 183-202
  • [3] Beck A.(2011)Templates for convex cone problems with applications to sparse signal recovery Math. Program. Comput. 3 165-218
  • [4] Teboulle M.(2006)Stable signal recovery from incomplete and inaccurate measurements Commun. Pure Appl. Math. 59 1207-1223
  • [5] Becker S.(2008)An introduction to compressive sampling IEEE Signal Process. Mag. 25 21-30
  • [6] Candès E.(1998)Nonlinear wavelet image processing: variational problems, compression, and noise removal through wavelet shrinkage IEEE Trans. Image Process. 7 319-335
  • [7] Grant M.(2004)An iterative thresholding algorithm for linear inverse problems with a sparsity constraint Commun. Pure Appl. Math. 57 1413-1457
  • [8] Candès E.(2006)Compressed sensing IEEE Trans. Inf. Theory 52 1289-1306
  • [9] Romberg J.(1952)Methods of conjugate gradients for solving linear systems J. Res. Natl. Bur. Stand. 49 409-436
  • [10] Tao T.(1983)A method of solving a convex programming problem with convergence rate Sov. Math. Dokl. 27 372-376