Randomness and permutations in coordinate descent methods

被引:8
作者
Gurbuzbalaban, Mert [1 ]
Ozdaglar, Asuman [2 ]
Vanli, Nuri Denizcan [2 ]
Wright, Stephen J. [3 ,4 ]
机构
[1] Rutgers State Univ, Dept Management Sci & Informat Syst, 100 Rockafellar Rd, Piscataway, NJ 08854 USA
[2] MIT, Dept Elect Engn & Comp Sci, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[3] Univ Wisconsin, Dept Comp Sci, 1210 West Dayton St, Madison, WI 53706 USA
[4] Univ Wisconsin, Wisconsin Inst Discovery, 1210 West Dayton St, Madison, WI 53706 USA
关键词
Coordinate descent; Random permutations; Without-replacement sampling; OPTIMIZATION; CONVERGENCE; EFFICIENCY;
D O I
10.1007/s10107-019-01438-4
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We consider coordinate descent (CD) methods with exact line search on convex quadratic problems. Our main focus is to study the performance of the CD method that use random permutations in each epoch and compare it to the performance of the CD methods that use deterministic orders and random sampling with replacement. We focus on a class of convex quadratic problems with a diagonally dominant Hessian matrix, for which we show that using random permutations instead of random with-replacement sampling improves the performance of the CD method in the worst-case. Furthermore, we prove that as the Hessian matrix becomes more diagonally dominant, the performance improvement attained by using random permutations increases. We also show that for this problem class, using any fixed deterministic order yields a superior performance than using random permutations. We present detailed theoretical analyses with respect to three different convergence criteria that are used in the literature and support our theoretical results with numerical experiments.
引用
收藏
页码:349 / 376
页数:28
相关论文
共 33 条
[1]  
[Anonymous], 2015, ARXIV151008560
[2]  
[Anonymous], 2012, STOCHASTIC GRADIENT
[3]  
[Anonymous], 2015, ADV NEURAL INFORM PR
[4]  
[Anonymous], P S LEARN DAT SCI PA
[5]   ON THE CONVERGENCE OF BLOCK COORDINATE DESCENT TYPE METHODS [J].
Beck, Amir ;
Tetruashvili, Luba .
SIAM JOURNAL ON OPTIMIZATION, 2013, 23 (04) :2037-2060
[6]  
Bertsekas D. P., 1999, Nonlinear Programming
[7]  
BERTSEKAS D. P., 1989, Parallel and Distributed Computation: Numerical Methods
[8]  
Bertsekas D.P., 2015, ARXIV150909257 CORR
[9]  
Bertsekas DP, 2015, Convex Optimization Algorithms
[10]  
Gurbuzbalaban M., 2017, ADV NEURAL INFORM PR