Accelerating Greedy Coordinate Descent Methods

被引:0
作者
Lu, Haihao [1 ,2 ]
Freund, Robert M. [3 ]
Mirrokni, Vahab [4 ]
机构
[1] MIT, Dept Math, Cambridge, MA 02139 USA
[2] MIT, Operat Res Ctr, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[3] MIT, Sloan Sch Management, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[4] Google Res, New York, NY USA
来源
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80 | 2018年 / 80卷
关键词
CONVERGENCE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce and study two algorithms to accelerate greedy coordinate descent in theory and in practice: Accelerated Semi-Greedy Coordinate Descent (ASCD) and Accelerated Greedy Co-ordinate Descent (AGCD). On the theory side, our main results are for ASCD: we show that ASCD achieves O(1/k(2)) convergence, and it also achieves accelerated linear convergence for strongly convex functions. On the empirical side, while both AGCD and ASCD outperform Accelerated Randomized Coordinate Descent on most instances in our numerical experiments, we note that AGCD significantly outperforms the other two methods in our experiments, in spite of a lack of theoretical guarantees for this method. To complement this empirical finding for AGCD, we present an explanation why standard proof techniques for acceleration cannot work for AGCD, and we introduce a technical condition under which AGCD is guaranteed to have accelerated convergence. Finally, we confirm that this technical condition holds in our numerical experiments.
引用
收藏
页数:10
相关论文
共 29 条
[1]  
Allen-Zhu Zeyuan, 2014, ARXIV14071537
[2]  
[Anonymous], 2015, Advances in neural information processing systems
[3]  
[Anonymous], 2016, A Lyapunov Analysis of Momentum Methods in Optimization"
[4]   ON THE CONVERGENCE OF BLOCK COORDINATE DESCENT TYPE METHODS [J].
Beck, Amir ;
Tetruashvili, Luba .
SIAM JOURNAL ON OPTIMIZATION, 2013, 23 (04) :2037-2060
[5]  
Bertsekas D.P., 1997, Parallel and distributed computation: numerical methods
[6]  
Bubeck S., 2015, ARXIV PREPRINT ARXIV
[7]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[8]   ACCELERATED, PARALLEL, AND PROXIMAL COORDINATE DESCENT [J].
Fercoq, Olivier ;
Richtarik, Peter .
SIAM JOURNAL ON OPTIMIZATION, 2015, 25 (04) :1997-2023
[9]  
Frostig R, 2015, PR MACH LEARN RES, V37, P2540
[10]  
Gürbüzbalaban M, 2017, ADV NEUR IN, V30