Efficient block-coordinate descent algorithms for the Group Lasso

被引:111
作者
Qin Z. [1 ]
Scheinberg K. [2 ]
Goldfarb D. [1 ]
机构
[1] Department of Industrial Engineering and Operations Research, Columbia University, New York
[2] Department of Industrial and Systems Engineering, Lehigh University, Bethlehem
基金
美国国家科学基金会;
关键词
Block coordinate descent; Group Lasso; Iterative shrinkage thresholding; Line-search; Multiple measurement vector;
D O I
10.1007/s12532-013-0051-x
中图分类号
学科分类号
摘要
We present two algorithms to solve the Group Lasso problem (Yuan and Lin in, J R Stat Soc Ser B (Stat Methodol) 68(1):49-67, 2006). First, we propose a general version of the Block Coordinate Descent (BCD) algorithm for the Group Lasso that employs an efficient approach for optimizing each subproblem exactly. We show that it exhibits excellent performance when the groups are of moderate size. For groups of large size, we propose an extension of ISTA/FISTA SIAM (Beck and Teboulle in, SIAM J Imag Sci 2(1):183-202, 2009) based on variable step-lengths that can be viewed as a simplified version of BCD. By combining the two approaches we obtain an implementation that is very competitive and often outperforms other state-of-the-art approaches for this problem. We show how these methods fit into the globally convergent general block coordinate gradient descent framework in Tseng and Yun (Math Program 117(1):387-423, 2009). We also show that the proposed approach is more efficient in practice than the one implemented in Tseng and Yun (Math Program 117(1):387-423, 2009). In addition, we apply our algorithms to the Multiple Measurement Vector (MMV) recovery problem, which can be viewed as a special case of the Group Lasso problem, and compare their performance to other methods in this particular instance. © 2013 Springer-Verlag Berlin Heidelberg and Mathematical Optimization Society.
引用
收藏
页码:143 / 169
页数:26
相关论文
共 36 条
[1]  
Bach F., Consistency of the group Lasso and multiple kernel learning, J. Mach. Learn. Res., 9, pp. 1179-1225, (2008)
[2]  
Beck A., Teboulle M., A fast iterative shrinkage-thresholding algorithm for linear inverse problems, SIAM J. Imag. Sci., 2, 1, pp. 183-202, (2009)
[3]  
Van Den Berg E., Friedlander M., Joint-sparse Recovery from Multiple Measurements, 904, (2009)
[4]  
Van Den Berg E., Friedlander M., Sparse optimization with least-squares constraints, Tech. Rep.
[5]  
Technical Report TR-2010-02, Department of Computer Science, (2010)
[6]  
Van Den Berg E., Schmidt M., Friedlander M., Murphy K., Group sparsity via linear-time projection, Tech. Rep.
[7]  
Technical Report TR-2008-09, Department of Computer Science, (2008)
[8]  
Candes E., Romberg J., Tao T., Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information, Inform Theory IEEE Trans, 52, 2, pp. 489-509, (2006)
[9]  
Chen J., Huo X., Theoretical results on sparse representations of multiple-measurement vectors, IEEE Trans. Signal Process., 54, (2006)
[10]  
Dolan E., More J., Benchmarking optimization software with performance profiles, Math. Program., 91, 2, pp. 201-213, (2002)