SMOOTHING PROXIMAL GRADIENT METHOD FOR GENERAL STRUCTURED SPARSE REGRESSION

被引：150

作者：

Chen, Xi ^{[1
]}

Lin, Qihang ^{[2
]}

Kim, Seyoung ^{[1
]}

Carbonell, Jaime G. ^{[1
]}

Xing, Eric P. ^{[1
]}

机构：

[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA

[2] Carnegie Mellon Univ, Tepper Sch Business, Pittsburgh, PA 15213 USA

来源：

ANNALS OF APPLIED STATISTICS | 2012年 / 6卷 / 02期

关键词：

Sparse regression; structured sparsity; smoothing; proximal gradient; optimization; SHRINKAGE; SELECTION;

D O I：

10.1214/11-AOAS514

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

We study the problem of estimating high-dimensional regression models regularized by a structured sparsity-inducing penalty that encodes prior structural information on either the input or output variables. We consider two widely adopted types of penalties of this kind as motivating examples: (1) the general overlapping-group-lasso penalty, generalized from the group-lasso penalty; and (2) the graph-guided-fused-lasso penalty, generalized from the fused-lasso penalty. For both types of penalties, due to their nonseparability and nonsmoothness, developing an efficient optimization method remains a challenging problem. In this paper we propose a general optimization approach, the smoothing proximal gradient (SPG) method, which can solve structured sparse regression problems with any smooth convex loss under a wide spectrum of structured sparsity-inducing penalties. Our approach combines a smoothing technique with an effective proximal gradient method. It achieves a convergence rate significantly faster than the standard first-order methods, subgradient methods, and is much more scalable than the most widely used interior-point methods. The efficiency and scalability of our method are demonstrated on both simulation experiments and real genetic data sets.

引用

页码：719 / 752

页数：34

共 42 条

[1]

ABATE N., 2005, DIABETES, V54, P1027

[2] A haplotype map of the human genome [J].

Altshuler, D ;

Brooks, LD ;

Chakravarti, A ;

Collins, FS ;

Daly, MJ ;

Donnelly, P ;

Gibbs, RA ;

Belmont, JW ;

Boudreau, A ;

Leal, SM ;

Hardenbol, P ;

Pasternak, S ;

Wheeler, DA ;

Willis, TD ;

Yu, FL ;

Yang, HM ;

Zeng, CQ ;

Gao, Y ;

Hu, HR ;

Hu, WT ;

Li, CH ;

Lin, W ;

Liu, SQ ;

Pan, H ;

Tang, XL ;

Wang, J ;

Wang, W ;

Yu, J ;

Zhang, B ;

Zhang, QR ;

Zhao, HB ;

Zhao, H ;

Zhou, J ;

Gabriel, SB ;

Barry, R ;

Blumenstiel, B ;

Camargo, A ;

Defelice, M ;

Faggart, M ;

Goyette, M ;

Gupta, S ;

Moore, J ;

Nguyen, H ;

Onofrio, RC ;

Parkin, M ;

Roy, J ;

Stahl, E ;

Winchester, E ;

Ziaugra, L ;

Shen, Y .

NATURE, 2005, 437 (7063) :1299-1320

[3]

[Anonymous], P INT C MACH LEARN

[4]

[Anonymous], 1999, Athena scientific Belmont

[5]

[Anonymous], 2009, P INT C MACH LEARN

[6]

[Anonymous], 2004, Optimization

[7]

[Anonymous], 2006, Journal of the Royal Statistical Society, Series B

[8]

[Anonymous], 2008, SIAM J OPTIM

[9]

[Anonymous], 2007, GRADIENT METHODS MIN

[10] A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems [J].

Beck, Amir ;

Teboulle, Marc .

SIAM JOURNAL ON IMAGING SCIENCES, 2009, 2 (01) :183-202

← 1 2 3 4 5 →