An iterative algorithm for fitting nonconvex penalized generalized linear models with grouped predictors

被引:65
作者
She, Yiyuan [1 ]
机构
[1] Florida State Univ, Dept Stat, Tallahassee, FL 32306 USA
基金
美国国家科学基金会;
关键词
Nonconvex penalties; Group LASSO; Generalized linear models; Spectral analysis; Gene selection; VARIABLE SELECTION; LIKELIHOOD; REGRESSION; REGULARIZATION; SPARSITY; LASSO;
D O I
10.1016/j.csda.2011.11.013
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
High-dimensional data pose challenges in statistical learning and modeling. Sometimes the predictors can be naturally grouped where pursuing the between-group sparsity is desired. Collinearity may occur in real-world high-dimensional applications where the popular l(1) technique suffers from both selection inconsistency and prediction inaccuracy. Moreover, the problems of interest often go beyond Gaussian models. To meet these challenges, nonconvex penalized generalized linear models with grouped predictors are investigated and a simple-to-implement algorithm is proposed for computation. A rigorous theoretical result guarantees its convergence and provides tight preliminary scaling. This framework allows for grouped predictors and nonconvex penalties, including the discrete 10 and the l(0) + l(2)' type penalties. Penalty design and parameter tuning for nonconvex penalties are examined. Applications of super-resolution spectrum estimation in signal processing and cancer classification with joint gene selection in bioinformatics show the performance improvement by nonconvex penalized estimation. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:2976 / 2990
页数:15
相关论文
共 32 条
[1]  
[Anonymous], 2003, Wiley Series in Probability and Statistics
[2]  
[Anonymous], 2010, ARXIV10010736V1
[3]  
[Anonymous], 2006, Journal of the Royal Statistical Society, Series B
[4]   Wavelet methods in statistics: Some recent developments and their applications [J].
Antoniadis, Anestis .
STATISTICS SURVEYS, 2007, 1 :16-55
[5]   Dimension reduction and variable selection in case control studies via regularized likelihood optimization [J].
Buneo, Florentino ;
Barbu, Adrian .
ELECTRONIC JOURNAL OF STATISTICS, 2009, 3 :1257-1287
[6]   Decoding by linear programming [J].
Candes, EJ ;
Tao, T .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2005, 51 (12) :4203-4215
[7]   Extended Bayesian information criteria for model selection with large model spaces [J].
Chen, Jiahua ;
Chen, Zehua .
BIOMETRIKA, 2008, 95 (03) :759-771
[8]   Atomic decomposition by basis pursuit [J].
Chen, SSB ;
Donoho, DL ;
Saunders, MA .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1998, 20 (01) :33-61
[9]   Gene expression profile of adult T-cell acute lymphocytic leukemia identifies distinct subsets of patients with different response to therapy and survival [J].
Chiaretti, S ;
Li, XC ;
Gentleman, R ;
Vitale, A ;
Vignetti, M ;
Mandelli, F ;
Ritz, J ;
Foa, R .
BLOOD, 2004, 103 (07) :2771-2778
[10]   An iterative thresholding algorithm for linear inverse problems with a sparsity constraint [J].
Daubechies, I ;
Defrise, M ;
De Mol, C .
COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, 2004, 57 (11) :1413-1457