SLOPE-ADAPTIVE VARIABLE SELECTION VIA CONVEX OPTIMIZATION

被引:185
作者
Bogdan, Malgorzata [1 ]
van den Berg, Ewout [2 ]
Sabatti, Chiara [3 ,4 ]
Su, Weijie [4 ]
Candes, Emmanuel J. [4 ,5 ]
机构
[1] Wroclaw Univ Technol, Dept Math, PL-50370 Wroclaw, Poland
[2] IBM TJ Watson Res Ctr, Human Language Technol, Yorktown Hts, NY 10598 USA
[3] Stanford Univ, Dept Hlth Res & Policy, Div Biostat, Stanford, CA 94305 USA
[4] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[5] Stanford Univ, Dept Math, Stanford, CA 94305 USA
基金
欧盟第七框架计划;
关键词
Sparse regression; variable selection; false discovery rate; Lasso; sorted l(1) penalized estimation (SLOPE); FALSE DISCOVERY RATE; CONFIDENCE-INTERVALS; REGRESSION SHRINKAGE; INFLATION CRITERION; SPARSITY; OPTIMALITY; ALGORITHM;
D O I
10.1214/15-AOAS842
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We introduce a new estimator for the vector of coefficients beta in the linear model y = X beta + z, where X has dimensions n x p with p possibly larger than n. SLOPE, short for Sorted L-One Penalized Estimation, is the solution to min(b is an element of Rp) 1/2 parallel to y-Xb parallel to(2)(l2)+lambda(1)vertical bar b vertical bar((1))+lambda(2)vertical bar b vertical bar((1))+...+lambda(p)vertical bar b vertical bar((p)) , where lambda (1)>= lambda(2) >= ... >= lambda(p) >= 0 and vertical bar b vertical bar((1)) >= vertical bar b vertical bar((2)) >= ... >=vertical bar b vertical bar((p)) are the decreasing absolute values of the entries of b. This is a convex program and we demonstrate a solution algorithm whose computational complexity is roughly comparable to that of classical l(1) procedures such as the Lasso. Here, the regularizer is a sorted l(1) norm, which penalizes the regression coefficients according to their rank: the higher the rank-that is, stronger the signal-the larger the penalty. This is similar to the Benjamini and Hochberg [J. Roy. Statist. Soc. Ser. B 57 (1995) 289-300] procedure (BH) which compares more significant p-values with more stringent thresholds. One notable choice of the sequence {lambda(i)} is given by the BH critical values.BH(i) = z(1 -i .q/2p), where q is an element of (0, 1) and z(alpha) is the quantile of a standard normal distribution. SLOPE aims to provide finite sample guarantees on the selected model; of special interest is the false discovery rate (FDR), defined as the expected proportion of irrelevant regressors among all selected predictors. Under orthogonal designs, SLOPE with lambda(BH) provably controls FDR at level q. Moreover, it also appears to have appreciable inferential properties under more general designs X while having substantial power, as demonstrated in a series of experiments running on both simulated and real data.
引用
收藏
页码:1103 / 1140
页数:38
相关论文
共 55 条
  • [1] Abramovich F., 1995, LECT NOTES STAT, V103, P5
  • [2] Adapting to unknown sparsity by controlling the false discovery rate
    Abramovich, Felix
    Benjamini, Yoav
    Donoho, David L.
    Johnstone, Iain M.
    [J]. ANNALS OF STATISTICS, 2006, 34 (02) : 584 - 653
  • [3] NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION
    AKAIKE, H
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) : 716 - 723
  • [4] [Anonymous], 1972, STAT INFERENCE ORDER
  • [5] [Anonymous], 2007, GRADIENT METHODS MIN
  • [6] Bauer P. B. M., 1988, STATISTICS, V19, P39, DOI [DOI 10.1080/02331888808802068, 10.1080/02331888808802068]
  • [7] A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems
    Beck, Amir
    Teboulle, Marc
    [J]. SIAM JOURNAL ON IMAGING SCIENCES, 2009, 2 (01): : 183 - 202
  • [8] Templates for convex cone problems with applications to sparse signal recovery
    Becker S.R.
    Candès E.J.
    Grant M.C.
    [J]. Mathematical Programming Computation, 2011, 3 (3) : 165 - 218
  • [9] False discovery rate-adjusted multiple confidence intervals for selected parameters
    Benjamini, Y
    Yekutieli, D
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (469) : 71 - 81
  • [10] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING
    BENJAMINI, Y
    HOCHBERG, Y
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) : 289 - 300