SLOPE-ADAPTIVE VARIABLE SELECTION VIA CONVEX OPTIMIZATION

被引：185

作者：

Bogdan, Malgorzata ^{[1
]}

van den Berg, Ewout ^{[2
]}

Sabatti, Chiara ^{[3
,4
]}

Su, Weijie ^{[4
]}

Candes, Emmanuel J. ^{[4
,5
]}

机构：

[1] Wroclaw Univ Technol, Dept Math, PL-50370 Wroclaw, Poland

[2] IBM TJ Watson Res Ctr, Human Language Technol, Yorktown Hts, NY 10598 USA

[3] Stanford Univ, Dept Hlth Res & Policy, Div Biostat, Stanford, CA 94305 USA

[4] Stanford Univ, Dept Stat, Stanford, CA 94305 USA

[5] Stanford Univ, Dept Math, Stanford, CA 94305 USA

来源：

ANNALS OF APPLIED STATISTICS | 2015年 / 9卷 / 03期

基金：

欧盟第七框架计划;

关键词：

Sparse regression; variable selection; false discovery rate; Lasso; sorted l(1) penalized estimation (SLOPE); FALSE DISCOVERY RATE; CONFIDENCE-INTERVALS; REGRESSION SHRINKAGE; INFLATION CRITERION; SPARSITY; OPTIMALITY; ALGORITHM;

D O I：

10.1214/15-AOAS842

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

We introduce a new estimator for the vector of coefficients beta in the linear model y = X beta + z, where X has dimensions n x p with p possibly larger than n. SLOPE, short for Sorted L-One Penalized Estimation, is the solution to min(b is an element of Rp) 1/2 parallel to y-Xb parallel to(2)(l2)+lambda(1)vertical bar b vertical bar((1))+lambda(2)vertical bar b vertical bar((1))+...+lambda(p)vertical bar b vertical bar((p)) , where lambda (1)>= lambda(2) >= ... >= lambda(p) >= 0 and vertical bar b vertical bar((1)) >= vertical bar b vertical bar((2)) >= ... >=vertical bar b vertical bar((p)) are the decreasing absolute values of the entries of b. This is a convex program and we demonstrate a solution algorithm whose computational complexity is roughly comparable to that of classical l(1) procedures such as the Lasso. Here, the regularizer is a sorted l(1) norm, which penalizes the regression coefficients according to their rank: the higher the rank-that is, stronger the signal-the larger the penalty. This is similar to the Benjamini and Hochberg [J. Roy. Statist. Soc. Ser. B 57 (1995) 289-300] procedure (BH) which compares more significant p-values with more stringent thresholds. One notable choice of the sequence {lambda(i)} is given by the BH critical values.BH(i) = z(1 -i .q/2p), where q is an element of (0, 1) and z(alpha) is the quantile of a standard normal distribution. SLOPE aims to provide finite sample guarantees on the selected model; of special interest is the false discovery rate (FDR), defined as the expected proportion of irrelevant regressors among all selected predictors. Under orthogonal designs, SLOPE with lambda(BH) provably controls FDR at level q. Moreover, it also appears to have appreciable inferential properties under more general designs X while having substantial power, as demonstrated in a series of experiments running on both simulated and real data.

引用

页码：1103 / 1140

页数：38

共 55 条

[1] Abramovich F., 1995, LECT NOTES STAT, V103, P5
[2] Adapting to unknown sparsity by controlling the false discovery rate
Abramovich, Felix
Benjamini, Yoav
Donoho, David L.
Johnstone, Iain M.
[J]. ANNALS OF STATISTICS, 2006, 34 (02) : 584 - 653
[3] NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION
AKAIKE, H
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) : 716 - 723
[4] [Anonymous], 1972, STAT INFERENCE ORDER
[5] [Anonymous], 2007, GRADIENT METHODS MIN
[6] Bauer P. B. M., 1988, STATISTICS, V19, P39, DOI [DOI 10.1080/02331888808802068, 10.1080/02331888808802068]
[7] A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems
Beck, Amir
Teboulle, Marc
[J]. SIAM JOURNAL ON IMAGING SCIENCES, 2009, 2 (01): : 183 - 202
[8] Templates for convex cone problems with applications to sparse signal recovery
Becker S.R.
Candès E.J.
Grant M.C.
[J]. Mathematical Programming Computation, 2011, 3 (3) : 165 - 218
[9] False discovery rate-adjusted multiple confidence intervals for selected parameters
Benjamini, Y
Yekutieli, D
[J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (469) : 71 - 81
[10] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING
BENJAMINI, Y
HOCHBERG, Y
[J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) : 289 - 300

← 1 2 3 4 5 6 →