Confidence Intervals and Hypothesis Testing for High-Dimensional Regression

被引:0
作者
Javanmard, Adel [1 ]
Montanari, Andrea [1 ,2 ]
机构
[1] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
hypothesis testing; confidence intervals; LASSO; high-dimensional models; bias of an estimator; VARIABLE SELECTION; MODEL SELECTION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fitting high-dimensional statistical models often requires the use of non-linear parameter estimation procedures. As a consequence, it is generally impossible to obtain an exact characterization of the probability distribution of the parameter estimates. This in turn implies that it is extremely challenging to quantify the uncertainty associated with a certain parameter estimate. Concretely, no commonly accepted procedure exists for computing classical measures of uncertainty and statistical significance as confidence intervals or p-values for these models. We consider here high-dimensional linear regression problem, and propose an efficient algorithm for constructing confidence intervals and p-values. The resulting confidence intervals have nearly optimal size. When testing for the null hypothesis that a certain parameter is vanishing, our method has nearly optimal power. Our approach is based on constructing a 'de-biased' version of regularized M-estimators. The new construction improves over recent work in the field in that it does not assume a special structure on the design matrix. We test our method on synthetic data and a highthroughput genomic data set about riboflavin production rate, made publicly available by Biihlmann et al. (2014).
引用
收藏
页码:2869 / 2909
页数:41
相关论文
共 50 条
[1]  
Adel Javanmard, 2013, ADV NEURAL INFORM PR, P1187
[2]  
[Anonymous], ARXIV13115274
[3]  
[Anonymous], 2013, Advances in Neural Information Processing Systems
[4]  
Belloni A., 2013, CEMMAP WORKING PAPER, DOI DOI 10.1920/WP.CEM.2013.6713
[5]  
Belloni A., 2014, ARXIV12010224
[6]   Least squares after model selection in high-dimensional sparse models [J].
Belloni, Alexandre ;
Chernozhukov, Victor .
BERNOULLI, 2013, 19 (02) :521-547
[7]   SIMULTANEOUS ANALYSIS OF LASSO AND DANTZIG SELECTOR [J].
Bickel, Peter J. ;
Ritov, Ya'acov ;
Tsybakov, Alexandre B. .
ANNALS OF STATISTICS, 2009, 37 (04) :1705-1732
[8]   High-Dimensional Statistics with a View Toward Applications in Biology [J].
Buehlmann, Peter ;
Kalisch, Markus ;
Meier, Lukas .
ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 1, 2014, 1 :255-U809
[9]   Statistical significance in high-dimensional linear models [J].
Buehlmann, Peter .
BERNOULLI, 2013, 19 (04) :1212-1242
[10]  
Bühlmann P, 2011, SPRINGER SER STAT, P1, DOI 10.1007/978-3-642-20192-9