Detection boundary in sparse regression

被引:76
作者
Ingster, Yuri I. [1 ]
Tsybakov, Alexandre B. [2 ]
Verzelen, Nicolas [3 ]
机构
[1] St Petersburg State Electrotech Univ, St Petersburg 197376, Russia
[2] CREST, F-92240 Malakoff, France
[3] INRA, UMR MISTEA 729, F-34060 Montpellier, France
来源
ELECTRONIC JOURNAL OF STATISTICS | 2010年 / 4卷
关键词
High-dimensional regression; detection boundary; sparse vectors; sparsity; minimax hypothesis testing; OF-FIT TESTS; HIGHER CRITICISM; FEATURE-SELECTION; DANTZIG SELECTOR; RECOVERY; MIXTURES; LASSO;
D O I
10.1214/10-EJS589
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We study the problem of detection of a p-dimensional sparse vector of parameters in the linear regression model with Gaussian noise. We establish the detection boundary, i.e., the necessary and sufficient conditions for the possibility of successful detection as both the sample size n and the dimension p tend to infinity. Testing procedures that achieve this boundary are also exhibited. Our results encompass the high-dimensional setting (p >> n). The main message is that, under some conditions, the detection boundary phenomenon that has been previously established for the Gaussian sequence model, extends to high-dimensional linear regression. Finally, we establish the detection boundaries when the variance of the noise is unknown. Interestingly, the rate of the detection boundary in high-dimensional setting with unknown variance can be different from the rate for the case of known variance.
引用
收藏
页码:1476 / 1526
页数:51
相关论文
共 31 条
  • [1] [Anonymous], 2003, LECT NOTES STAT
  • [2] [Anonymous], 1985, Ecole d'Ete de Probabilites de Saint-Flour XIII
  • [3] [Anonymous], 1995, OXFORD STUDIES PROBA
  • [4] [Anonymous], 1986, EMPIRICAL PROCESSES
  • [5] ARIAS-CASTRO E., 2010, ARXIV10071434
  • [6] SIMULTANEOUS ANALYSIS OF LASSO AND DANTZIG SELECTOR
    Bickel, Peter J.
    Ritov, Ya'acov
    Tsybakov, Alexandre B.
    [J]. ANNALS OF STATISTICS, 2009, 37 (04) : 1705 - 1732
  • [7] Estimation and confidence sets for sparse normal mixtures
    Cai, T. Tony
    Jin, Jiashun
    Low, Mark G.
    [J]. ANNALS OF STATISTICS, 2007, 35 (06) : 2421 - 2449
  • [8] Candes E, 2007, ANN STAT, V35, P2313, DOI 10.1214/009053606000001523
  • [9] Davidson KR, 2001, HANDBOOK OF THE GEOMETRY OF BANACH SPACES, VOL 1, P317, DOI 10.1016/S1874-5849(01)80010-3
  • [10] Higher criticism for detecting sparse heterogeneous mixtures
    Donoho, D
    Jin, JS
    [J]. ANNALS OF STATISTICS, 2004, 32 (03) : 962 - 994