Estimation of Linear Functionals in High-Dimensional Linear Models: From Sparsity to Nonsparsity

被引:3
作者
Zhao, Junlong [1 ]
Zhou, Yang [1 ]
Liu, Yufeng [2 ,3 ]
机构
[1] Beijing Normal Univ, Sch Stat, Beijing, Peoples R China
[2] Univ N Carolina, Dept Stat & Operat Res, Dept Genet, Dept Biostat,Carolina Ctr Genome Sci,Lineberger, Chapel Hill, NC USA
[3] Univ N Carolina, Lineberger Comprehens Canc Ctr, Dept Stat & Operat Res, Dept Genet,Dept Biostat,Carolina Ctr Genome Sci, Chapel Hill, NC 27599 USA
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Correlated predictors; Eigenvalue sparsity; Linear transformation; Prediction; CAUSAL INFERENCE; EFFICIENT ESTIMATION; TIME-SERIES; BIAS; IDENTIFICATION; VARIABLES;
D O I
10.1080/01621459.2023.2206084
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
High-dimensional linear models are commonly used in practice. In many applications, one is interested in linear transformations beta(T)x of regression coefficients beta epsilon R-p, where x is a specific point and is not required to be identically distributed as the training data. One common approach is the plug-in technique which first estimates beta, then plugs the estimator in the linear transformation for prediction. Despite its popularity, estimation of beta canbe difficult for high-dimensionalproblems. Commonly used assumptions in the literature include that the signal of coefficients beta is sparse and predictors are weakly correlated. These assumptions, however, may not be easily verified, and can be violated in practice. When beta is non-sparse or predictors are strongly correlated, estimation of beta can be very difficult. In this article, we propose a novel point wise estimator for linear transformations of beta. This new estimator greatly relaxes the common assumptions for high-dimensional problems, and is adaptive to the degree of sparsity of beta and strength of correlations among the predictors. In particular, beta can be sparse or nonsparse and predictors can be strongly or weakly correlated. The proposed method is simple for implementation. Numerical and theoretical results demonstrate the competitive advantages of the proposed method for a wide range of problems. Supplementary materials for this article are available online.
引用
收藏
页码:1579 / 1591
页数:13
相关论文
共 27 条
  • [1] Estimation of linear projections of non-sparse coefficients in high-dimensional regression
    Azriel, David
    Schwartzman, Armin
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2020, 14 (01): : 174 - 206
  • [2] Benign overfitting in linear regression
    Bartlett, Peter L.
    Long, Philip M.
    Lugosi, Gabor
    Tsigler, Alexander
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (48) : 30063 - 30070
  • [3] Least squares after model selection in high-dimensional sparse models
    Belloni, Alexandre
    Chernozhukov, Victor
    [J]. BERNOULLI, 2013, 19 (02) : 521 - 547
  • [4] SIMULTANEOUS ANALYSIS OF LASSO AND DANTZIG SELECTOR
    Bickel, Peter J.
    Ritov, Ya'acov
    Tsybakov, Alexandre B.
    [J]. ANNALS OF STATISTICS, 2009, 37 (04) : 1705 - 1732
  • [5] Bühlmann P, 2011, SPRINGER SER STAT, P1, DOI 10.1007/978-3-642-20192-9
  • [6] CONFIDENCE INTERVALS FOR HIGH-DIMENSIONAL LINEAR REGRESSION: MINIMAX RATES AND ADAPTIVITY
    Cai, T. Tony
    Guo, Zijian
    [J]. ANNALS OF STATISTICS, 2017, 45 (02) : 615 - 646
  • [7] On the prediction performance of the Lasso
    Dalalyan, Arnak S.
    Hebiri, Mohamed
    Lederer, Johannes
    [J]. BERNOULLI, 2017, 23 (01) : 552 - 581
  • [8] HIGH-DIMENSIONAL ASYMPTOTICS OF PREDICTION: RIDGE REGRESSION AND CLASSIFICATION
    Dobriban, Edgar
    Wager, Stefan
    [J]. ANNALS OF STATISTICS, 2018, 46 (01) : 247 - 279
  • [9] Sure independence screening for ultrahigh dimensional feature space
    Fan, Jianqing
    Lv, Jinchi
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 : 849 - 883
  • [10] Variable selection via nonconcave penalized likelihood and its oracle properties
    Fan, JQ
    Li, RZ
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) : 1348 - 1360