Sparsified simultaneous confidence intervals for high-dimensional linear models

被引:0
|
作者
Zhu, Xiaorui [1 ]
Qin, Yichen [2 ]
Wang, Peng [2 ]
机构
[1] Towson Univ, Dept Business Analyt & Technol Management, Towson, MD 21252 USA
[2] Univ Cincinnati, Dept Operat Business Analyt & Informat Syst, Cincinnati, OH USA
关键词
High-dimensional inference; Model confidence bounds; Selection uncertainty; Simultaneous confidence intervals; POST-SELECTION INFERENCE; TRANSCRIPTION FACTORS; VARIABLE-SELECTION; CELL-CYCLE; LONGITUDINAL DATA; EXPRESSION; LASSO; IDENTIFICATION; GENES;
D O I
10.1007/s00184-024-00975-z
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Statistical inference of the high-dimensional regression coefficients is challenging because the uncertainty introduced by the model selection procedure is hard to account for. Currently, the inference of the model and the inference of the coefficients are separately sought. A critical question remains unsettled; that is, is it possible to embed the inference of the model into the simultaneous inference of the coefficients? If so, then how to properly design a simultaneous inference tool with desired properties? To this end, we propose a notion of simultaneous confidence intervals called the sparsified simultaneous confidence intervals (SSCI). Our intervals are sparse in the sense that some of the intervals' upper and lower bounds are shrunken to zero (i.e., [0, 0]), indicating the unimportance of the corresponding covariates. These covariates should be excluded from the final model. The rest of the intervals, either containing zero (e.g., [-1,1]\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$[-1,1]$$\end{document} or [0, 1]) or not containing zero (e.g., [2, 3]), indicate the plausible and significant covariates, respectively. The SSCI intuitively suggests a lower-bound model with significant covariates only and an upper-bound model with plausible and significant covariates. The proposed method can be coupled with various selection procedures, making it ideal for comparing their uncertainty. For the proposed method, we establish desirable asymptotic properties, develop intuitive graphical tools for visualization, and justify its superior performance through simulation and real data analysis.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] CONFIDENCE INTERVALS FOR HIGH-DIMENSIONAL COX MODELS
    Yu, Yi
    Bradic, Jelena
    Samworth, Richard J.
    STATISTICA SINICA, 2021, 31 (01) : 243 - 267
  • [2] Confidence intervals for high-dimensional partially linear single-index models
    Gueuning, Thomas
    Claeskens, Gerda
    JOURNAL OF MULTIVARIATE ANALYSIS, 2016, 149 : 13 - 29
  • [3] Confidence Intervals and Tests for High-Dimensional Models: A Compact Review
    Buhlmann, Peter
    MODELING AND STOCHASTIC LEARNING FOR FORECASTING IN HIGH DIMENSIONS, 2015, 217 : 21 - 34
  • [4] Simultaneous Inference for High-Dimensional Linear Models
    Zhang, Xianyang
    Cheng, Guang
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (518) : 757 - 768
  • [5] CONFIDENCE INTERVALS FOR HIGH-DIMENSIONAL LINEAR REGRESSION: MINIMAX RATES AND ADAPTIVITY
    Cai, T. Tony
    Guo, Zijian
    ANNALS OF STATISTICS, 2017, 45 (02): : 615 - 646
  • [6] Confidence intervals for low dimensional parameters in high dimensional linear models
    Zhang, Cun-Hui
    Zhang, Stephanie S.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2014, 76 (01) : 217 - 242
  • [7] On multiple contrast tests and simultaneous confidence intervals in high-dimensional repeated measures designs
    Konietschke, Frank
    Gel, Yulia R.
    Brunner, Edgar
    PERSPECTIVES ON BIG DATA ANALYSIS: METHODOLOGIES AND APPLICATIONS, 2014, 622 : 109 - +
  • [8] Confidence intervals and hypothesis testing for high-dimensional regression
    Javanmard, Adel
    Montanari, Andrea
    Journal of Machine Learning Research, 2014, 15 : 2869 - 2909
  • [9] Confidence intervals for high-dimensional inverse covariance estimation
    Jankova, Jana
    van de Geer, Sara
    ELECTRONIC JOURNAL OF STATISTICS, 2015, 9 (01): : 1205 - 1229
  • [10] Confidence Intervals and Hypothesis Testing for High-Dimensional Regression
    Javanmard, Adel
    Montanari, Andrea
    JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 2869 - 2909