Sparsified simultaneous confidence intervals for high-dimensional linear models

被引:0
作者
Zhu, Xiaorui [1 ]
Qin, Yichen [2 ]
Wang, Peng [2 ]
机构
[1] Towson Univ, Dept Business Analyt & Technol Management, Towson, MD 21252 USA
[2] Univ Cincinnati, Dept Operat Business Analyt & Informat Syst, Cincinnati, OH USA
关键词
High-dimensional inference; Model confidence bounds; Selection uncertainty; Simultaneous confidence intervals; POST-SELECTION INFERENCE; TRANSCRIPTION FACTORS; VARIABLE-SELECTION; CELL-CYCLE; LONGITUDINAL DATA; EXPRESSION; LASSO; IDENTIFICATION; GENES;
D O I
10.1007/s00184-024-00975-z
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Statistical inference of the high-dimensional regression coefficients is challenging because the uncertainty introduced by the model selection procedure is hard to account for. Currently, the inference of the model and the inference of the coefficients are separately sought. A critical question remains unsettled; that is, is it possible to embed the inference of the model into the simultaneous inference of the coefficients? If so, then how to properly design a simultaneous inference tool with desired properties? To this end, we propose a notion of simultaneous confidence intervals called the sparsified simultaneous confidence intervals (SSCI). Our intervals are sparse in the sense that some of the intervals' upper and lower bounds are shrunken to zero (i.e., [0, 0]), indicating the unimportance of the corresponding covariates. These covariates should be excluded from the final model. The rest of the intervals, either containing zero (e.g., [-1,1]\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$[-1,1]$$\end{document} or [0, 1]) or not containing zero (e.g., [2, 3]), indicate the plausible and significant covariates, respectively. The SSCI intuitively suggests a lower-bound model with significant covariates only and an upper-bound model with plausible and significant covariates. The proposed method can be coupled with various selection procedures, making it ideal for comparing their uncertainty. For the proposed method, we establish desirable asymptotic properties, develop intuitive graphical tools for visualization, and justify its superior performance through simulation and real data analysis.
引用
收藏
页数:25
相关论文
共 50 条
  • [41] Variable Selection in High-Dimensional Partially Linear Models with Longitudinal Data
    Yang Yiping
    Xue Liugen
    RECENT ADVANCE IN STATISTICS APPLICATION AND RELATED AREAS, VOLS I AND II, 2009, : 661 - 667
  • [42] Adaptive k-class estimation in high-dimensional linear models
    Fan, Qingliang
    Yu, Wanlu
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2021, 50 (12) : 3885 - 3913
  • [43] Cluster feature selection in high-dimensional linear models
    Lin, Bingqing
    Pang, Zhen
    Wang, Qihua
    RANDOM MATRICES-THEORY AND APPLICATIONS, 2018, 7 (01)
  • [44] Shrinkage and Sparse Estimation for High-Dimensional Linear Models
    Asl, M. Noori
    Bevrani, H.
    Belaghi, R. Arabi
    Ahmed, Syed Ejaz
    PROCEEDINGS OF THE THIRTEENTH INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING MANAGEMENT, VOL 1, 2020, 1001 : 147 - 156
  • [45] Learning High-Dimensional Generalized Linear Autoregressive Models
    Hall, Eric C.
    Raskutti, Garvesh
    Willett, Rebecca M.
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2019, 65 (04) : 2401 - 2422
  • [46] TESTABILITY OF HIGH-DIMENSIONAL LINEAR MODELS WITH NONSPARSE STRUCTURES
    Bradic, Jelena
    Fan, Jianqing
    Zhu, Yinchu
    ANNALS OF STATISTICS, 2022, 50 (02) : 615 - 639
  • [47] High-dimensional robust inference for censored linear models
    Huang, Jiayu
    Wu, Yuanshan
    SCIENCE CHINA-MATHEMATICS, 2024, 67 (04) : 891 - 918
  • [48] Debiased lasso after sample splitting for estimation and inference in high-dimensional generalized linear models
    Vazquez, Omar
    Nan, Bin
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2025, 53 (01):
  • [49] High-dimensional inference for linear model with correlated errors
    Yuan, Panxu
    Guo, Xiao
    METRIKA, 2022, 85 (01) : 21 - 52
  • [50] A Model Selection Criterion for High-Dimensional Linear Regression
    Owrang, Arash
    Jansson, Magnus
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2018, 66 (13) : 3436 - 3446