Sparsified simultaneous confidence intervals for high-dimensional linear models

Cited by: 0

Authors:

Zhu, Xiaorui [1]

Qin, Yichen [2]

Wang, Peng [2]

Affiliations:

[1] Towson Univ, Dept Business Analyt & Technol Management, Towson, MD 21252 USA

[2] Univ Cincinnati, Dept Operat Business Analyt & Informat Syst, Cincinnati, OH USA

Source:

METRIKA | 2024

Keywords:

High-dimensional inference; Model confidence bounds; Selection uncertainty; Simultaneous confidence intervals; POST-SELECTION INFERENCE; TRANSCRIPTION FACTORS; VARIABLE-SELECTION; CELL-CYCLE; LONGITUDINAL DATA; EXPRESSION; LASSO; IDENTIFICATION; GENES;

DOI:

10.1007/s00184-024-00975-z

Chinese Library Classification (CLC) number:

O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];

Subject classification codes:

020208 ; 070103 ; 0714 ;

Abstract:

Statistical inference of the high-dimensional regression coefficients is challenging because the uncertainty introduced by the model selection procedure is hard to account for. Currently, the inference of the model and the inference of the coefficients are separately sought. A critical question remains unsettled; that is, is it possible to embed the inference of the model into the simultaneous inference of the coefficients? If so, then how to properly design a simultaneous inference tool with desired properties? To this end, we propose a notion of simultaneous confidence intervals called the sparsified simultaneous confidence intervals (SSCI). Our intervals are sparse in the sense that some of the intervals' upper and lower bounds are shrunken to zero (i.e., [0, 0]), indicating the unimportance of the corresponding covariates. These covariates should be excluded from the final model. The rest of the intervals, either containing zero (e.g., [-1, 1] or [0, 1]) or not containing zero (e.g., [2, 3]), indicate the plausible and significant covariates, respectively. The SSCI intuitively suggests a lower-bound model with significant covariates only and an upper-bound model with plausible and significant covariates. The proposed method can be coupled with various selection procedures, making it ideal for comparing their uncertainty. For the proposed method, we establish desirable asymptotic properties, develop intuitive graphical tools for visualization, and justify its superior performance through simulation and real data analysis.
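
The reading rule described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and does not construct the intervals; it only assumes that a set of SSCI-style intervals is already available and shows how they would map to "unimportant", "plausible", and "significant" covariates and to the lower- and upper-bound models. The function names (`classify`, `bound_models`), the covariate names, and the interval values are hypothetical; the values reuse the examples quoted in the abstract.

```python
from typing import Dict, List, Tuple

Interval = Tuple[float, float]


def classify(interval: Interval) -> str:
    """Label one interval following the reading rule stated in the abstract."""
    lower, upper = interval
    if lower == 0.0 and upper == 0.0:
        # Both bounds shrunken to zero: the covariate is deemed unimportant.
        return "unimportant"
    if lower <= 0.0 <= upper:
        # Interval contains zero but is not [0, 0]: plausible covariate.
        return "plausible"
    # Interval excludes zero: significant covariate.
    return "significant"


def bound_models(intervals: Dict[str, Interval]) -> Tuple[List[str], List[str]]:
    """Return (lower-bound model, upper-bound model) as lists of covariate names."""
    labels = {name: classify(iv) for name, iv in intervals.items()}
    lower_model = [n for n, lab in labels.items() if lab == "significant"]
    upper_model = [n for n, lab in labels.items() if lab != "unimportant"]
    return lower_model, upper_model


# Hypothetical intervals, using the example values from the abstract.
example = {
    "x1": (2.0, 3.0),   # excludes zero
    "x2": (-1.0, 1.0),  # contains zero
    "x3": (0.0, 1.0),   # contains zero at the boundary
    "x4": (0.0, 0.0),   # shrunken to zero
}
lower_model, upper_model = bound_models(example)
print("lower-bound model:", lower_model)
print("upper-bound model:", upper_model)
```

Under these assumptions, the lower-bound model keeps only the covariates whose intervals exclude zero, while the upper-bound model additionally keeps those whose intervals contain zero without being [0, 0].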


Pages: 25

