NONPENALIZED VARIABLE SELECTION IN HIGH-DIMENSIONAL LINEAR MODEL SETTINGS VIA GENERALIZED FIDUCIAL INFERENCE

Cited by: 9

Authors:

Williams, Jonathan P. [1]

Hannig, Jan [1]

Affiliations:

[1] Univ N Carolina, Dept Stat & Operat Res, Chapel Hill, NC 27599 USA

Source:

ANNALS OF STATISTICS | 2019 / Vol. 47 / No. 03

Funding:

National Science Foundation (USA);

Keywords:

Best subset selection; high-dimensional regression; L-0 minimization; feature selection; REGRESSION;

DOI:

10.1214/18-AOS1733

Chinese Library Classification (CLC) codes:

O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];

Subject classification codes:

020208 ; 070103 ; 0714 ;

Abstract:

Standard penalized methods of variable selection and parameter estimation rely on the magnitude of coefficient estimates to decide which variables to include in the final model. However, coefficient estimates are unreliable when the design matrix is collinear. To overcome this challenge, an entirely new perspective on variable selection is presented within a generalized fiducial inference framework. This new procedure is able to effectively account for linear dependencies among subsets of covariates in a high-dimensional setting where p can grow almost exponentially in n, as well as in the classical setting where p <= n. It is shown that the procedure very naturally assigns small probabilities to subsets of covariates which include redundancies by way of explicit L-0 minimization. Furthermore, with a typical sparsity assumption, it is shown that the proposed method is consistent in the sense that the probability of the true sparse subset of covariates converges in probability to 1 as n -> infinity, or as n -> infinity and p -> infinity. Very reasonable conditions are needed, and little restriction is placed on the class of possible subsets of covariates to achieve this consistency result.
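
The notion of a subset of covariates that "includes redundancies" can be illustrated with a small toy example. The sketch below is not the authors' generalized fiducial procedure and computes no probabilities. It only flags a subset as redundant when a strictly smaller subset reproduces the same least-squares fitted values, which is the kind of L-0 comparison the abstract alludes to. The simulated design, the tolerance, and the helper names `fitted` and `is_redundant` are all assumptions made for illustration.

```python
from itertools import combinations

import numpy as np

# Hypothetical toy data: column 2 is an exact linear combination of columns 0 and 1.
rng = np.random.default_rng(0)
n = 40
x0, x1, x3 = rng.standard_normal((3, n))
x2 = x0 + x1
X = np.column_stack([x0, x1, x2, x3])
y = x0 - 2.0 * x1 + 0.5 * x3 + 0.1 * rng.standard_normal(n)


def fitted(cols):
    """Least-squares fitted values of y on the given columns of X."""
    if not cols:
        return np.zeros(n)
    A = X[:, list(cols)]
    # rcond discards numerically zero singular values of rank-deficient subsets.
    beta, *_ = np.linalg.lstsq(A, y, rcond=1e-10)
    return A @ beta


p = X.shape[1]
# All 2**p subsets, ordered by size; exhaustive search is only feasible for tiny p.
subsets = [s for k in range(p + 1) for s in combinations(range(p), k)]
fits = {s: fitted(s) for s in subsets}


def is_redundant(s, tol=1e-8):
    """True if some strictly smaller subset gives the same fitted values as s."""
    return any(
        len(t) < len(s) and np.linalg.norm(fits[s] - fits[t]) <= tol
        for t in subsets
    )


redundant = [s for s in subsets if is_redundant(s)]
print(redundant)
```

In this toy design only the subsets containing all three linearly dependent columns (0, 1 and 2) are flagged, while (0, 1, 3) and the equivalent three-column subsets (0, 2, 3) and (1, 2, 3) are not. Coefficient magnitudes play no role in the check, which is the point of contrast with penalized methods drawn in the abstract.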


Pages: 1723-1753

Page count: 31

