Empirical Bayes inference in sparse high-dimensional generalized linear models

被引:0
|
作者
Tang, Yiqi [1 ]
Martin, Ryan [2 ]
机构
[1] Colby Coll, Dept Stat, Waterville, ME 04901 USA
[2] North Carolina State Univ, Dept Stat, Raleigh, NC USA
来源
ELECTRONIC JOURNAL OF STATISTICS | 2024年 / 18卷 / 02期
基金
美国国家科学基金会;
关键词
Data-dependent prior; logistic regression; model selection; Poisson log-linear model; posterior asymptotics; POSTERIOR CONCENTRATION; VARIABLE SELECTION; UNCERTAINTY QUANTIFICATION; HORSESHOE; REGULARIZATION; CONSISTENT; ESTIMATOR; NEEDLES; PRIORS; STRAW;
D O I
10.1214/24-EJS2274
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
High-dimensional linear models have been widely studied, but the developments in high-dimensional generalized linear models, or GLMs, have been slower. In this paper, we propose an empirical or data-driven prior leading to an empirical Bayes posterior distribution which can be used for estimation of and inference on the coefficient vector in a high- dimensional GLM, as well as for variable selection. We prove that our proposed posterior concentrates around the true/sparse coefficient vector at the optimal rate, provide conditions under which the posterior can achieve variable selection consistency, and prove a Bernstein-von Mises theorem that implies asymptotically valid uncertainty quantification. Computation of the proposed empirical Bayes posterior is simple and efficient, and is shown to perform well in simulations compared to existing Bayesian and non-Bayesian methods in terms of estimation and variable selection.
引用
收藏
页码:3212 / 3246
页数:35
相关论文
共 50 条
  • [1] Empirical Bayes posterior concentration in sparse high-dimensional linear models
    Martin, Ryan
    Mess, Raymond
    Walker, Stephen G.
    BERNOULLI, 2017, 23 (03) : 1822 - 1847
  • [2] High-Dimensional Inference for Generalized Linear Models with Hidden Confounding
    Ouyang, Jing
    Tan, Kean Ming
    Xu, Gongjun
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [3] Empirical Bayes estimators for high-dimensional sparse vectors
    Srinath, K. Pavan
    Venkataramanan, Ramji
    INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2020, 9 (01) : 195 - 234
  • [4] Penalized empirical likelihood for high-dimensional generalized linear models
    Chen, Xia
    Mao, Liyue
    STATISTICS AND ITS INTERFACE, 2021, 14 (02) : 83 - 94
  • [5] Sparse Markov Models for High-dimensional Inference
    Ost, Guilherme
    Takahashi, Daniel Y.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [6] Inference for High-Dimensional Sparse Econometric Models
    Belloni, Alexandre
    Chernozhukov, Victor
    Hansen, Christian B.
    ADVANCES IN ECONOMICS AND ECONOMETRICS, VOL III: ECONOMETRICS, 2013, (51): : 245 - 295
  • [7] Efficient sparse high-dimensional linear regression with a partitioned empirical Bayes ECM algorithm
    Mclain, Alexander C.
    Zgodic, Anja
    Bondell, Howard
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2025, 207
  • [8] Online inference in high-dimensional generalized linear models with streaming data
    Luo, Lan
    Han, Ruijian
    Lin, Yuanyuan
    Huang, Jian
    ELECTRONIC JOURNAL OF STATISTICS, 2023, 17 (02): : 3443 - 3471
  • [9] Statistical Inference for High-Dimensional Generalized Linear Models With Binary Outcomes
    Cai, T. Tony
    Guo, Zijian
    Ma, Rong
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (542) : 1319 - 1332
  • [10] Estimation and Inference for High-Dimensional Generalized Linear Models with Knowledge Transfer
    Li, Sai
    Zhang, Linjun
    Cai, T. Tony
    Li, Hongzhe
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (546) : 1274 - 1285