Empirical Priors for Prediction in Sparse High-dimensional Linear Regression

被引:0
|
作者
Martin, Ryan [1 ]
Tang, Yiqi [1 ]
机构
[1] North Carolina State Univ, Dept Stat, 2311 Stinson Dr, Raleigh, NC 27695 USA
基金
美国国家科学基金会;
关键词
Bayesian inference; data-dependent prior; model averaging; predictive distribution; uncertainty quantification; POSTERIOR CONCENTRATION; VARIABLE SELECTION; HORSESHOE ESTIMATOR; CONVERGENCE-RATES; MODEL SELECTION; LIKELIHOOD; INFERENCE; LASSO;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we adopt the familiar sparse, high-dimensional linear regression model and focus on the important but often overlooked task of prediction. In particular, we consider a new empirical Bayes framework that incorporates data in the prior in two ways: one is to center the prior for the non-zero regression coefficients and the other is to provide some additional regularization. We show that, in certain settings, the asymptotic concentration of the proposed empirical Bayes posterior predictive distribution is very fast, and we establish a Bernstein-von Mises theorem which ensures that the derived empirical Bayes prediction intervals achieve the targeted frequentist coverage probability. The empirical prior has a convenient conjugate form, so posterior computations are relatively simple and fast. Finally, our numerical results demonstrate the proposed method's strong finite-sample performance in terms of prediction accuracy, uncertainty quantification, and computation time compared to existing Bayesian methods.
引用
收藏
页数:30
相关论文
共 50 条
  • [31] A Note on High-Dimensional Linear Regression With Interactions
    Hao, Ning
    Zhang, Hao Helen
    AMERICAN STATISTICIAN, 2017, 71 (04) : 291 - 297
  • [32] High-Dimensional Sparse Linear Bandits
    Hao, Botao
    Lattimore, Tor
    Wang, Mengdi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [33] Transfer learning for high-dimensional linear regression: Prediction, estimation and minimax optimality
    Li, Sai
    Cai, T. Tony
    Li, Hongzhe
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2022, 84 (01) : 149 - 173
  • [34] High-dimensional regression in practice: an empirical study of finite-sample prediction, variable selection and ranking
    Wang, Fan
    Mukherjee, Sach
    Richardson, Sylvia
    Hill, Steven M.
    STATISTICS AND COMPUTING, 2020, 30 (03) : 697 - 719
  • [35] Estimation of linear projections of non-sparse coefficients in high-dimensional regression
    Azriel, David
    Schwartzman, Armin
    ELECTRONIC JOURNAL OF STATISTICS, 2020, 14 (01): : 174 - 206
  • [36] High-Dimensional Classification by Sparse Logistic Regression
    Abramovich, Felix
    Grinshtein, Vadim
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2019, 65 (05) : 3068 - 3079
  • [37] Robust and sparse learning of varying coefficient models with high-dimensional features
    Xiong, Wei
    Tian, Maozai
    Tang, Manlai
    Pan, Han
    JOURNAL OF APPLIED STATISTICS, 2023, 50 (16) : 3312 - 3336
  • [38] Prediction intervals, factor analysis models, and high-dimensional empirical linear prediction
    Ding, AA
    Hwang, JTG
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (446) : 446 - 455
  • [39] Stable prediction in high-dimensional linear models
    Lin, Bingqing
    Wang, Qihua
    Zhang, Jun
    Pang, Zhen
    STATISTICS AND COMPUTING, 2017, 27 (05) : 1401 - 1412
  • [40] SCAD-PENALIZED REGRESSION IN HIGH-DIMENSIONAL PARTIALLY LINEAR MODELS
    Xie, Huiliang
    Huang, Jian
    ANNALS OF STATISTICS, 2009, 37 (02) : 673 - 696