Empirical priors for prediction in sparse high-dimensional linear regression

被引:0
|
作者
Martin, Ryan [1 ]
Tang, Yiqi [1 ]
机构
[1] Department of Statistics, North Carolina State University, 2311 Stinson Dr., Raleigh,NC,27695, United States
基金
美国国家科学基金会;
关键词
Inference engines - Numerical methods - Regression analysis - Computation theory - Sampling - Bayesian networks - Probability distributions;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper we adopt the familiar sparse, high-dimensional linear regression model and focus on the important but often overlooked task of prediction. In particular, we consider a new empirical Bayes framework that incorporates data in the prior in two ways: one is to center the prior for the non-zero regression coefficients and the other is to provide some additional regularization. We show that, in certain settings, the asymptotic concentration of the proposed empirical Bayes posterior predictive distribution is very fast, and we establish a Bernstein{von Mises theorem which ensures that the derived empirical Bayes prediction intervals achieve the targeted frequentist coverage probability. The empirical prior has a convenient conjugate form, so posterior computations are relatively simple and fast. Finally, our numerical results demonstrate the proposed method's strong finite-sample performance in terms of prediction accuracy, uncertainty quanti_cation, and computation time compared to existing Bayesian methods. © 2020 Ryan Martin and Yiqi Tang.
引用
收藏
相关论文
共 50 条
  • [31] Benign overfitting of non-sparse high-dimensional linear regression with correlated noise
    Tsuda, Toshiki
    Imaizumi, Masaaki
    ELECTRONIC JOURNAL OF STATISTICS, 2024, 18 (02): : 4119 - 4197
  • [32] Fixed-Size Confidence Regions in High-Dimensional Sparse Linear Regression Models
    Ing, Ching-Kang
    Lai, Tze Leung
    SEQUENTIAL ANALYSIS-DESIGN METHODS AND APPLICATIONS, 2015, 34 (03): : 324 - 335
  • [33] Robust Information Criterion for Model Selection in Sparse High-Dimensional Linear Regression Models
    Gohain, Prakash Borpatra
    Jansson, Magnus
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2023, 71 : 2251 - 2266
  • [34] NEW IMPROVED CRITERION FOR MODEL SELECTION IN SPARSE HIGH-DIMENSIONAL LINEAR REGRESSION MODELS
    Gohain, Prakash B.
    Jansson, Magnus
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 5692 - 5696
  • [35] SPARSE HIGH-DIMENSIONAL LINEAR REGRESSION. ESTIMATING SQUARED ERROR AND A PHASE TRANSITION
    Gamarnik, David
    Zadik, Ilias
    ANNALS OF STATISTICS, 2022, 50 (02): : 880 - 903
  • [36] Sparse Grid Regression for Performance Prediction Using High-Dimensional Run Time Data
    Neumann, Philipp
    EURO-PAR 2019: PARALLEL PROCESSING WORKSHOPS, 2020, 11997 : 601 - 612
  • [37] Multiple outliers detection in sparse high-dimensional regression
    Wang, Tao
    Li, Qun
    Chen, Bin
    Li, Zhonghua
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2018, 88 (01) : 89 - 107
  • [38] Testing Regression Coefficients in High-Dimensional and Sparse Settings
    Xu, Kai
    Tian, Yan
    Cheng, Qing
    ACTA MATHEMATICA SINICA-ENGLISH SERIES, 2021, 37 (10) : 1513 - 1532
  • [39] THE SPARSE LAPLACIAN SHRINKAGE ESTIMATOR FOR HIGH-DIMENSIONAL REGRESSION
    Huang, Jian
    Ma, Shuangge
    Li, Hongzhe
    Zhang, Cun-Hui
    ANNALS OF STATISTICS, 2011, 39 (04): : 2021 - 2046
  • [40] ADMM for High-Dimensional Sparse Penalized Quantile Regression
    Gu, Yuwen
    Fan, Jun
    Kong, Lingchen
    Ma, Shiqian
    Zou, Hui
    TECHNOMETRICS, 2018, 60 (03) : 319 - 331