Empirical Priors for Prediction in Sparse High-dimensional Linear Regression

Cited by: 0

Authors:

Martin, Ryan [1]

Tang, Yiqi [1]

Affiliations:

[1] North Carolina State Univ, Dept Stat, 2311 Stinson Dr, Raleigh, NC 27695 USA

Source:

JOURNAL OF MACHINE LEARNING RESEARCH | 2020 / Volume 21

Funding:

National Science Foundation (USA);

Keywords:

Bayesian inference; data-dependent prior; model averaging; predictive distribution; uncertainty quantification; POSTERIOR CONCENTRATION; VARIABLE SELECTION; HORSESHOE ESTIMATOR; CONVERGENCE-RATES; MODEL SELECTION; LIKELIHOOD; INFERENCE; LASSO;

DOI:

Not available

Chinese Library Classification number:

TP [Automation technology, computer technology];

Subject classification code:

0812;

Abstract:

In this paper we adopt the familiar sparse, high-dimensional linear regression model and focus on the important but often overlooked task of prediction. In particular, we consider a new empirical Bayes framework that incorporates data in the prior in two ways: one is to center the prior for the non-zero regression coefficients and the other is to provide some additional regularization. We show that, in certain settings, the asymptotic concentration of the proposed empirical Bayes posterior predictive distribution is very fast, and we establish a Bernstein-von Mises theorem which ensures that the derived empirical Bayes prediction intervals achieve the targeted frequentist coverage probability. The empirical prior has a convenient conjugate form, so posterior computations are relatively simple and fast. Finally, our numerical results demonstrate the proposed method's strong finite-sample performance in terms of prediction accuracy, uncertainty quantification, and computation time compared to existing Bayesian methods.
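
To make the "convenient conjugate form" concrete, here is a minimal sketch for a single fixed submodel. It is an illustration under stated assumptions, not the authors' algorithm: the abstract does not give the prior's exact form, so the sketch assumes a normal prior centered at the least-squares estimate with covariance `sigma2 / gamma * (X'X)^{-1}`, a likelihood raised to a power `alpha`, and a known error variance `sigma2`. The function name `conjugate_predictive` and the default values of `alpha` and `gamma` are hypothetical, and the model-averaging step over submodels is omitted.

```python
import numpy as np

def conjugate_predictive(X_S, y, x_new, sigma2=1.0, alpha=0.99, gamma=0.005):
    """Predictive mean and variance for one fixed submodel S.

    Assumptions (not stated in the abstract): the prior is
    N(beta_hat, sigma2 / gamma * (X'X)^{-1}), the likelihood is raised
    to the power alpha, and sigma2 is known.
    """
    XtX = X_S.T @ X_S
    # Least-squares estimate: the data-driven center of the assumed prior.
    beta_hat = np.linalg.solve(XtX, X_S.T @ y)
    # Normal prior times tempered normal likelihood gives a normal posterior
    # centered at beta_hat with covariance sigma2 / (alpha + gamma) * (X'X)^{-1}.
    post_cov = sigma2 / (alpha + gamma) * np.linalg.inv(XtX)
    mean = float(x_new @ beta_hat)
    # Predictive variance = noise variance + uncertainty about x_new' beta.
    var = float(sigma2 + x_new @ post_cov @ x_new)
    return mean, var

# Simulated data, for illustration only.
rng = np.random.default_rng(0)
X_S = rng.normal(size=(50, 3))
y = X_S @ np.array([1.0, -2.0, 0.5]) + rng.normal(size=50)
x_new = np.array([0.2, -0.1, 1.0])

mean, var = conjugate_predictive(X_S, y, x_new)
half_width = 1.96 * np.sqrt(var)
print(f"approx. 95% prediction interval: [{mean - half_width:.3f}, {mean + half_width:.3f}]")
```

Because the assumed prior and the tempered likelihood are both quadratic in the coefficients around the same center, the posterior is available in closed form and no sampling is needed for a fixed submodel, which is one way conjugacy can keep computation fast.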


Pages: 30

50 items in total

[31] A Note on High-Dimensional Linear Regression With Interactions
Hao, Ning
Zhang, Hao Helen
AMERICAN STATISTICIAN, 2017, 71 (04) : 291 - 297
[32] High-Dimensional Sparse Linear Bandits
Hao, Botao
Lattimore, Tor
Wang, Mengdi
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[33] Transfer learning for high-dimensional linear regression: Prediction, estimation and minimax optimality
Li, Sai
Cai, T. Tony
Li, Hongzhe
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2022, 84 (01) : 149 - 173
[34] High-dimensional regression in practice: an empirical study of finite-sample prediction, variable selection and ranking
Wang, Fan
Mukherjee, Sach
Richardson, Sylvia
Hill, Steven M.
STATISTICS AND COMPUTING, 2020, 30 (03) : 697 - 719
[35] Estimation of linear projections of non-sparse coefficients in high-dimensional regression
Azriel, David
Schwartzman, Armin
ELECTRONIC JOURNAL OF STATISTICS, 2020, 14 (01) : 174 - 206
[36] High-Dimensional Classification by Sparse Logistic Regression
Abramovich, Felix
Grinshtein, Vadim
IEEE TRANSACTIONS ON INFORMATION THEORY, 2019, 65 (05) : 3068 - 3079
[37] Robust and sparse learning of varying coefficient models with high-dimensional features
Xiong, Wei
Tian, Maozai
Tang, Manlai
Pan, Han
JOURNAL OF APPLIED STATISTICS, 2023, 50 (16) : 3312 - 3336
[38] Prediction intervals, factor analysis models, and high-dimensional empirical linear prediction
Ding, AA
Hwang, JTG
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (446) : 446 - 455
[39] Stable prediction in high-dimensional linear models
Lin, Bingqing
Wang, Qihua
Zhang, Jun
Pang, Zhen
STATISTICS AND COMPUTING, 2017, 27 (05) : 1401 - 1412
[40] SCAD-PENALIZED REGRESSION IN HIGH-DIMENSIONAL PARTIALLY LINEAR MODELS
Xie, Huiliang
Huang, Jian
ANNALS OF STATISTICS, 2009, 37 (02) : 673 - 696