Two-Stage Metropolis-Hastings for Tall Data

被引:0
作者
Richard D. Payne
Bani K. Mallick
机构
[1] 3143 Texas A&M University,Department of Statistics
来源
Journal of Classification | 2018年 / 35卷
关键词
Bayesian inference; Logistic model; Bayesian multivariate adaptive regression splines; Markov chain monte carlo; Metropolis-hastings algorithm; Tall data;
D O I
暂无
中图分类号
学科分类号
摘要
This paper discusses the challenges presented by tall data problems associated with Bayesian classification (specifically binary classification) and the existing methods to handle them. Current methods include parallelizing the likelihood, subsampling, and consensus Monte Carlo. A new method based on the two-stage Metropolis-Hastings algorithm is also proposed. The purpose of this algorithm is to reduce the exact likelihood computational cost in the tall data situation. In the first stage, a new proposal is tested by the approximate likelihood based model. The full likelihood based posterior computation will be conducted only if the proposal passes the first stage screening. Furthermore, this method can be adopted into the consensus Monte Carlo framework. The two-stage method is applied to logistic regression, hierarchical logistic regression, and Bayesian multivariate adaptive regression splines.
引用
收藏
页码:29 / 51
页数:22
相关论文
共 30 条
[1]  
ANDRIEU C(2009)The Pseudo-Marginal Approach for Efficient Monte Carlo Computations The Annals of Statistics 37 697-725
[2]  
ROBERTS GO(2005)Markov Chain Monte Carlo Using an Approximation Journal of Computational and Graphical Statistics 14 795-810
[3]  
CHRISTEN JA(1991)Multivariate Adaptive Regression Splines The Annals of Statistics 19 1-67
[4]  
FOX C(1995)Reversible Jump Markov Chain Monte Carlo Computation and Bayesian Model Determination Biometrika 82 711-732
[5]  
FRIEDMAN JH(2002)A Bayesian Approach to Characterizing Uncertainty in Inverse Problems Using Coarse and Fine-Scale Information Institute of Electrical and Electronics Engineers Transactions on Signal Processing 50 389-399
[6]  
GREEN PJ(2003)Classification with Bayesian MARS Machine Learning 50 159-173
[7]  
HIGDON D(2003)Generalized Nonlinear Modeling with Multivariate Free-Knot Regression Splines Journal of the American Statistical Association 98 352-368
[8]  
LEE H(2015)Spline Regression in the Presence of Categorical Predictors Journal of Applied Econometrics 30 705-717
[9]  
BI Z(2005)Bayesian Classification of Tumours by Using Gene Expression Data Journal of the Royal Statistical Society: Series B (Statistical Methodology) 67 219-234
[10]  
HOLMES C(2014)Bayesian Uncertainty Quantification for Subsurface Inversion Using a Multiscale Hierarchical Model Technometrics 56 381-392