Two-Stage Metropolis-Hastings for Tall Data

Cited by: 9
Authors
Payne, Richard D. [1]
Mallick, Bani K. [1]
Affiliations
[1] Texas A&M University, College Station, TX 77843, USA
Keywords
Bayesian inference; Logistic model; Bayesian multivariate adaptive regression splines; Markov chain Monte Carlo; Metropolis-Hastings algorithm; Tall data; Classification; Uncertainty
DOI
10.1007/s00357-018-9248-z
Chinese Library Classification number
O1 [Mathematics]
Discipline classification codes
0701; 070101
Abstract
This paper discusses the challenges that tall data pose for Bayesian classification (specifically binary classification) and reviews existing methods for handling them: parallelizing the likelihood, subsampling, and consensus Monte Carlo. It also proposes a new method based on the two-stage Metropolis-Hastings algorithm, whose purpose is to reduce the cost of evaluating the exact likelihood in the tall data setting. In the first stage, a new proposal is screened using a model based on an approximate likelihood. The posterior computation based on the full likelihood is carried out only if the proposal passes this first-stage screening. Furthermore, the method can be incorporated into the consensus Monte Carlo framework. The two-stage method is applied to logistic regression, hierarchical logistic regression, and Bayesian multivariate adaptive regression splines.
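The two-stage screening described in the abstract can be illustrated with a minimal Python sketch for logistic regression. The abstract does not say how the approximate likelihood is built, so this sketch assumes a fixed random subsample whose log-likelihood is rescaled to the full data size. The second-stage acceptance ratio follows the standard delayed-acceptance form and may differ in detail from the paper. The function names, the Gaussian prior, and the tuning values are all hypothetical.

```python
import numpy as np

def log_lik(beta, X, y):
    """Logistic-regression log-likelihood for labels y in {0, 1}."""
    z = X @ beta
    return np.sum(y * z - np.logaddexp(0.0, z))

def log_prior(beta, scale=10.0):
    """Independent zero-mean Gaussian prior (an assumption of this sketch)."""
    return -0.5 * np.sum((beta / scale) ** 2)

def two_stage_mh(X, y, n_iter=2000, sub_size=200, step=0.05, seed=0):
    rng = np.random.default_rng(seed)
    n, p = X.shape

    # Assumed approximation: one fixed subsample, chosen once, with its
    # log-likelihood rescaled by n / sub_size. Keeping the subsample fixed
    # makes the approximation a deterministic function of beta.
    idx = rng.choice(n, size=sub_size, replace=False)
    Xs, ys = X[idx], y[idx]
    weight = n / sub_size

    def approx_post(b):
        return weight * log_lik(b, Xs, ys) + log_prior(b)

    def full_post(b):
        return log_lik(b, X, y) + log_prior(b)

    beta = np.zeros(p)
    cur_approx, cur_full = approx_post(beta), full_post(beta)
    samples = np.empty((n_iter, p))
    n_full = 0  # number of full-likelihood evaluations inside the loop

    for t in range(n_iter):
        # Symmetric random-walk proposal, so proposal densities cancel.
        prop = beta + step * rng.standard_normal(p)
        prop_approx = approx_post(prop)

        # Stage 1: cheap screening with the approximate posterior.
        if np.log(rng.uniform()) < prop_approx - cur_approx:
            # Stage 2: full posterior, corrected for the stage-1 ratio.
            n_full += 1
            prop_full = full_post(prop)
            log_ratio = (prop_full - cur_full) - (prop_approx - cur_approx)
            if np.log(rng.uniform()) < log_ratio:
                beta, cur_approx, cur_full = prop, prop_approx, prop_full

        samples[t] = beta

    return samples, n_full
```

Proposals rejected in stage 1 never touch the full data set, so `n_full` is at most `n_iter`. How much smaller it is depends on the step size and on the quality of the approximation.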
Pages: 29-51
Number of pages: 23