Rao-Blackwellization for Bayesian Variable Selection and Model Averaging in Linear and Binary Regression: A Novel Data Augmentation Approach

被引:24
作者
Ghosh, Joyee [1 ]
Clyde, Merlise A. [2 ]
机构
[1] Univ Iowa, Dept Stat & Actuarial Sci, Iowa City, IA 52242 USA
[2] Duke Univ, Dept Stat Sci, Durham, NC 27705 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
Gibbs sampling; MCMC; Missing data; Model probability; Model uncertainty; Orthogonal design; Posterior probability; MARGINAL AUGMENTATION; GIBBS SAMPLER; SHRINKAGE; PRIORS; UNCERTAINTY; ALGORITHM; WAVELETS; SCHEMES;
D O I
10.1198/jasa.2011.tm10518
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Choosing the subset of covariates to use in regression or generalized linear models is a ubiquitous problem. The Bayesian paradigm addresses the problem of model uncertainty by considering models corresponding to all possible subsets of the covariates, where the posterior distribution over models is used to select models or combine them via Bayesian model averaging (BMA). Although conceptually straightforward, BMA is often difficult to implement in practice, since either the number of covariates is too large for enumeration of all subsets, calculations cannot be done analytically, or both. For orthogonal designs with the appropriate choice of prior, the posterior probability of any model can be calculated without having to enumerate the entire model space and scales linearly with the number of predictors, p. In this article we extend this idea to a much broader class of nonorthogonal design matrices. We propose a novel method which augments the observed nonorthogonal design by at most p new rows to obtain a design matrix with orthogonal columns and generate the "missing" response variables in a data augmentation algorithm. We show that our data augmentation approach keeps the original posterior distribution of interest unaltered, and develop methods to construct Rao-Blackwellized estimates of several quantities of interest, including posterior model probabilities of any model, which may not be available from an ordinary Gibbs sampler. Our method can be used for BMA in linear regression and binary regression with nonorthogonal design matrices in conjunction with independent "spike and slab" priors with a continuous prior component that is a Cauchy or other heavy tailed distribution that may be represented as a scale mixture of normals. We provide simulated and real examples to illustrate the methodology. Supplemental materials for the manuscript are available online.
引用
收藏
页码:1041 / 1052
页数:12
相关论文
共 55 条
[1]   BAYESIAN-ANALYSIS OF BINARY AND POLYCHOTOMOUS RESPONSE DATA [J].
ALBERT, JH ;
CHIB, S .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (422) :669-679
[2]  
[Anonymous], MONTE CARLO QUASIMON
[3]  
Berger James O, 1998, Sankhya: The Indian Journal of Statistics, Series A, P307
[4]   Posterior model probabilities via path-based pairwise priors [J].
Berger, JO ;
Molina, G .
STATISTICA NEERLANDICA, 2005, 59 (01) :3-15
[5]   The horseshoe estimator for sparse signals [J].
Carvalho, Carlos M. ;
Polson, Nicholas G. ;
Scott, James G. .
BIOMETRIKA, 2010, 97 (02) :465-480
[6]   Adaptive Bayesian wavelet shrinkage [J].
Chipman, HA ;
Kolaczyk, ED ;
McCullogh, RE .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1997, 92 (440) :1413-1421
[7]   Flexible empirical Bayes estimation for wavelets [J].
Clyde, M ;
George, EI .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2000, 62 :681-698
[8]   Prediction via orthogonalized model mixing [J].
Clyde, M ;
Desimone, M ;
Parmigiani, G .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (435) :1197-1208
[9]   Model uncertainty [J].
Clyde, M ;
George, EI .
STATISTICAL SCIENCE, 2004, 19 (01) :81-94
[10]   Multiple shrinkage and subset selection in wavelets [J].
Clyde, M ;
Parmigiani, G ;
Vidakovic, B .
BIOMETRIKA, 1998, 85 (02) :391-401