VARIABLE SELECTION VIA GIBBS SAMPLING

被引:1675
作者
GEORGE, EI [1 ]
MCCULLOCH, RE [1 ]
机构
[1] UNIV CHICAGO,GRAD SCH BUSINESS,CHICAGO,IL 60637
关键词
DATA AUGMENTATION; HIERARCHICAL BAYES; LATENT VARIABLES; MIXTURE; MULTIPLE REGRESSION;
D O I
10.2307/2290777
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A crucial problem in building a multiple regression model is the selection of predictors to include. The main thrust of this article is to propose and develop a procedure that uses probabilistic considerations for selecting promising subsets. This procedure entails embedding the regression setup in a hierarchical normal mixture model where latent variables are used to identify subset choices. In this framework the promising subsets of predictors can be identified as those with higher posterior probability. The computational burden is then alleviated by using the Gibbs sampler to indirectly sample from this multinomial posterior distribution on the set of possible subset choices. Those subsets with higher probability-the promising ones-can then be identified by their more frequent appearance in the Gibbs sample.
引用
收藏
页码:881 / 889
页数:9
相关论文
共 21 条
[1]  
[Anonymous], 1991, STAT COMPUT
[2]   EXPLAINING THE GIBBS SAMPLER [J].
CASELLA, G ;
GEORGE, EI .
AMERICAN STATISTICIAN, 1992, 46 (03) :167-174
[3]  
DIEBOLT J, IN PRESS J ROYAL S B
[4]  
DRAPER NR, 1981, APPLIED REGRESSION A
[5]   SAMPLING-BASED APPROACHES TO CALCULATING MARGINAL DENSITIES [J].
GELFAND, AE ;
SMITH, AFM .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1990, 85 (410) :398-409
[6]   ILLUSTRATION OF BAYESIAN-INFERENCE IN NORMAL DATA MODELS USING GIBBS SAMPLING [J].
GELFAND, AE ;
HILLS, SE ;
RACINEPOON, A ;
SMITH, AFM .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1990, 85 (412) :972-985
[7]  
Lempers F.B., 1971, POSTERIOR PROBABILIT
[8]  
MADIGAN D, 1991, 213 U WASH DEP STAT
[9]  
Miller A., 1990, SUBSET SELECTION REG
[10]  
MITCHELL TJ, 1988, J AM STAT ASSOC, V83, P1023, DOI 10.2307/2290129