Hierarchical Shrinkage Priors and Model Fitting for High-dimensional Generalized Linear Models

被引:11
作者
Yi, Nengjun [1 ]
Ma, Shuangge [2 ]
机构
[1] Univ Alabama Birmingham, Birmingham, AL USA
[2] Yale Univ, New Haven, CT 06520 USA
关键词
adaptive lasso; Bayesian inference; generalized linear model; genetic polymorphisms; grouped variables; hierarchical model; high-dimensional data; shrinkage prior; PENALIZED REGRESSION; VARIABLE SELECTION; BAYESIAN LASSO;
D O I
10.1515/1544-6115.1803
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Genetic and other scientific studies routinely generate very many predictor variables, which can be naturally grouped, with predictors in the same groups being highly correlated. It is desirable to incorporate the hierarchical structure of the predictor variables into generalized linear models for simultaneous variable selection and coefficient estimation. We propose two prior distributions: hierarchical Cauchy and double-exponential distributions, on coefficients in generalized linear models. The hierarchical priors include both variable-specific and group-specific tuning parameters, thereby not only adopting different shrinkage for different coefficients and different groups but also providing a way to pool the information within groups. We fit generalized linear models with the proposed hierarchical priors by incorporating flexible expectation-maximization (EM) algorithms into the standard iteratively weighted least squares as implemented in the general statistical package R. The methods are illustrated with data from an experiment to identify genetic polymorphisms for survival of mice following infection with Listeria monocytogenes. The performance of the proposed procedures is further assessed via simulation studies. The methods are implemented in a freely available R package BhGLM (http://www.ssg.uab.edu/bhglm/).
引用
收藏
页数:24
相关论文
共 25 条
[1]  
[Anonymous], 2003, Bayesian Data Analysis
[2]  
[Anonymous], ARXIV10092300
[3]  
[Anonymous], 2006, Journal of the Royal Statistical Society, Series B
[4]  
[Anonymous], 1983, Generalized Linear Models
[5]  
Armagan A., 2010, Biometrika
[6]   Multigenic control of Listeria monocytogenes susceptibility in mice [J].
Boyartchuk, VL ;
Broman, KW ;
Mosher, RE ;
D'Orazio, SEF ;
Starnbach, MN ;
Dietrich, W .
NATURE GENETICS, 2001, 27 (03) :259-260
[7]  
Carvalho C., 2009, JMLR W CP, V5
[8]   The horseshoe estimator for sparse signals [J].
Carvalho, Carlos M. ;
Polson, Nicholas G. ;
Scott, James G. .
BIOMETRIKA, 2010, 97 (02) :465-480
[9]   Adaptive sparseness for supervised learning [J].
Figueiredo, MAT .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (09) :1150-1159
[10]   Prior distributions for variance parameters in hierarchical models(Comment on an Article by Browne and Draper) [J].
Gelman, Andrew .
BAYESIAN ANALYSIS, 2006, 1 (03) :515-533