Weakly Informative Prior for Point Estimation of Covariance Matrices in Hierarchical Models

被引:85
作者
Chung, Yeojin [1 ]
Gelman, Andrew [2 ]
Rabe-Hesketh, Sophia [3 ,4 ]
Liu, Jingchen [2 ]
Dorie, Vincent [5 ]
机构
[1] Kookmin Univ, Sch Business Adm, Seoul 136702, South Korea
[2] Columbia Univ, Dept Stat, New York, NY 10027 USA
[3] Univ Calif Berkeley, Grad Sch Educ, Berkeley, CA 94720 USA
[4] Univ Calif Berkeley, Grad Grp Biostat, Berkeley, CA 94720 USA
[5] NYU, Ctr Promot Res Involving Innovat Stat Methodol, New York, NY USA
基金
美国国家科学基金会;
关键词
Bayes modal estimation; penalized likelihood estimation; variance estimation; Heywood case; mixed-effects model; multilevel model; BAYESIAN-ESTIMATION; LIKELIHOOD; COMPONENTS; VARIANCE;
D O I
10.3102/1076998615570945
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
When fitting hierarchical regression models, maximum likelihood (ML) estimation has computational (and, for some users, philosophical) advantages compared to full Bayesian inference, but when the number of groups is small, estimates of the covariance matrix (sigma) of group-level varying coefficients are often degenerate. One can do better, even from a purely point estimation perspective, by using a prior distribution or penalty function. In this article, we use Bayes modal estimation to obtain positive definite covariance matrix estimates. We recommend a class of Wishart (not inverse-Wishart) priors for sigma with a default choice of hyperparameters, that is, the degrees of freedom are set equal to the number of varying coefficients plus 2, and the scale matrix is the identity matrix multiplied by a value that is large relative to the scale of the problem. This prior is equivalent to independent gamma priors for the eigenvalues of sigma with shape parameter 1.5 and rate parameter close to 0. It is also equivalent to independent gamma priors for the variances with the same hyperparameters multiplied by a function of the correlation coefficients. With this default prior, the posterior mode for sigma is always strictly positive definite. Furthermore, the resulting uncertainty for the fixed coefficients is less underestimated than under classical ML or restricted maximum likelihood estimation. We also suggest an extension of our method that can be used when stronger prior information is available for some of the variances or correlations.
引用
收藏
页码:136 / 157
页数:22
相关论文
共 30 条
[1]   Approximate is better than "exact" for interval estimation of binomial proportions [J].
Agresti, A ;
Coull, BA .
AMERICAN STATISTICIAN, 1998, 52 (02) :119-126
[3]  
[Anonymous], BEHAVIORMETRIKA
[4]  
[Anonymous], CLUSTER LEVEL COVARI
[5]  
[Anonymous], 2016, Upgrade Manual for Latent GOLD 5.1: Basic, Advanced
[6]  
Barnard J, 2000, STAT SINICA, V10, P1281
[7]  
Bates D., 2010, LME4 MIXED EFFECTS M
[8]   A Nondegenerate Penalized Likelihood Estimator for Variance Parameters in Multilevel Models [J].
Chung, Yeojin ;
Rabe-Hesketh, Sophia ;
Dorie, Vincent ;
Gelman, Andrew ;
Liu, Jingchen .
PSYCHOMETRIKA, 2013, 78 (04) :685-709
[9]   Avoiding zero between-study variance estimates in random-effects meta-analysis [J].
Chung, Yeojin ;
Rabe-Hesketh, Sophia ;
Choi, In-Hee .
STATISTICS IN MEDICINE, 2013, 32 (23) :4071-4089
[10]   Penalized maximum likelihood estimator for normal mixtures [J].
Ciuperca, G ;
Ridolfi, A ;
Idier, J .
SCANDINAVIAN JOURNAL OF STATISTICS, 2003, 30 (01) :45-59