Composite Likelihood Bayesian Information Criteria for Model Selection in High-Dimensional Data

被引:94
作者
Gao, Xin [1 ]
Song, Peter X. -K. [2 ]
机构
[1] York Univ, Dept Math & Stat, N York, ON M3J 1P3, Canada
[2] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
关键词
Consistency; Model selection; Pseudo-likelihood; Variable selection; APPROXIMATIONS; INFERENCE;
D O I
10.1198/jasa.2010.tm09414
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
For high-dimensional data sets with complicated dependency structures, the full likelihood approach often leads to intractable computational complexity. This imposes difficulty on model selection, given that most traditionally used information criteria require evaluation of the full likelihood. We propose a composite likelihood version of the Bayes information criterion (BIC) and establish its consistency property for the selection of the true underlying marginal model. Our proposed BIC is shown to be selection-consistent under some mild regularity conditions, where the number of potential model parameters is allowed to increase to infinity at a certain rate of the sample size. Simulation studies demonstrate the empirical performance of this new BIC, especially for the scenario where the number of parameters increases with sample size. Technical proofs of our theoretical results are provided in the online supplemental materials.
引用
收藏
页码:1531 / 1540
页数:10
相关论文
共 34 条
[1]  
Akaike H., 1998, Selected papers of Hirotugu Akaike, P199, DOI DOI 10.1007/978-1-4612-1694-0_15
[2]  
[Anonymous], STAT INFERENCE STOCH
[3]  
[Anonymous], SCAND J STAT
[4]  
[Anonymous], 1997, LIMIT THEOREMS CHANG
[5]  
[Anonymous], EXTENDED BIC S UNPUB
[6]   Approximations and consistency of Bayes factors as model dimension grows [J].
Berger, JO ;
Ghosh, JK ;
Mukhopadhyay, N .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2003, 112 (1-2) :241-258
[7]   A generalization of BIC for the general exponential family [J].
Chakrabarti, Arijit ;
Ghosh, Jayanta K. .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2006, 136 (09) :2847-2872
[8]   Extended Bayesian information criteria for model selection with large model spaces [J].
Chen, Jiahua ;
Chen, Zehua .
BIOMETRIKA, 2008, 95 (03) :759-771
[9]  
Cox D.R., 1974, THEORETICAL STAT
[10]   A note on pseudolikelihood constructed from marginal densities [J].
Cox, DR ;
Reid, N .
BIOMETRIKA, 2004, 91 (03) :729-737