A LASSO-penalized BIC for mixture model selection

被引:0
作者
Sakyajit Bhattacharya
Paul D. McNicholas
机构
[1] University of Guelph,Department of Mathematics and Statistics
来源
Advances in Data Analysis and Classification | 2014年 / 8卷
关键词
BIC; LASSO; Mixture models; Model-based clustering; Model selection; 62F99; 62H30; 62H25;
D O I
暂无
中图分类号
学科分类号
摘要
The efficacy of family-based approaches to mixture model-based clustering and classification depends on the selection of parsimonious models. Current wisdom suggests the Bayesian information criterion (BIC) for mixture model selection. However, the BIC has well-known limitations, including a tendency to overestimate the number of components as well as a proclivity for underestimating, often drastically, the number of components in higher dimensions. While the former problem might be soluble by merging components, the latter is impossible to mitigate in clustering and classification applications. In this paper, a LASSO-penalized BIC (LPBIC) is introduced to overcome this problem. This approach is illustrated based on applications of extensions of mixtures of factor analyzers, where the LPBIC is used to select both the number of components and the number of latent factors. The LPBIC is shown to match or outperform the BIC in several situations.
引用
收藏
页码:45 / 61
页数:16
相关论文
共 95 条
  • [1] Akaike H(1974)A new look at the statistical model identification IEEE Trans Autom Control 19 716-723
  • [2] Bozdogan H(1987)Model selection and Akaike’s information criterion (AIC): the general theory and its analytical extensions Psychometrika 52 345-370
  • [3] Celeux G(1995)Gaussian parsimonious clustering models Pattern Recognit 28 781-793
  • [4] Govaert G(2008)Extended Bayesian information criteria for model selection with large model spaces Biometrika 95 759-771
  • [5] Chen J(2007)Current challenges in Bayesian model choice Statistical Challeng Modern Astron IV 371 224-240
  • [6] Chen Z(1979)Smoothing noisy data with spline functions: estimating the correct degree of smoothing by the method of generalized cross-validation Numer Math 31 377-403
  • [7] Clyde M(1977)Maximum likelihood from incomplete data via the EM algorithm J R Stat Soc Ser B 39 1-38
  • [8] Berger J(2001)Variable selection via nonconcave penalized likelihood and its oracle properties J Am Stat Assoc 96 1348-1360
  • [9] Bullard F(2002)Model-based clustering, discriminant analysis, and density estimation J Am Stat Assoc 97 611-631
  • [10] Ford E(2007)Bayesian regularization for normal mixture estimation and model-based clustering J Classif 24 155-181