Assessing a mixture model for clustering with the integrated completed likelihood

被引:964
作者
Biernacki, C
Celeux, G
Govaert, G
机构
[1] Univ Franche Comte, Dept Math, CNRS, UMR 6623, F-25030 Besancon, France
[2] INRIA Rhone Alpes, ZIRST, F-38330 Monbonnot St Martin, France
[3] Univ Technol Compiegne, Dept Genie Informat, CNRS, UMR 6599, F-60205 Compiegne, France
关键词
mixture model; clustering; integrated likelihood; BIC; integrated completed likelihood; ICL criterion;
D O I
10.1109/34.865189
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose assessing a mixture model in a cluster analysis setting with the integrated completed likelihood. With this purpose, the observed data are assigned to unknown clusters using a maximum a posteriori operator. Then, the Integrated Completed Likelihood (ICL) is approximated using an a` la Bayesian information criterion (BIC). Numerical experiments on simulated and real data of the resulting ICL criterion show that it performs well both for choosing a mixture model and a relevant number of clusters. In particular. ICL appears to be more robust than BIC to violation of some of the mixture model assumptions and it can select a number of clusters leading to a sensible partitioning of the data.
引用
收藏
页码:719 / 725
页数:7
相关论文
共 29 条