Parsimonious Gaussian mixture models

被引:215
作者
McNicholas, Paul David [2 ]
Murphy, Thomas Brendan [1 ]
机构
[1] Univ Coll Dublin, Sch Math Sci, Dublin 4, Ireland
[2] Univ Guelph, Dept Math & Stat, Guelph, ON N1G 2W1, Canada
基金
爱尔兰科学基金会; 美国国家卫生研究院;
关键词
mixture models; factor analysis; probabilistic principal components analysis; cluster analysis; model-based clustering;
D O I
10.1007/s11222-008-9056-0
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Parsimonious Gaussian mixture models are developed using a latent Gaussian model which is closely related to the factor analysis model. These models provide a unified modeling framework which includes the mixtures of probabilistic principal component analyzers and mixtures of factor of analyzers models as special cases. In particular, a class of eight parsimonious Gaussian mixture models which are based on the mixtures of factor analyzers model are introduced and the maximum likelihood estimates for the parameters in these models are found using an AECM algorithm. The class of models includes parsimonious models that have not previously been developed. These models are applied to the analysis of chemical and physical properties of Italian wines and the chemical properties of coffee; the models are shown to give excellent clustering performance.
引用
收藏
页码:285 / 296
页数:12
相关论文
共 35 条
[1]   MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING [J].
BANFIELD, JD ;
RAFTERY, AE .
BIOMETRICS, 1993, 49 (03) :803-821
[2]  
Bartholomew D. J., 1999, LATENT VARIABLE MODE
[3]   Assessing a mixture model for clustering with the integrated completed likelihood [J].
Biernacki, C ;
Celeux, G ;
Govaert, G .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (07) :719-725
[4]   THE DISTRIBUTION OF THE LIKELIHOOD RATIO FOR MIXTURES OF DENSITIES FROM THE ONE-PARAMETER EXPONENTIAL FAMILY [J].
BOHNING, D ;
DIETZ, E ;
SCHAUB, R ;
SCHLATTMANN, P ;
LINDSAY, BG .
ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 1994, 46 (02) :373-388
[5]   GAUSSIAN PARSIMONIOUS CLUSTERING MODELS [J].
CELEUX, G ;
GOVAERT, G .
PATTERN RECOGNITION, 1995, 28 (05) :781-793
[6]  
CHANG WC, 1983, J ROY STAT SOC C, V32, P267
[7]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[8]  
FORINA M, 1986, VITIS, V25, P189
[9]   How many clusters? Which clustering method? Answers via model-based cluster analysis [J].
Fraley, C ;
Raftery, AE .
COMPUTER JOURNAL, 1998, 41 (08) :578-588
[10]   Model-based clustering, discriminant analysis, and density estimation [J].
Fraley, C ;
Raftery, AE .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (458) :611-631