Choosing models in model-based clustering and discriminant analysis

被引:75
作者
Biernacki, C
Govaert, G
机构
[1] INRIA Rhone Alps, ZIRST, F-38330 St Martin, France
[2] Univ Technol Compiegne, CNRS, UMR 6599, F-60205 Compiegne, France
关键词
Gaussian mixture models; eigenvalue decomposition; cross-validation; information; Bayesian and classification criteria;
D O I
10.1080/00949659908811966
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Using an eigenvalue decomposition of variance matrices, Celeux and Govaert (1993) obtained numerous and powerful models for Gaussian model-based clustering and discriminant analysis. Through Monte Carlo simulations, we compare the performances of many classical criteria to select these models: information criteria as AIC, the Bayesian criterion BIG, classification criteria as NEC and cross-validation. In the clustering context, information criteria and BIC outperform the classification criteria. In the discriminant analysis context, cross-validation shows good performance but information criteria and BIC give satisfactory results as well with, by far, less time-computing.
引用
收藏
页码:49 / 71
页数:23
相关论文
共 24 条
[1]  
AITKIN M, 1985, J ROY STAT SOC B MET, V47, P67
[2]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[3]   MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING [J].
BANFIELD, JD ;
RAFTERY, AE .
BIOMETRICS, 1993, 49 (03) :803-821
[4]   Inference in model-based cluster analysis [J].
Bensmail, H ;
Celeux, G ;
Raftery, AE ;
Robert, CP .
STATISTICS AND COMPUTING, 1997, 7 (01) :1-10
[5]   Regularized Gaussian discriminant analysis through eigenvalue decomposition [J].
Bensmail, H ;
Celeux, G .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (436) :1743-1748
[6]  
Biernacki C, 1997, COMPUTING SCI STAT, V29, P451
[7]  
BIERNACKI C, 1997, THESIS UTC COMPIEGNE
[9]   ON THE INFORMATION-BASED MEASURE OF COVARIANCE COMPLEXITY AND ITS APPLICATION TO THE EVALUATION OF MULTIVARIATE LINEAR-MODELS [J].
BOZDOGAN, H .
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1990, 19 (01) :221-278
[10]   An entropy criterion for assessing the number of clusters in a mixture model [J].
Celeux, G ;
Soromenho, G .
JOURNAL OF CLASSIFICATION, 1996, 13 (02) :195-212