Mixture model clustering for mixed data with missing information

被引:76
作者
Hunt, L [1 ]
Jorgensen, M [1 ]
机构
[1] Univ Waikato, Dept Stat, Hamilton, New Zealand
关键词
clustering; mixed data; missing at random;
D O I
10.1016/S0167-9473(02)00190-1
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
One difficulty with classification studies is unobserved or missing observations that often occur in multivariate datasets. The mixture likelihood approach to clustering has been well developed and is much used, particularly for mixtures where the component distributions are multivariate normal. It is shown that this approach can be extended to analyse data with mixed categorical and continuous attributes and where some of the data are missing at random in the sense of Little and Rubin (Statistical Analysis with Mixing Data, Wiley, New York). (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:429 / 440
页数:12
相关论文
共 28 条
[1]   MISSING OBSERVATIONS IN MULTIVARIATE STATISTICS .1. IEW OF LITERATURE [J].
AFIFI, AA ;
ELASHOFF, RM .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1966, 61 (315) :595-&
[2]  
Andrews D. F., 1985, Springer Series in Statistics
[3]  
[Anonymous], 1964, ED TESTING SERVICE R
[4]  
[Anonymous], 2000, WILEY SERIES PROBABI
[5]  
[Anonymous], P C INF STAT IND SCI
[6]  
[Anonymous], ANAL INCOMPLETE DATA
[7]   MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING [J].
BANFIELD, JD ;
RAFTERY, AE .
BIOMETRICS, 1993, 49 (03) :803-821
[8]  
BYAR DP, 1980, B CANCER, V67, P477
[9]   An entropy criterion for assessing the number of clusters in a mixture model [J].
Celeux, G ;
Soromenho, G .
JOURNAL OF CLASSIFICATION, 1996, 13 (02) :195-212
[10]  
DAY NE, 1969, BIOMETRIKA, V56, P464