Multivariate data clustering for the Gaussian mixture model

被引:0
作者
Kavaliauskas, M
Rudzkis, R
机构
[1] Kaunas Univ Technol, Fac Fundamental Sci, LT-3000 Kaunas, Lithuania
[2] Inst Math & Informat, Dept Appl Stat, LT-08663 Vilnius, Lithuania
关键词
clustering; multivariate data; Gaussian mixture model; projection-based clustering; EM algorithm;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper discusses a soft sample clustering problem for multivariate independent random data satisfying the mixture model of the Gaussian distribution. The theory recommends to estimate the parameters of model by the maximum likelihood method and to use "plug-in" approach for data clustering. Unfortunately, the calculation problem of the maximum likelihood estimate is not completely solved in multivariate case. This work proposes a new constructive a few stage procedure to solve this task. This procedure includes statistical distribution analysis of a large number of the univariate projections of observations, geometric clustering of a multivariate sample and application of EM algorithm. The results of the accuracy analysis of the proposed methods is made by means of Monte-Carlo simulation.
引用
收藏
页码:61 / 74
页数:14
相关论文
共 12 条
[1]  
Berkhin P., 2002, SURVEY CLUSTERING DA
[2]  
BOYLES RA, 1983, J ROY STAT SOC B MET, V45, P47
[3]   Model-based clustering, discriminant analysis, and density estimation [J].
Fraley, C ;
Raftery, AE .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (458) :611-631
[4]   MCLUST: Software for model-based cluster analysis [J].
Fraley, C ;
Raftery, AE .
JOURNAL OF CLASSIFICATION, 1999, 16 (02) :297-306
[5]  
FRIEDMAN JH, 1987, J AM STAT ASSOC, V82, P397
[6]   PROJECTION PURSUIT [J].
HUBER, PJ .
ANNALS OF STATISTICS, 1985, 13 (02) :435-475
[7]  
McLachlan G., 1999, J STAT SOFTWARE, V4
[8]   STATISTICAL ESTIMATION OF A MIXTURE OF GAUSSIAN DISTRIBUTIONS [J].
RUDZKIS, R ;
RADAVICIUS, M .
ACTA APPLICANDAE MATHEMATICAE, 1995, 38 (01) :37-54
[9]   Testing hypotheses on discriminant space in the mixture model of Gaussian distributions [J].
Rudzkis, R ;
Radavicius, M .
ACTA APPLICANDAE MATHEMATICAE, 2003, 79 (1-2) :105-114
[10]  
RUDZKIS R, 1999, ACTA APPL MATH, V59, P617