Mixture-based clustering for the ordered stereotype model

被引:24
作者
Fernandez, D. [1 ]
Arnold, R. [1 ]
Pledger, S. [1 ]
机构
[1] Victoria Univ Wellington, Sch Math Stat & Operat Res, Wellington 6140, New Zealand
关键词
Biclustering; Cluster analysis; Dimension reduction; EM-algorithm; Finite mixture model; Fuzzy clustering; Likert scale; Ordinal data; Stereotype model; LATENT CLASS MODELS; MAXIMUM-LIKELIHOOD; CATEGORICAL-DATA; REGRESSION; SELECTION;
D O I
10.1016/j.csda.2014.11.004
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Many of the methods which deal with the reduction of dimensionality in matrices of data are based on mathematical techniques such as distance-based algorithms or matrix decomposition and eigenvalues. Recently a group of likelihood-based finite mixture models for a data matrix with binary or count data, using basic Bernoulli or Poisson building blocks has been developed. This is extended and establishes likelihood-based multivariate methods for a data matrix with ordinal data which applies fuzzy clustering via finite mixtures to the ordered stereotype model. Model-fitting is performed using the expectation-maximization (EM) algorithm, and a fuzzy allocation of rows, columns, and rows and columns simultaneously to corresponding clusters is obtained. A simulation study is presented which includes a variety of scenarios in order to test the reliability of the proposed model. Finally, the results of the application of the model in two real data sets are shown. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:46 / 75
页数:30
相关论文
共 68 条
[1]   QUASI-SYMMETRICAL LATENT CLASS MODELS, WITH APPLICATION TO RATER AGREEMENT [J].
AGRESTI, A ;
LANG, JB .
BIOMETRICS, 1993, 49 (01) :131-139
[2]  
Agresti Alan., 2010, Wiley Series in Probability and Statistics, V2nd, DOI DOI 10.1002/9780470594001
[3]   Bayesian inference for the stereotype regression model: application to a case-control study of prostate cancer [J].
Ahn, Jaeil ;
Mukherjee, Bhramar ;
Banerjee, Mousumi ;
Cooney, Kathleen A. .
STATISTICS IN MEDICINE, 2009, 28 (25) :3139-3157
[4]  
Akaike H., 1992, 2 INT S INF THEOR, P610, DOI [10.1007/978-1-4612-1694-0, 10.1007/978-1-4612-0919-538, 10.1007/978-1-4612-0919-5_38, 10.1007/978-0-387-98135-2, DOI 10.1007/978-1-4612-0919-538]
[5]  
ANDERSON JA, 1984, J R STAT SOC B, V46, P1
[6]  
[Anonymous], 2010, R LANG ENV STAT COMP
[7]  
[Anonymous], 2003, Wiley Series in Probability and Statistics
[8]  
[Anonymous], 2002, Model selection and multimodel inference: a practical informationtheoretic approach
[9]  
[Anonymous], 1996, WILEY, DOI DOI 10.2307/2265928
[10]  
[Anonymous], 2002, Experimental designs and data analysis for biologists