Model-Based Clustering by Probabilistic Self-Organizing Maps

被引:23
作者
Cheng, Shih-Sian [1 ,2 ]
Fu, Hsin-Chia [1 ]
Wang, Hsin-Min [2 ]
机构
[1] Natl Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan
[2] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2009年 / 20卷 / 05期
关键词
Classification expectation-maximization (CEM) algorithm; deterministic annealing expectation-maximization (DAEM) algorithm; expectation-maximization (EM) algorithm; model-based clustering; probabilistic self-organizing map (PbSOM); self-organizing map (SOM); VECTOR QUANTIZATION; ALGORITHM; CLASSIFICATION; DENSITY;
D O I
10.1109/TNN.2009.2013708
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we consider the learning process of a probabilistic self-organizing map (PbSOM) as a model-based data clustering procedure that preserves the topological relationships between data clusters in a neural network. Based on this concept, we develop a coupling-likelihood mixture model for the PbSOM that extends the reference vectors in Kohonen's self-organizing map (SOM) to multivariate Gaussian distributions. We also derive three expectation-maximization (EM)-type algorithms, called the SOCEM, SOEM, and SODAEM algorithms, for learning the model (PbSOM) based on the maximum-likelihood criterion. SOCEM is derived by using the classification EM (CEM) algorithm to maximize the classification likelihood; SOEM is derived by using the EM algorithm to maximize the mixture likelihood; and SODAEM is a deterministic annealing (DA) variant of SOCEM and SOEM. Moreover, by shrinking the neighborhood size, SOCEM and SOEM can be interpreted, respectively, as DA variants of the CEM and EM algorithms for Gaussian model-based clustering. The experimental results show that the proposed PbSOM learning algorithms achieve comparable data clustering performance to that of the deterministic annealing EM (DAEM) approach, while maintaining the topology-preserving property.
引用
收藏
页码:805 / 826
页数:22
相关论文
共 39 条
[1]   Constrained clustering and Kohonen self-organizing maps [J].
Ambroise, C ;
Govaert, G .
JOURNAL OF CLASSIFICATION, 1996, 13 (02) :299-313
[2]  
[Anonymous], 1998, TR97021 INT COMP SCI
[3]  
[Anonymous], J MACH LEARN RES
[4]   Probabilistic self-organizing map and radial basis function networks [J].
Anouar, F ;
Badran, F ;
Thiria, S .
NEUROCOMPUTING, 1998, 20 (1-3) :83-96
[5]   MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING [J].
BANFIELD, JD ;
RAFTERY, AE .
BIOMETRICS, 1993, 49 (03) :803-821
[6]   GTM: The generative topographic mapping [J].
Bishop, CM ;
Svensen, M ;
Williams, CKI .
NEURAL COMPUTATION, 1998, 10 (01) :215-234
[7]   A CLASSIFICATION EM ALGORITHM FOR CLUSTERING AND 2 STOCHASTIC VERSIONS [J].
CELEUX, G ;
GOVAERT, G .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1992, 14 (03) :315-332
[8]   Retroviral vector-mediated transfer of an antisense cyclin G1 construct inhibits osteosarcoma tumor growth in nude mice [J].
Chen, DS ;
Zhu, NL ;
Hung, G ;
Skotzko, MJ ;
Hinton, DR ;
Tolo, V ;
Hall, FL ;
Anderson, WF ;
Gordon, EM .
HUMAN GENE THERAPY, 1997, 8 (14) :1667-1674
[9]   A model-selection-based self-splitting Gaussian mixture learning with application to speaker identification [J].
Cheng, SS ;
Wang, HM ;
Fu, HC .
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (17) :2626-2639
[10]   An online cellular probabilistic self-organizing map for static and dynamic data sets [J].
Chow, TWS ;
Wu, SO .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2004, 51 (04) :732-747