An Evolutionary Algorithm with Crossover and Mutation for Model-Based Clustering

Cited by: 6
Authors
McNicholas, Sharon M. [1 ]
McNicholas, Paul D. [1 ]
Ashlock, Daniel A. [2 ]
Affiliations
[1] McMaster Univ, Dept Math & Stat, Hamilton, ON L8S 4L8, Canada
[2] Univ Guelph, Dept Math & Stat, Guelph, ON N1G 2W1, Canada
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC);
Keywords
Clustering; Crossover; Evolutionary algorithm; Mixture models; Mutation; Model-based clustering; MIXTURE MODEL; APPROXIMATIONS; SELECTION;
DOI
10.1007/s00357-020-09371-4
Chinese Library Classification (CLC) number
O1 [Mathematics];
Subject classification code
0701 ; 070101 ;
Abstract
An evolutionary algorithm (EA) is developed as an alternative to the EM algorithm for parameter estimation in model-based clustering. This EA facilitates a different search of the fitness landscape, i.e., the likelihood surface, utilizing both crossover and mutation. Furthermore, this EA represents an efficient approach to "hard" model-based clustering and so it can be viewed as a sort of generalization of the k-means algorithm, which is itself equivalent to a restricted Gaussian mixture model. The EA is illustrated on several datasets, and its performance is compared with that of other hard clustering approaches and model-based clustering via the EM algorithm.
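To make the idea in the abstract concrete, the following is a minimal sketch of an evolutionary search over hard cluster assignments. It is not the authors' algorithm: the abstract does not specify the operators, so the single-point crossover, the per-label mutation, the truncation selection, and every function name and parameter here (`fitness`, `crossover`, `mutate`, `evolve`, `rate`, `pop_size`) are illustrative assumptions. The fitness is the negative within-cluster sum of squares, which corresponds to the classification log-likelihood (up to constants) only under the restricted Gaussian mixture that yields k-means, i.e., equal mixing proportions and a common spherical covariance.

```python
import random

def fitness(data, labels, k):
    """Negative within-cluster sum of squares (higher is better).

    Assumption: this stands in for the likelihood under a restricted
    Gaussian mixture (equal proportions, common spherical covariance).
    """
    dim = len(data[0])
    sums = [[0.0] * dim for _ in range(k)]
    counts = [0] * k
    for x, g in zip(data, labels):
        counts[g] += 1
        for j in range(dim):
            sums[g][j] += x[j]
    if min(counts) == 0:
        return float("-inf")  # reject solutions with an empty cluster
    means = [[s / counts[g] for s in sums[g]] for g in range(k)]
    return -sum((x[j] - means[g][j]) ** 2
                for x, g in zip(data, labels) for j in range(dim))

def crossover(a, b, rng):
    """Hypothetical operator: single-point crossover of two label vectors."""
    cut = rng.randrange(1, len(a))
    return a[:cut] + b[cut:]

def mutate(labels, k, rate, rng):
    """Hypothetical operator: redraw each label with probability `rate`."""
    return [rng.randrange(k) if rng.random() < rate else g for g in labels]

def evolve(data, k, pop_size=20, generations=200, rate=0.05, seed=0):
    """Evolve a population of hard label vectors; return the fittest one."""
    rng = random.Random(seed)
    n = len(data)
    pop = [[rng.randrange(k) for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=lambda lab: fitness(data, lab, k), reverse=True)
        parents = pop[: pop_size // 2]  # keep the better half unchanged
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            children.append(mutate(crossover(a, b, rng), k, rate, rng))
        pop = parents + children
    return max(pop, key=lambda lab: fitness(data, lab, k))

# Toy usage: two well-separated groups of three points each.
data = [(0.0, 0.1), (0.2, 0.0), (0.1, 0.2),
        (5.0, 5.1), (5.2, 4.9), (4.9, 5.0)]
labels = evolve(data, k=2)
print(labels)
```

One design caveat of this sketch: crossover on raw label vectors is sensitive to label switching, since two parents can encode the same partition with permuted labels. A more careful implementation would presumably align labels before recombining, or replace the fitness with the likelihood of a less restricted mixture model.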
Pages: 264-279
Page count: 16