Evaluate the number of clusters in finite mixture models with the penalized histogram difference criterion

被引:1
作者
Lin, Weilu [1 ]
Wang, Yonghong [1 ]
Zhuang, Yingping [1 ]
Zhang, Siliang [1 ]
机构
[1] E China Univ Sci & Technol, Sate Key Lab Bioreactor Engn, Shanghai 200237, Peoples R China
基金
国家高技术研究发展计划(863计划);
关键词
Finite mixture models; Penalized histogram difference criterion; Gaussian mixtures; Information criteria; EM algorithm;
D O I
10.1016/j.jprocont.2013.06.008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Aimed at the determination of the number of mixtures for finite mixture models (FMMs), in this work, a new method called the penalized histogram difference criterion (PHDC) is proposed and evaluated with other criteria such as Akaike information criterion (AIC), the minimum message length (MML), the information complexity (ICOMP) and the evidence of data criterion (EDC). The new method, which calculates the penalized histogram difference between the data generated from estimated FMMs and those for modeling purpose, turns out to be better than others for data with complicate mixtures patterns. It is demonstrated in this work that the PHDC can determine the optimal number of clusters of the FMM. Furthermore, the estimated FMMs asymptotically approximate the true model. The utility of the new method is demonstrated through synthetic data sets analysis and the batch-wise comparison of citric acid fermentation processes. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1052 / 1062
页数:11
相关论文
共 28 条