On the upper bound of the number of modes of a multivariate normal mixture

被引:10
作者
Ray, Surajit [1 ]
Ren, Dan [1 ]
机构
[1] Boston Univ, Dept Math & Stat, Boston, MA 02215 USA
基金
美国国家科学基金会;
关键词
Mixture; Modal cluster; Multivariate mode; Clustering; Dimension reduction; Topography; Manifold;
D O I
10.1016/j.jmva.2012.02.006
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The main result of this article states that one can get as many as D + 1 modes from just a two component normal mixture in D dimensions. Multivariate mixture models are widely used for modeling homogeneous populations and for cluster analysis. Either the components directly or modes arising from these components are often used to extract individual clusters. Although in lower dimensions these strategies work well, our results show that high dimensional mixtures are often very complex and researchers should take extra precautions when using mixture models for cluster analysis. Further our analysis shows that the number of modes depends on the component means and eigenvalues of the ratio of the two component covariance matrices, which in turn provides a clear guideline as to when one can use mixture analysis for clustering high dimensional data. (c) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:41 / 52
页数:12
相关论文
共 27 条
[1]  
[Anonymous], WILEY SERIES PROBABI
[2]  
[Anonymous], 2006, FINITE MIXTURE MARKO
[3]   ON MODES OF A MIXTURE OF 2 NORMAL DISTRIBUTIONS [J].
BEHBOODIAN, J .
TECHNOMETRICS, 1970, 12 (01) :131-+
[4]  
Carreira-Perpiñán MA, 2003, LECT NOTES COMPUT SC, V2695, P625
[5]   Inference for multivariate normal mixtures [J].
Chen, Jiahua ;
Tan, Xianming .
JOURNAL OF MULTIVARIATE ANALYSIS, 2009, 100 (07) :1367-1383
[6]   Calibrating the excess mass and dip tests of modality [J].
Cheng, MY ;
Hall, P .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1998, 60 :579-589
[7]  
Cheng MY, 1999, ANN STAT, V27, P1294
[8]   Likelihood ratio testing for hidden markov models under non-standard conditions [J].
Dannemann, Joern ;
Holzmann, Hajo .
SCANDINAVIAN JOURNAL OF STATISTICS, 2008, 35 (02) :309-321
[9]   GENESIS OF BIMODAL DISTRIBUTIONS [J].
EISENBERGER, I .
TECHNOMETRICS, 1964, 6 (04) :357-&
[10]  
Hartigan J. A., 1988, Classification and Related Methods of Data Analysis. Proceedings of the First Conference of the International Federation of Classification Societies (IFCS), P229