Non-parametric Mixture Models for Clustering

被引:0
作者
Mallapragada, Pavan Kumar [1 ]
Jin, Rong [1 ]
Jain, Anil [1 ]
机构
[1] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA
来源
STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION | 2010年 / 6218卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mixture models have been widely used for data clustering. However, commonly used mixture models are generally of a parametric form (e.g., mixture of Gaussian distributions or GMM), which significantly limits their capacity in fitting diverse multidimensional data distributions encountered in practice. We propose a non-parametric mixture model (NMM) for data clustering in order to detect clusters generated from arbitrary unknown distributions, using non-parametric kernel density estimates. The proposed model is non-parametric since the generative distribution of each data point depends only on the rest of the data points and the chosen kernel. A leave-one-out likelihood maximization is performed to estimate the parameters of the model. The NMM approach, when applied to cluster high dimensional text datasets significantly outperforms the state-of-the-art and classical approaches such as K-means, Gaussian Mixture Models, spectral clustering and linkage methods.
引用
收藏
页码:334 / 343
页数:10
相关论文
共 20 条
  • [1] Andreetto M, 2007, IEEE I CONF COMP VIS, P1104
  • [2] [Anonymous], 2007, Advances in neural information processing systems
  • [3] [Anonymous], 1994, Kernel smoothing
  • [4] [Anonymous], 2001, Pattern Classification
  • [5] [Anonymous], 1988, Algorithms for Clustering Data
  • [6] [Anonymous], 2006, Pattern recognition and machine learning
  • [7] Banerjee A., 2004, KDD'04: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, P515
  • [8] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [9] Mean shift: A robust approach toward feature space analysis
    Comaniciu, D
    Meer, P
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (05) : 603 - 619
  • [10] Csiszar I., 1984, INFORM GEOMETRY ALTE