Online variational learning of generalized Dirichlet mixture models with feature selection

被引:13
作者
Fan, Wentao [1 ]
Bouguila, Nizar [2 ]
机构
[1] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ H3G 1T7, Canada
[2] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ H3G 1T7, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Online learning; Mixture models; Feature selection; Variational inference; Generalized Dirichlet; Documents clustering; CLASSIFICATION; REPRESENTATION; TEXTURE;
D O I
10.1016/j.neucom.2012.09.047
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Three frequently recurring themes in machine learning, data mining and related disciplines are clustering, feature selection and online learning. Motivated by the importance of these themes which are generally interrelated, we propose a statistical framework for simultaneous online clustering and feature selection using finite generalized Dirichlet mixture model. The proposed framework allows to control overfitting by, dynamically and simultaneously, adjusting the mixture model's parameters, number of components and the features weights. We describe a principled variational approach for learning the parameters of the proposed statistical model. Results on both synthetic data and real applications involving online documents and images clustering show the merits of the proposed approach. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:166 / 179
页数:14
相关论文
共 57 条
  • [1] Natural gradient works efficiently in learning
    Amari, S
    [J]. NEURAL COMPUTATION, 1998, 10 (02) : 251 - 276
  • [2] [Anonymous], 1997, APPL MATH
  • [3] [Anonymous], 1999, P ADV NEUR INF PROC
  • [4] [Anonymous], 2006, Pattern recognition and machine learning
  • [5] [Anonymous], P ARTIF INTELL STAT
  • [6] On-line handwriting recognition with support vector machines - A kernel approach
    Bahlmann, C
    Haasdonk, B
    Burkhardt, H
    [J]. EIGHTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION: PROCEEDINGS, 2002, : 49 - 54
  • [7] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [8] Bosch A, 2006, LECT NOTES COMPUT SC, V3954, P517
  • [9] Online clustering via finite mixtures of Dirichlet and minimum message length
    Bouguila, N
    Ziou, D
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2006, 19 (04) : 371 - 379
  • [10] Using unsupervised learning of a finite Dirichlet mixture model to improve pattern recognition applications
    Bouguila, N
    Ziou, D
    [J]. PATTERN RECOGNITION LETTERS, 2005, 26 (12) : 1916 - 1925