A novel split-and-merge algorithm for hierarchical clustering of Gaussian mixture models

Cited by: 11
Authors
Popovic, Branislav [1]
Janev, Marko [2]
Pekar, Darko [3]
Jakovljevic, Niksa [1]
Gnjatovic, Milan [1, 4]
Secujski, Milan [1]
Delic, Vlado [1]
Affiliations
[1] Univ Novi Sad, Fac Tech Sci, Novi Sad 21000, Serbia
[2] Serbian Acad Arts & Sci, Math Inst, Belgrade, Serbia
[3] Alfanum Speech Technol, Novi Sad, Serbia
[4] Univ Novi Sad, Dept Power Elect & Commun Engn, Novi Sad 21000, Serbia
Keywords
Gaussian mixtures; Split-and-merge operation; Hierarchical clustering; Continuous speech recognition
DOI
10.1007/s10489-011-0333-9
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The paper presents a novel split-and-merge algorithm for hierarchical clustering of Gaussian mixture models, which tends to improve on the locally optimal solution determined by the initial constellation. It is initialized with locally optimal parameters obtained by a baseline approach similar to k-means, and it tends to move closer to the global optimum of the target clustering function by iteratively splitting and merging the clusters of Gaussian components produced by the baseline algorithm. The algorithm is further improved by introducing model selection, in order to obtain the best possible trade-off between recognition accuracy and computational load in a Gaussian selection task applied within an actual recognition system. The proposed method is tested both on artificial data and in the framework of Gaussian selection performed within a real continuous speech recognition system; in both cases an improvement over the baseline method was observed.
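As a rough illustration of the kind of k-means-like baseline the abstract mentions, the sketch below clusters Gaussian components rather than data points. It is not the authors' implementation: the choice of Kullback-Leibler divergence as the assignment distance, the moment-matched cluster centroid, and the names `kl_gauss`, `merge`, and `cluster_gaussians` are all assumptions made for this example. The split-and-merge refinement and the model selection step are not shown.

```python
import numpy as np

def kl_gauss(m0, S0, m1, S1):
    """KL(N(m0, S0) || N(m1, S1)) for full-covariance Gaussians."""
    d = m0.shape[0]
    S1_inv = np.linalg.inv(S1)
    diff = m1 - m0
    _, logdet0 = np.linalg.slogdet(S0)
    _, logdet1 = np.linalg.slogdet(S1)
    return 0.5 * (np.trace(S1_inv @ S0) + diff @ S1_inv @ diff
                  - d + logdet1 - logdet0)

def merge(weights, means, covs):
    """Single Gaussian matching the mean and covariance of a weighted set."""
    w = weights / weights.sum()
    mean = np.einsum("i,ij->j", w, means)
    diffs = means - mean
    cov = (np.einsum("i,ijk->jk", w, covs)
           + np.einsum("i,ij,ik->jk", w, diffs, diffs))
    return mean, cov

def cluster_gaussians(weights, means, covs, init_labels, n_iter=20):
    """k-means-like loop (assumed form): recompute centroids, then reassign.

    init_labels must use every cluster index 0..k-1 at least once.
    """
    weights = np.asarray(weights, dtype=float)
    means = np.asarray(means, dtype=float)
    covs = np.asarray(covs, dtype=float)
    labels = np.asarray(init_labels).copy()
    k = int(labels.max()) + 1
    for _ in range(n_iter):
        # Centroid of each cluster: moment-matched merge of its members.
        centroids = [merge(weights[labels == c], means[labels == c],
                           covs[labels == c]) for c in range(k)]
        # Assign each component to the centroid with the smallest divergence.
        new_labels = np.array([
            np.argmin([kl_gauss(means[i], covs[i], cm, cS)
                       for cm, cS in centroids])
            for i in range(len(means))
        ])
        # Stop on convergence, or if a cluster would become empty.
        if len(np.unique(new_labels)) < k or np.array_equal(new_labels, labels):
            break
        labels = new_labels
    return labels
```

The result of such a loop depends on the starting assignment, which is the sensitivity to the initial constellation that the abstract refers to. For example, with four unit-covariance components centred at (0, 0), (0.5, 0), (10, 10) and (10.5, 10), an alternating start `[0, 1, 0, 1]` leaves this sketch stuck at that labelling, whereas the start `[0, 0, 0, 1]` recovers the two natural groups.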
Pages: 377-389
Page count: 13