A novel split-and-merge algorithm for hierarchical clustering of Gaussian mixture models

被引：11

作者：

Popovic, Branislav ^{[1
]}

Janev, Marko ^{[2
]}

Pekar, Darko ^{[3
]}

Jakovljevic, Niksa ^{[1
]}

Gnjatovic, Milan ^{[1
,4
]}

Secujski, Milan ^{[1
]}

Delic, Vlado ^{[1
]}

机构：

[1] Univ Novi Sad, Fac Tech Sci, Novi Sad 21000, Serbia

[2] Serbian Acad Arts & Sci, Math Inst, Belgrade, Serbia

[3] Alfanum Speech Technol, Novi Sad, Serbia

[4] Univ Novi Sad, Dept Power Elect & Commun Engn, Novi Sad 21000, Serbia

来源：

APPLIED INTELLIGENCE | 2012年 / 37卷 / 03期

关键词：

Gaussian mixtures; Split-and-merge operation; Hierarchical clustering; Continuous speech recognition;

D O I：

10.1007/s10489-011-0333-9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The paper presents a novel split-and-merge algorithm for hierarchical clustering of Gaussian mixture models, which tends to improve on the local optimal solution determined by the initial constellation. It is initialized by local optimal parameters obtained by using a baseline approach similar to k-means, and it tends to approach more closely to the global optimum of the target clustering function, by iteratively splitting and merging the clusters of Gaussian components obtained as the output of the baseline algorithm. The algorithm is further improved by introducing model selection in order to obtain the best possible trade-off between recognition accuracy and computational load in a Gaussian selection task applied within an actual recognition system. The proposed method is tested both on artificial data and in the framework of Gaussian selection performed within a real continuous speech recognition system, and in both cases an improvement over the baseline method has been observed.

引用

页码：377 / 389

页数：13

共 28 条

[1] Subspace constrained Gaussian mixture models for speech recognition
Axelrod, S
Goel, V
Gopinath, RA
Olsen, PA
Visweswariah, K
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (06): : 1144 - 1160
[2] Weighted and constrained possibilistic C-means clustering for online fault detection and isolation
Bahrampour, Soheil
Moshiri, Behzad
Salahshoor, Karim
[J]. APPLIED INTELLIGENCE, 2011, 35 (02) : 269 - 284
[3] Bocchieri E, 1993, P ICASSP 1993 MINN M, V2, pII, DOI 10.1109/ICASSP.1993.319405
[4] Delic V., 2007, Keynote lecture at 12th SPECOM (Speech and Computer), P64
[5] Gaussian mixture models with covariances or precisions in shared multiple subspaces
Dharanipragada, Satya
Visweswariah, Karthik
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (04): : 1255 - 1266
[6] Goldberger J., 2005, P ADV NEUR INF PROC, P505
[7] Approximating the Kullback Leibler Divergence between Gaussian Mixture Models
Hershey, John R.
Olsen, Peder A.
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 317 - 320
[8] Eigenvalues Driven Gaussian Selection in continuous speech recognition using HMMs with full covariance matrices
Janev, Marko
Pekar, Darko
Jakovljevic, Niksa
Delic, Vlado
[J]. APPLIED INTELLIGENCE, 2010, 33 (02) : 107 - 116
[9] Maximum Likelihood Clustering of Gaussians for Speech Recognition
Kannan, A.
Ostendorf, M.
Rohlicek, J. R.
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (03): : 453 - 455
[10] Knill KM, 1996, ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, P470, DOI 10.1109/ICSLP.1996.607156

← 1 2 3 →