HIERARCHICAL MIXTURE CLUSTERING AND ITS APPLICATION TO GMM BASED TEXT INDEPENDENT SPEAKER IDENTIFICATION

Cited by: 0
Authors
Saeidi, R. [1]
Mohammadi, H. R. Sadegh [1]
Ganchev, T. [3]
Rodman, R. D. [2]
Affiliations
[1] Iranian Res Inst Elect Engn, Tehran, Iran
[2] North Carolina State Univ Raleigh, Dept Comp Sci, Raleigh, NC USA
[3] Univ Patras, Wire Commun Lab, Patras 26500, Greece
Source
2008 INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS, VOLS 1 AND 2 | 2008
Keywords
Speaker identification; mixture clustering; GMM; speed-up; verification
DOI
10.1109/ISTEL.2008.4651403
Chinese Library Classification number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract
In this paper, we propose a hierarchical mixture clustering (HMC) method and investigate its application to reducing the complexity of a GMM-based speaker identification system. We show that GMM-HMC clusters speakers more accurately than a sorted GMM at the same acceleration rate. The system was tested on a universal background model-Gaussian mixture model (UBM-GMM) with KL divergence as the distance measure. While the proposed system's performance is slightly inferior to that of the baseline system, its comparatively smaller computational load provides the potential to develop systems with higher performance.
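The abstract names KL divergence as the distance measure but does not give its form. As a minimal sketch, assuming diagonal-covariance Gaussian components (an assumption, not stated in this record), the closed-form KL divergence between two components could be computed as follows. The function names `kl_diag_gaussians` and `symmetric_kl` are hypothetical and are not taken from the paper.

```python
import math

def kl_diag_gaussians(mu0, var0, mu1, var1):
    """KL(N0 || N1) for two Gaussians with diagonal covariances.

    Assumption: each Gaussian is given as a list of per-dimension means
    and a list of per-dimension variances (all variances > 0).
    """
    total = 0.0
    for m0, v0, m1, v1 in zip(mu0, var0, mu1, var1):
        # Per-dimension closed form: log-variance ratio, plus the scaled
        # variance and squared mean difference, minus one.
        total += math.log(v1 / v0) + (v0 + (m0 - m1) ** 2) / v1 - 1.0
    return 0.5 * total

def symmetric_kl(mu0, var0, mu1, var1):
    """Symmetrised KL, one common way to obtain a symmetric distance."""
    return (kl_diag_gaussians(mu0, var0, mu1, var1)
            + kl_diag_gaussians(mu1, var1, mu0, var0))
```

The symmetrised variant is shown only because plain KL divergence is not symmetric; whether the paper symmetrises it is not stated here.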
Pages: 770 / +
Page count: 2