Speaker Model Clustering for Efficient Speaker Identification in Large Population Applications

被引:36
作者
Apsingekar, Vijendra Raj [1 ]
De Leon, Phillip L. [1 ]
机构
[1] New Mexico State Univ, Klipsch Sch Elect & Comp Engn, Las Cruces, NM 88003 USA
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2009年 / 17卷 / 04期
关键词
Clustering methods; speaker recognition; VERIFICATION;
D O I
10.1109/TASL.2008.2010882
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In large population speaker identification (SI) systems, likelihood computations between an unknown speaker's feature vectors and the registered speaker models can be very time-consuming and impose a bottleneck. For applications requiring fast SI, this is a recognized problem and improvements in efficiency would be beneficial. In this paper, we propose a method whereby GMM-based speaker models are clustered using a simple k-means algorithm. Then, during the test stage, only a small proportion of speaker models in selected clusters are used in the likelihood computations resulting in a significant speed-up with little to no loss in accuracy. In general, as the number of selected clusters is reduced, the identification accuracy decreases; however, this loss can be controlled through proper tradeoff. The proposed method may also be combined with other test stage speed-up techniques resulting in even greater speed-up gains without additional sacrifices in accuracy.
引用
收藏
页码:848 / 853
页数:6
相关论文
共 23 条
[1]   Discriminative in-set/out-of-set speaker recognition [J].
Angkititrakul, Pongtep ;
Hansen, John H. L. .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (02) :498-508
[2]  
Aronowitz H., 2004, P ICSLP, P609
[3]   Efficient speaker recognition using approximated cross entropy (ACE) [J].
Aronowitz, Hagai ;
Burshtein, David .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07) :2033-2043
[4]  
BEIGI HSM, 1999, P 6 EUR C SPEECH COM, P2203
[5]  
BEN M, 2002, P IEEE INT C AC SPEE
[6]  
Campbell WM, 2006, INT CONF ACOUST SPEE, P97
[7]  
CAMPBELL WM, 2007, P ICASSP 2007, V4, P217
[8]  
DELEON PL, 2007, P IEEE BIOM S
[9]  
Faloutsos C., 1994, Journal of Intelligent Information Systems: Integrating Artificial Intelligence and Database Technologies, V3, P231, DOI 10.1007/BF00962238
[10]  
GOLDBERGER J, 2005, P INT, P1982