Speaker Model Clustering for Efficient Speaker Identification in Large Population Applications

被引:36
作者
Apsingekar, Vijendra Raj [1 ]
De Leon, Phillip L. [1 ]
机构
[1] New Mexico State Univ, Klipsch Sch Elect & Comp Engn, Las Cruces, NM 88003 USA
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2009年 / 17卷 / 04期
关键词
Clustering methods; speaker recognition; VERIFICATION;
D O I
10.1109/TASL.2008.2010882
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In large population speaker identification (SI) systems, likelihood computations between an unknown speaker's feature vectors and the registered speaker models can be very time-consuming and impose a bottleneck. For applications requiring fast SI, this is a recognized problem and improvements in efficiency would be beneficial. In this paper, we propose a method whereby GMM-based speaker models are clustered using a simple k-means algorithm. Then, during the test stage, only a small proportion of speaker models in selected clusters are used in the likelihood computations resulting in a significant speed-up with little to no loss in accuracy. In general, as the number of selected clusters is reduced, the identification accuracy decreases; however, this loss can be controlled through proper tradeoff. The proposed method may also be combined with other test stage speed-up techniques resulting in even greater speed-up gains without additional sacrifices in accuracy.
引用
收藏
页码:848 / 853
页数:6
相关论文
共 23 条
[21]  
Sun B, 2003, 2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, P299
[22]   Automatic speaker clustering using a voice characteristic reference space and maximum purity estimation [J].
Tsai, Wei-Ho ;
Cheng, Shih-Sian ;
Wang, Hsin-Min .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04) :1461-1474
[23]   Efficient text-independent speaker verification with structural Gaussian mixture models and neural network [J].
Xiang, B ;
Berger, T .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05) :447-456