Reducing speaker model search space in speaker identification

被引:0
作者
De Leon, Phillip L. [1 ]
Apsingekar, Vijendra [1 ]
机构
[1] New Mexico State Univ, Klipsch Sch Elect & Comp Engn, Las Cruces, NM 88003 USA
来源
2007 BIOMETRICS SYMPOSIUM | 2007年
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
For large population speaker identification (SID) systems, likelihood computations between an unknown speaker's test feature set and speaker models can be very time-consuming and detrimental to applications where fast SID is required. In this paper, we propose a method whereby speaker models are clustered during the training stage. Then during the testing stage, only those clusters which are likely to contain high-likelihood speaker models are searched. The proposed method reduces the speaker model space which directly results in faster SID. Although there maybe a slight loss in identification accuracy depending on the number of clusters searched, this loss can be controlled by trading off speed and accuracy.
引用
收藏
页码:90 / 95
页数:6
相关论文
共 15 条
[1]   Discriminative in-set/out-of-set speaker recognition [J].
Angkititrakul, Pongtep ;
Hansen, John H. L. .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (02) :498-508
[2]   Support vector machines using GMM supervectors for speaker verification [J].
Campbell, WM ;
Sturim, DE ;
Reynolds, DA .
IEEE SIGNAL PROCESSING LETTERS, 2006, 13 (05) :308-311
[3]  
CAMPBELL WM, 2007, P INT C AC SPEECH SI
[4]  
Faloutsos C., 1994, Journal of Intelligent Information Systems: Integrating Artificial Intelligence and Database Technologies, V3, P231, DOI 10.1007/BF00962238
[5]  
Gersho A., 1992, VECTOR QUANTIZATION
[6]   Real-time speaker identification and verification [J].
Kinnunen, T ;
Karpov, E ;
Fränti, P .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01) :277-288
[7]  
MASHAO DJ, 2004, IEEE AFRICON S SEP
[8]  
MCLAUGHLIN J, 1999, P 6 EUR C SPEECH COM
[9]   An efficient scoring algorithm for Gaussian mixture model based speaker identification [J].
Pellom, BL ;
Hansen, JHL .
IEEE SIGNAL PROCESSING LETTERS, 1998, 5 (11) :281-284
[10]  
Quatieri T., 2002, DISCRETE TIME SPEECH