Speaker verification using speaker- and test-dependent fast score normalization

被引:26
作者
Ramos-Castro, Daniel [1 ]
Fierrez-Aguilar, Julian [1 ]
Gonzalez-Rodriguez, Joaquin [1 ]
Ortega-Garcia, Javier [1 ]
机构
[1] Univ Autonoma Madrid, Escuela Politecn Super, Speech & Signal Proc Grp ATVS, E-28049 Madrid, Spain
关键词
speaker verification; score normalization; Tnorm; Kullback-Leibler divergence; cohort selection; speaker-dependent; test-dependent;
D O I
10.1016/j.patrec.2006.06.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A novel score normalization scheme for speaker verification is presented. The proposed technique is based on the widely used test-normalization method (Tnorm), which compensates test-dependent variability using a fixed cohort of impostors. The new procedure selects a speaker-dependent subset of impostor models from the fixed cohort using a distance-based criterion. Selection of the sub-cohort is made using a distance measure based on a fast approximation of the Kullback-Leibler (KL) divergence for Gaussian mixture models (GMM). The proposed technique has been called KL-Tnorm, and outperforms Tnorm in computational efficiency. Experimental results using NIST 2005 Speaker Recognition Evaluation protocol also show a stable performance improvement of our method on standard speaker recognition systems. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:90 / 98
页数:9
相关论文
共 33 条
[1]  
[Anonymous], 2005, Proceedings Eurospeech
[2]  
[Anonymous], 2003, European Conference on Speech Communication and Technology
[3]  
[Anonymous], 1997, Proceedings of the uropean Conference on Speech Communication and Technology
[4]  
[Anonymous], P SPEAK LANG REC WOR
[5]  
[Anonymous], P INT C SPOK LANG PR
[6]  
[Anonymous], 2005, P INTERSPEECH
[7]  
Aronowitz H., 2005, P INT, P2433
[8]  
Aronowitz H., 2004, P ICSLP, P609
[9]   Score normalization for text-independent speaker verification systems [J].
Auckenthaler, R ;
Carey, M ;
Lloyd-Thomas, H .
DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) :42-54
[10]  
BEN M, 2005, P INT, P3061