A Text-Independent Speaker Verification System Based on Cross Entropy

被引:0
作者
Lu, Xiaochun [1 ]
Yin, Junxun [1 ]
机构
[1] S China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China
来源
COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS | 2009年 / 51卷
关键词
speaker verification; GMM score; cross entropy; Gaussian mixture model; normalization;
D O I
10.1007/978-3-642-04962-0_48
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a method based on information theory to estimate the distortion between the enrolled speaker's model and the test utterance in speaker verification system. It uses the cross entropy (CE) to compute the distance between two parametric models (such as GMMs). Different from the traditional average log-likelihood method, it considers the symmetry between the test utterance and the referenced model. In the verification phase, the Zt - norm is used to compensate the session variability. Experiment results based on the TIMIT database show that the proposed method can efficiently reduce error rates over the standard log-likelihood scoring.
引用
收藏
页码:419 / 426
页数:8
相关论文
共 12 条
[1]  
Aronowitz H, 2005, LECT NOTES COMPUT SC, V3361, P243
[2]   Efficient speaker recognition using approximated cross entropy (ACE) [J].
Aronowitz, Hagai ;
Burshtein, David .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07) :2033-2043
[3]   Score normalization for text-independent speaker verification systems [J].
Auckenthaler, R ;
Carey, M ;
Lloyd-Thomas, H .
DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) :42-54
[4]  
BRUCE SF, 1989, P ICASSP, P429
[5]  
GAROFOLO JS, 2007, TIMIT ACOUSTIC PHONE
[6]  
Higgins A., 1991, Digital Signal Processing, V1, P89, DOI 10.1016/1051-2004(91)90098-6
[7]  
*NIST, 2000 NIST SPEAK REC
[8]  
OLSEN P, 2003, P EUR GEN SWITZ SEP, V4, P2509
[9]   Speaker verification using adapted Gaussian mixture models [J].
Reynolds, DA ;
Quatieri, TF ;
Dunn, RB .
DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) :19-41
[10]  
SCHMIDT M, 1995, INT CONF ACOUST SPEE, P333, DOI 10.1109/ICASSP.1995.479541