A Text-Independent Speaker Verification System Based on Cross Entropy

Cited by: 0

Authors:

Lu, Xiaochun [1]

Yin, Junxun [1]

Affiliation:

[1] S China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China

Source:

COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS | 2009 / Vol. 51

Keywords:

speaker verification; GMM score; cross entropy; Gaussian mixture model; normalization;

DOI:

10.1007/978-3-642-04962-0_48

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper presents an information-theoretic method for estimating the distortion between an enrolled speaker's model and a test utterance in a speaker verification system. It uses the cross entropy (CE) to compute the distance between two parametric models (such as GMMs). Unlike the traditional average log-likelihood method, it takes into account the symmetry between the test utterance and the reference model. In the verification phase, ZT-norm is used to compensate for session variability. Experimental results on the TIMIT database show that the proposed method reduces error rates relative to standard log-likelihood scoring.
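The contrast between average log-likelihood scoring and a symmetric cross-entropy score can be illustrated with a small sketch. This is not the paper's implementation: the abstract does not give the exact scoring formula, so the symmetrisation (averaging the two directed cross entropies), the Monte Carlo estimator, the 1-D toy GMM layout, and all function names below are assumptions made for illustration only.

```python
import math
import random

# Assumption: a 1-D GMM is a list of (weight, mean, std) triples.
# Real systems use multivariate GMMs over acoustic feature vectors.

def gmm_logpdf(x, gmm):
    """Log-density of a 1-D GMM at x, computed with log-sum-exp."""
    terms = [
        math.log(w) - 0.5 * math.log(2 * math.pi * s * s) - (x - m) ** 2 / (2 * s * s)
        for w, m, s in gmm
    ]
    top = max(terms)
    return top + math.log(sum(math.exp(t - top) for t in terms))

def gmm_sample(gmm, n, rng):
    """Draw n samples: pick a component by weight, then sample its Gaussian."""
    weights = [w for w, _, _ in gmm]
    samples = []
    for _ in range(n):
        _, m, s = rng.choices(gmm, weights=weights, k=1)[0]
        samples.append(rng.gauss(m, s))
    return samples

def avg_log_likelihood(frames, model):
    """Traditional one-directional score: mean log-likelihood of test frames."""
    return sum(gmm_logpdf(x, model) for x in frames) / len(frames)

def cross_entropy(p, q, n=20000, seed=0):
    """Monte Carlo estimate of H(p, q) = -E_{x~p}[log q(x)]."""
    rng = random.Random(seed)
    return -sum(gmm_logpdf(x, q) for x in gmm_sample(p, n, rng)) / n

def symmetric_ce_score(test_gmm, speaker_gmm, n=20000, seed=0):
    """Hypothetical symmetric score: negated mean of both directed cross
    entropies, so a higher value means a closer match."""
    forward = cross_entropy(test_gmm, speaker_gmm, n, seed)
    backward = cross_entropy(speaker_gmm, test_gmm, n, seed + 1)
    return -0.5 * (forward + backward)

if __name__ == "__main__":
    speaker = [(0.5, -1.0, 1.0), (0.5, 1.0, 1.0)]
    impostor = [(0.5, 2.0, 1.0), (0.5, 4.0, 1.0)]
    frames = gmm_sample(speaker, 500, random.Random(42))
    print("avg log-likelihood:", avg_log_likelihood(frames, speaker))
    print("symmetric CE, same speaker:", symmetric_ce_score(speaker, speaker))
    print("symmetric CE, impostor:", symmetric_ce_score(speaker, impostor))
```

In this sketch the average log-likelihood evaluates test frames under the speaker model only, whereas the symmetric score also evaluates samples from the speaker model under a model fitted to the test utterance. Score normalisation such as ZT-norm would be applied to the resulting scores afterwards and is not shown here.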


Pages: 419-426

Page count: 8
