Adaptive Individual Background Model for Speaker Verification

被引：0

作者：

Bar-Yosef, Yossi ^{[1
]}

Bistritz, Yuval ^{[1
]}

机构：

[1] Tel Aviv Univ, Dept Elect Engn, IL-69978 Tel Aviv, Israel

来源：

INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009年

关键词：

Model adaptation; Gaussian Mixture Models; Kullback-Leibler divergence; speaker verification; cohort selection; score normalization;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Most techniques for speaker verification today use Gaussian Mixture Models (GMMs) and make the decision by comparing the likelihood of the speaker model to the likelihood of a universal background model (UBM). The paper proposes to replace the UBM by an individual background model (IBM) that is generated for each speaker. The IBM is created using the K-nearest cohort models and the UBM by a simple new adaptation algorithm. The new GMM-IBM speaker verification system can also be combined with various score normalization techniques that have been proposed to increase the robustness of the GMM-UBM system. Comparative experiments were held on the NIST-2004-SRE database with a plain system setting (without score normalization) and also with the combination of adaptive test normalization (ATnorm). Results indicated that the proposed GMM-IBM system outperforms a comparable GMM-UBM system.

引用

页码：1279 / 1282

页数：4

共 12 条

[1]

[Anonymous], SPEECH PROCESSING TR

[2] Score normalization for text-independent speaker verification systems [J].

Auckenthaler, R ;

Carey, M ;

Lloyd-Thomas, H .

DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) :42-54

[3] A tutorial on text-independent speaker verification [J].

Bimbot, F ;

Bonastre, JF ;

Fredouille, C ;

Gravier, G ;

Magrin-Chagnolleau, I ;

Meignier, S ;

Merlin, T ;

Ortega-García, J ;

Petrovska-Delacrétaz, D ;

Reynolds, DA .

EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (04) :430-451

[4]

Goldberger J., 2005, INTERSPEECH 2005-Eurospeech, 9th European Conference on Speech Communication and Technology, Lisbon, Portugal, September 4-8, 2005, P1985

[5] Simplifying mixture models using the Unscented Transform [J].

Goldberger, Jacob ;

Greenspan, Hayit ;

Dreyfuss, Jeremie .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (08) :1496-1502

[6]

*LING DAT CONS, SPIDRE DOC FIL

[7]

*NIST, NIST YEAR 2004 SPEAK

[8]

Pelecanos J., 2001, Proc. Speaker Odyssey, V13, P1

[9] Speaker verification using speaker- and test-dependent fast score normalization [J].

Ramos-Castro, Daniel ;

Fierrez-Aguilar, Julian ;

Gonzalez-Rodriguez, Joaquin ;

Ortega-Garcia, Javier .

PATTERN RECOGNITION LETTERS, 2007, 28 (01) :90-98

[10] SPEAKER IDENTIFICATION AND VERIFICATION USING GAUSSIAN MIXTURE SPEAKER MODELS [J].

REYNOLDS, DA .

SPEECH COMMUNICATION, 1995, 17 (1-2) :91-108

← 1 2 →