Adaptive Individual Background Model for Speaker Verification

Cited by: 0
Authors
Bar-Yosef, Yossi [1]
Bistritz, Yuval [1]
Affiliations
[1] Tel Aviv Univ, Dept Elect Engn, IL-69978 Tel Aviv, Israel
Source
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009
Keywords
Model adaptation; Gaussian Mixture Models; Kullback-Leibler divergence; speaker verification; cohort selection; score normalization;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Most current speaker verification techniques use Gaussian Mixture Models (GMMs) and make the decision by comparing the likelihood of the speaker model with the likelihood of a universal background model (UBM). The paper proposes replacing the UBM with an individual background model (IBM) that is generated for each speaker. The IBM is created from the K-nearest cohort models and the UBM by a simple new adaptation algorithm. The new GMM-IBM speaker verification system can also be combined with the various score normalization techniques that have been proposed to increase the robustness of the GMM-UBM system. Comparative experiments were conducted on the NIST-2004-SRE database with a plain system setting (without score normalization) and also in combination with adaptive test normalization (ATnorm). The results indicate that the proposed GMM-IBM system outperforms a comparable GMM-UBM system.
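The likelihood comparison described in the abstract can be illustrated with a minimal Python sketch. It assumes diagonal-covariance GMMs and an average per-frame log-likelihood ratio compared against a threshold; the function names, the tuple layout `(weights, means, variances)`, and the threshold value are hypothetical and are not taken from the paper. The background model passed in could be a UBM or a per-speaker IBM; the paper's adaptation algorithm for building the IBM is not reproduced here.

```python
import math

def gmm_avg_loglik(frames, weights, means, variances):
    """Average per-frame log-likelihood under a diagonal-covariance GMM.

    frames: list of feature vectors; weights: one weight per component;
    means, variances: one vector per component (hypothetical layout).
    """
    total = 0.0
    for x in frames:
        log_terms = []
        for w, mu, var in zip(weights, means, variances):
            # Log density of one diagonal Gaussian component at x.
            log_comp = sum(
                -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
                for xi, m, v in zip(x, mu, var)
            )
            log_terms.append(math.log(w) + log_comp)
        # Log-sum-exp over components for numerical stability.
        peak = max(log_terms)
        total += peak + math.log(sum(math.exp(t - peak) for t in log_terms))
    return total / len(frames)

def llr_score(frames, speaker, background):
    """Log-likelihood ratio of the speaker model against a background model."""
    return gmm_avg_loglik(frames, *speaker) - gmm_avg_loglik(frames, *background)

def accept(frames, speaker, background, threshold=0.0):
    """Accept the claimed identity when the score exceeds an assumed threshold."""
    return llr_score(frames, speaker, background) > threshold
```

In this sketch, swapping the UBM for an IBM changes only the `background` argument, which is why score normalization applied on top of `llr_score` would carry over unchanged.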
Pages: 1279-1282
Page count: 4