EFFICIENT SPEAKER VERIFICATION SYSTEM USING SPEAKER MODEL CLUSTERING FOR T AND Z NORMALIZATIONS

被引:0
作者
Ravulakollu, Kiran [1 ]
Apsingekar, Vijendra Raj [1 ]
De Leon, Phillip L. [1 ]
机构
[1] New Mexico State Univ, Klipsch Sch Elect Eng, Las Cruces, NM 88003 USA
来源
42ND ANNUAL 2008 IEEE INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY, PROCEEDINGS | 2008年
关键词
Speaker recognition; Clustering methods;
D O I
暂无
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
In speaker verification (SV) systems based on Gaussian Mixture Model-Universal Background Model (GMM-UBM), normalization is an important component in the decision stage. Many normalization methods including the T- and Z-norms, have been proposed and investigated and these have contributed to state-of-the-art SV systems which have extremely low equal-error rates (EERs). In this paper, we consider application of both T- and Z-norms to a carefully selected subset of speakers using a data driven approach which can significantly reduce computation resulting in faster SV decisions and lower EER. Unfortunately, selection of the subset is critical and must be representative of the entire speaker model space otherwise error rates will increase. In order to properly select the subset of speakers for the normalizations, we propose a novel method which first clusters the speaker models using the K-means algorithm and the Kullback-Leibler (KL) divergence and then selects a set of speakers within the cluster. We evaluate the approach using both the TIMIT, NTIMIT and NIST-2002 corpora and compare against standard T- and Z-normalizations.
引用
收藏
页码:56 / 62
页数:7
相关论文
共 20 条
  • [1] [Anonymous], 1988, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
  • [2] [Anonymous], 2005, P INTERSPEECH
  • [3] Score normalization for text-independent speaker verification systems
    Auckenthaler, R
    Carey, M
    Lloyd-Thomas, H
    [J]. DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) : 42 - 54
  • [4] Ben M, 2002, INT CONF ACOUST SPEE, P689
  • [5] A tutorial on text-independent speaker verification
    Bimbot, F
    Bonastre, JF
    Fredouille, C
    Gravier, G
    Magrin-Chagnolleau, I
    Meignier, S
    Merlin, T
    Ortega-García, J
    Petrovska-Delacrétaz, D
    Reynolds, DA
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (04) : 430 - 451
  • [6] Towards better making a decision in speaker verification
    Chen, Ke
    [J]. 2003, Elsevier Ltd (36) : 329 - 346
  • [7] DELEON PL, 2007, P IEEE BIOM S
  • [8] Bayesian adaptation for user-dependent multimodal biometric authentication
    Fierrez-Aguilar, J
    Garcia-Romero, D
    Ortega-Garcia, J
    Gonzalez-Rodriguez, J
    [J]. PATTERN RECOGNITION, 2005, 38 (08) : 1317 - 1319
  • [9] Fierrez-Aguilar J, 2004, 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS, P617
  • [10] LINDBERG J, 1998, P RLA2C, P89