Probabilistic approach using joint clean and noisy i-vectors modeling for speaker recognition

Cited by: 3
Authors
Ben Kheder, Waad [1]
Matrouf, Driss [1]
Ajili, Moez [1]
Bonastre, Jean-Francois [1]
Affiliations
[1] Univ Avignon, LIA, Avignon, France
Source
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016
Keywords
speaker verification; i-vector; additive noise; joint modeling
DOI
10.21437/Interspeech.2016-1292
Chinese Library Classification number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
Additive noise is one of the main challenges for automatic speaker recognition, and several compensation techniques have been proposed to deal with it. In this paper, we present a new "data-driven" denoising technique operating in the i-vector space, based on joint modeling of clean and noisy i-vectors. The joint distribution is estimated from a large set of i-vector pairs (clean i-vectors and their artificially generated noisy versions), then integrated into an MMSE estimator in the test phase to compute a "cleaned-up" version of noisy test i-vectors. We show that this algorithm achieves up to 80% relative improvement in EER. We also present a version of the proposed algorithm that can be used to compensate for multiple "unseen" noises. We test this technique on the recently published SITW database and show a significant gain over the baseline system.
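The estimation step described in the abstract can be illustrated with a minimal sketch. It assumes that the joint distribution of clean and noisy i-vectors is a single full-covariance Gaussian, which the abstract does not state explicitly; under that assumption the MMSE estimate of the clean i-vector x given a noisy i-vector y is the conditional mean E[x|y] = mu_x + Sigma_xy Sigma_yy^{-1} (y - mu_y). The function names `fit_joint_gaussian` and `mmse_denoise` are hypothetical and are not taken from the paper.

```python
import numpy as np


def fit_joint_gaussian(clean, noisy):
    """Estimate the joint Gaussian of paired (clean, noisy) vectors.

    clean, noisy: arrays of shape (n_pairs, dim), row i of each forming a pair.
    Returns mu_x, mu_y, Sigma_xy, Sigma_yy (assumed single-Gaussian model).
    """
    dim = clean.shape[1]
    joint = np.hstack([clean, noisy])      # rows are [x ; y]
    mu = joint.mean(axis=0)
    sigma = np.cov(joint, rowvar=False)    # (2*dim, 2*dim) joint covariance
    return mu[:dim], mu[dim:], sigma[:dim, dim:], sigma[dim:, dim:]


def mmse_denoise(y, mu_x, mu_y, sigma_xy, sigma_yy):
    """Conditional mean E[x | y] under the joint Gaussian assumption.

    y may be a single vector of shape (dim,) or a batch of shape (n, dim).
    """
    # Sigma_yy^{-1} Sigma_yx, computed with a linear solve instead of an inverse
    weights = np.linalg.solve(sigma_yy, sigma_xy.T)
    return mu_x + (y - mu_y) @ weights


# Synthetic stand-in for i-vector pairs (illustrative data only, not real i-vectors)
rng = np.random.default_rng(0)
clean_train = rng.standard_normal((5000, 4))
noisy_train = clean_train + 0.5 * rng.standard_normal((5000, 4))

params = fit_joint_gaussian(clean_train, noisy_train)
cleaned = mmse_denoise(noisy_train, *params)
```

In this sketch the training pairs play the role of the clean i-vectors and their artificially corrupted versions; at test time only `mmse_denoise` would be applied to a noisy i-vector, using the parameters estimated beforehand.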
Pages: 3638-3642
Page count: 5