Probabilistic approach using joint clean and noisy i-vectors modeling for speaker recognition

被引：3

作者：

Ben Kheder, Waad ^{[1
]}

Matrouf, Driss ^{[1
]}

Ajili, Moez ^{[1
]}

Bonastre, Jean-Francois ^{[1
]}

机构：

[1] Univ Avignon, LIA, Avignon, France

来源：

17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016年

关键词：

speaker verification; i-vector; additive noise; joint modeling;

D O I：

10.21437/Interspeech.2016-1292

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Additive noise is one of the main challenges for automatic speaker recognition and several compensation techniques have been proposed to deal with this problem. In this paper, we present a new "data-driven" denoising technique operating in the i-vector space based on a joint modeling of clean and noisy i-vectors. The joint distribution is estimated using a large set of i-vectors pairs (clean i-vectors and their noisy versions generated artificially) then integrated in an MMSE estimator in the test phase to compute a "cleaned-up" version of noisy test i-vectors. We show that this algorithm achieves up to 80% of relative improvement in EER. We also present a version of the proposed algorithm that can be used to compensate multiple "unseen" noises. We test this technique on the recently published SITW database and show a significant gain compared to the baseline system performance.

引用

页码：3638 / 3642

页数：5

共 29 条

[1] Stereo-Based Stochastic Mapping for Robust Speech Recognition
Afify, Mohamed
Cui, Xiaodong
Gao, Yuqing
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (07): : 1325 - 1334
[2] [Anonymous], P 2007 INT
[3] [Anonymous], 2012, Proc. The Speaker and Language Recognition Workshop
[4] [Anonymous], 2011, INTERSPEECH
[5] [Anonymous], INTERSPEECH 2015
[6] [Anonymous], 2000, P ANN C INT SPEECH C
[7] [Anonymous], 2014, ICASSP
[8] [Anonymous], ICASSP
[9] Ben Kheder W, 2015, INT CONF ACOUST SPEE, P4190, DOI 10.1109/ICASSP.2015.7178760
[10] Robust Speaker Recognition Using MAP Estimation of Additive Noise in i-vectors Space
Ben Kheder, Waad
Matrouf, Driss
Bousquet, Pierre-Michel
Bonastre, Jean-Francois
Ajili, Moez
[J]. STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2014, 2014, 8791 : 97 - 107

← 1 2 3 →