E-VECTORS: JFA AND I-VECTORS REVISITED

被引：0

作者：

Cumani, Sandro ^{[1
]}

Laface, Pietro ^{[1
]}

机构：

[1] Politecn Torino, Turin, Italy

来源：

2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2017年

关键词：

Speaker Recognition; eigenvoice; Joint Factor Analysis; i-vectors; e-vectors; SPEAKER; VARIABILITY;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Systems based on i-vectors represent the current state-of-the-art in text-independent speaker recognition. In this work we introduce a new compact representation of a speech segment, similar to the speaker factors of Joint Factor Analysis (JFA) and to i-vectors, that we call "e-vector". The e-vectors derive their name from the eigen-voice space of the JFA speaker modeling approach. Our working hypothesis is that JFA estimates a more informative speaker subspace than the "total variability" i-vector subspace, because the latter is obtained by considering each training segment as belonging to a different speaker. We propose, thus, a simple "i-vector style" modeling and training technique that exploits this observation, and estimates a more accurate subspace with respect to the one provided by the classical i-vector approach, as confirmed by the results of a set of tests performed on the extended core NIST 2012 Speaker Recognition Evaluation dataset. Simply replacing the i-vectors with e-vectors we get approximately 10% average improvement ofthe C-primary cost function, using different systems and classifiers. These performance gains come without any additional memory or computational costs with respect to the standard i-vector systems.

引用

页码：5435 / 5439

页数：5

共 25 条

[1] [Anonymous], NIST YEAR 2008 2010
[2] [Anonymous], 2011, INTERSPEECH
[3] [Anonymous], P EUROSPEECH 03
[4] [Anonymous], DIGITAL SIGNAL PROCE
[5] [Anonymous], 2014, Odyssey
[6] [Anonymous], 2000, INTERSPEECH
[7] Cumani S, 2015, 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, P200
[8] Large-Scale Training of Pairwise Support Vector Machines for Speaker Recognition
Cumani, Sandro
Laface, Pietro
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (11) : 1590 - 1600
[9] Pairwise Discriminative Speaker Verification in the I-Vector Space
Cumani, Sandro
Bruemmer, Niko
Burget, Lukas
Laface, Pietro
Plchot, Oldrich
Vasilakakis, Vasileios
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (06): : 1217 - 1227
[10] Cumani Sandro., 2016, Proc. Odyssey, P39

← 1 2 3 →