E-VECTORS: JFA AND I-VECTORS REVISITED

被引:0
作者
Cumani, Sandro [1 ]
Laface, Pietro [1 ]
机构
[1] Politecn Torino, Turin, Italy
来源
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2017年
关键词
Speaker Recognition; eigenvoice; Joint Factor Analysis; i-vectors; e-vectors; SPEAKER; VARIABILITY;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Systems based on i-vectors represent the current state-of-the-art in text-independent speaker recognition. In this work we introduce a new compact representation of a speech segment, similar to the speaker factors of Joint Factor Analysis (JFA) and to i-vectors, that we call "e-vector". The e-vectors derive their name from the eigen-voice space of the JFA speaker modeling approach. Our working hypothesis is that JFA estimates a more informative speaker subspace than the "total variability" i-vector subspace, because the latter is obtained by considering each training segment as belonging to a different speaker. We propose, thus, a simple "i-vector style" modeling and training technique that exploits this observation, and estimates a more accurate subspace with respect to the one provided by the classical i-vector approach, as confirmed by the results of a set of tests performed on the extended core NIST 2012 Speaker Recognition Evaluation dataset. Simply replacing the i-vectors with e-vectors we get approximately 10% average improvement ofthe C-primary cost function, using different systems and classifiers. These performance gains come without any additional memory or computational costs with respect to the standard i-vector systems.
引用
收藏
页码:5435 / 5439
页数:5
相关论文
共 25 条
  • [1] [Anonymous], NIST YEAR 2008 2010
  • [2] [Anonymous], 2011, INTERSPEECH
  • [3] [Anonymous], P EUROSPEECH 03
  • [4] [Anonymous], DIGITAL SIGNAL PROCE
  • [5] [Anonymous], 2014, Odyssey
  • [6] [Anonymous], 2000, INTERSPEECH
  • [7] Cumani S, 2015, 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, P200
  • [8] Large-Scale Training of Pairwise Support Vector Machines for Speaker Recognition
    Cumani, Sandro
    Laface, Pietro
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (11) : 1590 - 1600
  • [9] Pairwise Discriminative Speaker Verification in the I-Vector Space
    Cumani, Sandro
    Bruemmer, Niko
    Burget, Lukas
    Laface, Pietro
    Plchot, Oldrich
    Vasilakakis, Vasileios
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (06): : 1217 - 1227
  • [10] Cumani Sandro., 2016, Proc. Odyssey, P39