Nuance - Politecnico di Torino's 2016 NIST Speaker Recognition Evaluation System

被引:6
作者
Colibro, Daniele [1 ]
Vair, Claudio [1 ]
Dalmasso, Emanuele [1 ]
Farrell, Kevin [1 ]
Karvitsky, Gennady [1 ]
Cumani, Sandro [2 ]
Laface, Pietro [2 ]
机构
[1] Nuance Commun Inc, Burlington, MA 01803 USA
[2] Politecn Torino, Turin, Italy
来源
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION | 2017年
关键词
Speaker Recognition; i-vector; PLDA; PSVM; AS-Norm; Top-Norm;
D O I
10.21437/Interspeech.2017-797
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes the Nuance-Politecnico di Torino (NPT) speaker recognition system submitted to the NIST SRE16 evaluation campaign. Included are the results of post evaluation tests, focusing on the analysis of the performance of generative and discriminative classifiers, and of score normalization. The submitted system combines the results of four GMM-IVector models. two DNN-IVector models and a GMM-SVM acoustic system. Each system exploits acoustic front-end parameters that differ by feature type and dimension. We analyze the main components of our submission, which contributed to obtaining 8.1% EER and 0.532 actual. C-primary in the challenging SRE16 Fixed condition.
引用
收藏
页码:1338 / 1342
页数:5
相关论文
共 17 条
  • [11] RASTA Processing of Speech
    Hermansky, Hynek
    Morgan, Nelson
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (04): : 578 - 589
  • [12] Jones K., 2017, P INTERSPEECH
  • [13] Jorrin-Prieto J., 2016, ODYSSEY, P393
  • [14] Kenny Patrick, 2010, SPEAK LANG REC WORKS
  • [15] Pelecanos J., 2001, Proc. Speaker Odyssey, V13, P1
  • [16] Sturim DE, 2005, INT CONF ACOUST SPEE, P741
  • [17] Zigel Y., 2006, OD 2006 SPEAK LANG R