Curriculum Learning based Probabilistic Linear Discriminant Analysis for Noise Robust Speaker Recognition

被引:6
作者
Ranjan, Shivesh [1 ]
Misra, Abhinav [1 ]
Hansen, John H. L. [1 ]
机构
[1] Univ Texas Dallas, CRSS, Richardson, TX 75083 USA
来源
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION | 2017年
关键词
curriculum learning; speaker recognition; PLDA; noise robust; DARPA RATS;
D O I
10.21437/Interspeech.2017-1199
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study introduces a novel Curriculum Learning based Probabilistic Linear Discriminant Analysis (CL-PLDA) algorithm for improving speaker recognition in noisy conditions. CL-PLDA operates by initializing the training EM algorithm with cleaner data (easy examples), and successively adds noisier data (difficult examples) as the training progresses. This curriculum learning based approach guides the parameters of CL-PLDA to better local minima compared to regular PLDA. We test CL-PLDA on speaker verification task of the severely noisy and degraded DARPA RATS data, and show it to significantly outperform regular PLDA across test-sets of varying duration.
引用
收藏
页码:3717 / 3721
页数:5
相关论文
共 35 条
  • [1] [Anonymous], 2011, INTERSPEECH
  • [2] [Anonymous], IEEE ICASSP 2016
  • [3] [Anonymous], P INTERSPEECH
  • [4] [Anonymous], 2011, WORKSH AUT SPEECH RE
  • [5] [Anonymous], 2015, ARXIV151202595
  • [6] [Anonymous], ODYSSEY
  • [7] [Anonymous], 2010, ODYSSEY
  • [8] [Anonymous], P NIST 2011 WORKSH
  • [9] Probabilistic approach using joint clean and noisy i-vectors modeling for speaker recognition
    Ben Kheder, Waad
    Matrouf, Driss
    Ajili, Moez
    Bonastre, Jean-Francois
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3638 - 3642
  • [10] Ben Kheder W, 2015, INT CONF ACOUST SPEE, P4190, DOI 10.1109/ICASSP.2015.7178760