Fusion of SNR-Dependent PLDA Models for Noise Robust Speaker Verification

被引:0
|
作者
Pang, Xiaomin [1 ]
Mak, Man-Wai [1 ]
机构
[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Hong Kong, Hong Kong, Peoples R China
来源
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP) | 2014年
关键词
Speaker verification; i-vectors; probabilistic LDA; NIST; 2012; SRE; noise robustness; ACOUSTIC FACTOR-ANALYSIS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The i-vector representation and probabilistic linear discriminant analysis (PLDA) have shown state-of-the-art performance in many speaker verification systems. However, in real-world environments, additive and convolutive noise cause mismatches between training and recognition conditions, degrading the performance. In this paper, a fusion system that combines a multi-condition PLDA model and a mixture of SNR-dependent PLDA models is proposed to make the verification system noise robust. The SNR of test utterances is used to determine the best SNR-dependent PLDA model to score against the target-speaker's i-vectors. The performance of the fusion system is demonstrated on NIST 2012 SRE. Results show that the SNR-dependent PLDA models can reduce EER and that the fusion system is more robust than the conventional i-vector/PLDA systems under noisy conditions. It is also found that the SNR-dependent PLDA models are insensitive to Z-norm parameters.
引用
收藏
页码:619 / 623
页数:5
相关论文
共 50 条
  • [1] Noise robust speaker verification via the fusion of SNR-independent and SNR-dependent PLDA
    Pang, Xiaomin
    Mak, Man-Wai
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2015, 18 (04) : 633 - 648
  • [2] SNR-Invariant PLDA Modeling for Robust Speaker Verification
    Li, Na
    Mak, Man-Wai
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2317 - 2321
  • [3] SNR-Invariant PLDA Modeling in Nonparametric Subspace for Robust Speaker Verification
    Li, Na
    Mak, Man-Wai
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (10) : 1648 - 1659
  • [4] DNN-Driven Mixture of PLDA for Robust Speaker Verification
    Li, Na
    Mak, Man-Wai
    Chien, Jen-Tzung
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (06) : 1371 - 1383
  • [5] SNR-Invariant Multitask Deep Neural Networks for Robust Speaker Verification
    Yao, Qi
    Mak, Man-Wai
    IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (11) : 1670 - 1674
  • [6] Duration Dependent Covariance Regularization in PLDA Modeling for Speaker Verification
    Cai, Weicheng
    Li, Ming
    Li, Lin
    Hong, Qingyang
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1027 - 1031
  • [7] Maximum Likelihood Acoustic Factor Analysis Models for Robust Speaker Verification in Noise
    Hasan, Taufiq
    Hansen, John H. L.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02) : 381 - 391
  • [8] Supervized Mixture of PLDA Models for Cross-Channel Speaker Verification
    Simonchik, Konstantin
    Pekhovsky, Timur
    Shulipa, Andrey
    Afanasyev, Anton
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1682 - 1685
  • [9] DNN FEATURE COMPENSATION FOR NOISE ROBUST SPEAKER VERIFICATION
    Du, Steven
    Xiao, Xiong
    Chng, Eng Siong
    2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 871 - 875
  • [10] Noise Robust Speaker Verification with Delta Cepstrum Normalization
    Kanda, Naoyuki
    Takeda, Ryu
    Obuchi, Yasunari
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3111 - 3115