Fusion of SNR-Dependent PLDA Models for Noise Robust Speaker Verification

被引:0
|
作者
Pang, Xiaomin [1 ]
Mak, Man-Wai [1 ]
机构
[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Hong Kong, Hong Kong, Peoples R China
来源
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP) | 2014年
关键词
Speaker verification; i-vectors; probabilistic LDA; NIST; 2012; SRE; noise robustness; ACOUSTIC FACTOR-ANALYSIS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The i-vector representation and probabilistic linear discriminant analysis (PLDA) have shown state-of-the-art performance in many speaker verification systems. However, in real-world environments, additive and convolutive noise cause mismatches between training and recognition conditions, degrading the performance. In this paper, a fusion system that combines a multi-condition PLDA model and a mixture of SNR-dependent PLDA models is proposed to make the verification system noise robust. The SNR of test utterances is used to determine the best SNR-dependent PLDA model to score against the target-speaker's i-vectors. The performance of the fusion system is demonstrated on NIST 2012 SRE. Results show that the SNR-dependent PLDA models can reduce EER and that the fusion system is more robust than the conventional i-vector/PLDA systems under noisy conditions. It is also found that the SNR-dependent PLDA models are insensitive to Z-norm parameters.
引用
收藏
页码:619 / 623
页数:5
相关论文
共 50 条
  • [21] I-Vector DNN Scoring and Calibration for Noise Robust Speaker Verification
    Tan, Zhili
    Mak, Man-Wai
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1562 - 1566
  • [22] Adversarial Network Bottleneck Features for Noise Robust Speaker Verification
    Yu, Hong
    Tan, Zheng-Hua
    Ma, Zhanyu
    Guo, Jun
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1492 - 1496
  • [23] NONNEGATIVE MATRIX FACTORIZATION BASED NOISE ROBUST SPEAKER VERIFICATION
    Liu, S. H.
    Zou, Y. X.
    Ning, H. K.
    2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 35 - 39
  • [24] PHONETICALLY-CONSTRAINED PLDA MODELING FOR TEXT-DEPENDENT SPEAKER VERIFICATION WITH MULTIPLE SHORT UTTERANCES
    Larcher, Anthony
    Lee, Kong Aik
    Ma, Bin
    Li, Haizhou
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7673 - 7677
  • [25] Acoustic Factor Analysis based Universal Background Model for Robust Speaker Verification in Noise
    Hasan, Taufiq
    Hansen, John H. L.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3126 - 3130
  • [26] DNN-Based Score Calibration With Multitask Learning for Noise Robust Speaker Verification
    Tan, Zhili
    Mak, Man-Wai
    Mak, Brian Kan-Wing
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (04) : 700 - 712
  • [27] Noise Robust Speaker Verification Using Sub-Band Weighting
    Kim, Sungtak
    Ji, Mikyong
    Kim, Hoirin
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2009, 28 (03): : 279 - 284
  • [28] Robust Features Fusion for Text Independent Speaker Verification Enhancement in Noisy Environments
    Mohammadi, Mohsen
    Mohammadi, Hamid Reza Sadegh
    2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 1863 - 1868
  • [29] Robust model for speaker verification against session-dependent utterance variation
    Matsui, T
    Aikawa, K
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (04): : 712 - 718
  • [30] Conditional Generative Adversarial Networks for Speech Enhancement and Noise-Robust Speaker Verification
    Michelsanti, Daniel
    Tan, Zheng-Hua
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2008 - 2012