Fusion of SNR-Dependent PLDA Models for Noise Robust Speaker Verification

Cited by: 0

Authors:

Pang, Xiaomin [1]

Mak, Man-Wai [1]

Affiliations:

[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Hong Kong, Hong Kong, Peoples R China

Source:

2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP) | 2014

Keywords:

Speaker verification; i-vectors; probabilistic LDA; NIST 2012 SRE; noise robustness; ACOUSTIC FACTOR-ANALYSIS

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The i-vector representation and probabilistic linear discriminant analysis (PLDA) have shown state-of-the-art performance in many speaker verification systems. However, in real-world environments, additive and convolutive noise cause mismatches between training and recognition conditions, degrading the performance. In this paper, a fusion system that combines a multi-condition PLDA model and a mixture of SNR-dependent PLDA models is proposed to make the verification system noise robust. The SNR of test utterances is used to determine the best SNR-dependent PLDA model to score against the target-speaker's i-vectors. The performance of the fusion system is demonstrated on NIST 2012 SRE. Results show that the SNR-dependent PLDA models can reduce EER and that the fusion system is more robust than the conventional i-vector/PLDA systems under noisy conditions. It is also found that the SNR-dependent PLDA models are insensitive to Z-norm parameters.
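The model-selection and fusion idea in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the SNR group boundaries, the linear fusion with a fixed weight, and the names `select_model` and `fused_score` are hypothetical, and the scorers are treated as opaque callables rather than real PLDA models.

```python
from bisect import bisect_right
from typing import Callable, Sequence

# A scorer takes (target i-vector, test i-vector) and returns a score.
# In a real system this would be a trained PLDA model; here it is any callable.
Scorer = Callable[[Sequence[float], Sequence[float]], float]


def select_model(snr_db: float, boundaries: Sequence[float]) -> int:
    """Return the index of the SNR group that snr_db falls into.

    boundaries must be sorted in ascending order; N boundaries define N + 1 groups.
    The boundary values themselves are hypothetical.
    """
    return bisect_right(boundaries, snr_db)


def fused_score(
    target_ivec: Sequence[float],
    test_ivec: Sequence[float],
    test_snr_db: float,
    multi_condition: Scorer,
    snr_dependent: Sequence[Scorer],
    boundaries: Sequence[float],
    weight: float = 0.5,
) -> float:
    """Fuse a multi-condition score with the score of the SNR-matched model.

    Assumption: a fixed-weight linear fusion; the paper may fuse differently.
    """
    if len(snr_dependent) != len(boundaries) + 1:
        raise ValueError("need one SNR-dependent scorer per SNR group")
    idx = select_model(test_snr_db, boundaries)
    s_dep = snr_dependent[idx](target_ivec, test_ivec)
    s_mc = multi_condition(target_ivec, test_ivec)
    return weight * s_mc + (1.0 - weight) * s_dep
```

With hypothetical boundaries such as `[3.0, 10.5]` dB, a test utterance measured at 15 dB would be scored by the third SNR-dependent model, and that score would then be averaged with the multi-condition score.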


Pages: 619 - 623

Number of pages: 5

50 items in total

[1] Noise robust speaker verification via the fusion of SNR-independent and SNR-dependent PLDA
Pang, Xiaomin
Mak, Man-Wai
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2015, 18 (04) : 633 - 648
[2] SNR-Invariant PLDA Modeling for Robust Speaker Verification
Li, Na
Mak, Man-Wai
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2317 - 2321
[3] SNR-Invariant PLDA Modeling in Nonparametric Subspace for Robust Speaker Verification
Li, Na
Mak, Man-Wai
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (10) : 1648 - 1659
[4] DNN-Driven Mixture of PLDA for Robust Speaker Verification
Li, Na
Mak, Man-Wai
Chien, Jen-Tzung
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (06) : 1371 - 1383
[5] SNR-Invariant Multitask Deep Neural Networks for Robust Speaker Verification
Yao, Qi
Mak, Man-Wai
IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (11) : 1670 - 1674
[6] Duration Dependent Covariance Regularization in PLDA Modeling for Speaker Verification
Cai, Weicheng
Li, Ming
Li, Lin
Hong, Qingyang
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1027 - 1031
[7] Maximum Likelihood Acoustic Factor Analysis Models for Robust Speaker Verification in Noise
Hasan, Taufiq
Hansen, John H. L.
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02) : 381 - 391
[8] Supervized Mixture of PLDA Models for Cross-Channel Speaker Verification
Simonchik, Konstantin
Pekhovsky, Timur
Shulipa, Andrey
Afanasyev, Anton
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1682 - 1685
[9] DNN FEATURE COMPENSATION FOR NOISE ROBUST SPEAKER VERIFICATION
Du, Steven
Xiao, Xiong
Chng, Eng Siong
2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 871 - 875
[10] Noise Robust Speaker Verification with Delta Cepstrum Normalization
Kanda, Naoyuki
Takeda, Ryu
Obuchi, Yasunari
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3111 - 3115
