Fusion of SNR-Dependent PLDA Models for Noise Robust Speaker Verification

被引：0

作者：

Pang, Xiaomin ^{[1
]}

Mak, Man-Wai ^{[1
]}

机构：

[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Hong Kong, Hong Kong, Peoples R China

来源：

2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP) | 2014年

关键词：

Speaker verification; i-vectors; probabilistic LDA; NIST; 2012; SRE; noise robustness; ACOUSTIC FACTOR-ANALYSIS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The i-vector representation and probabilistic linear discriminant analysis (PLDA) have shown state-of-the-art performance in many speaker verification systems. However, in real-world environments, additive and convolutive noise cause mismatches between training and recognition conditions, degrading the performance. In this paper, a fusion system that combines a multi-condition PLDA model and a mixture of SNR-dependent PLDA models is proposed to make the verification system noise robust. The SNR of test utterances is used to determine the best SNR-dependent PLDA model to score against the target-speaker's i-vectors. The performance of the fusion system is demonstrated on NIST 2012 SRE. Results show that the SNR-dependent PLDA models can reduce EER and that the fusion system is more robust than the conventional i-vector/PLDA systems under noisy conditions. It is also found that the SNR-dependent PLDA models are insensitive to Z-norm parameters.

引用

页码：619 / 623

页数：5

共 50 条

[21] I-Vector DNN Scoring and Calibration for Noise Robust Speaker Verification
Tan, Zhili
Mak, Man-Wai
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1562 - 1566
[22] Adversarial Network Bottleneck Features for Noise Robust Speaker Verification
Yu, Hong
Tan, Zheng-Hua
Ma, Zhanyu
Guo, Jun
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1492 - 1496
[23] NONNEGATIVE MATRIX FACTORIZATION BASED NOISE ROBUST SPEAKER VERIFICATION
Liu, S. H.
Zou, Y. X.
Ning, H. K.
2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 35 - 39
[24] PHONETICALLY-CONSTRAINED PLDA MODELING FOR TEXT-DEPENDENT SPEAKER VERIFICATION WITH MULTIPLE SHORT UTTERANCES
Larcher, Anthony
Lee, Kong Aik
Ma, Bin
Li, Haizhou
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7673 - 7677
[25] Acoustic Factor Analysis based Universal Background Model for Robust Speaker Verification in Noise
Hasan, Taufiq
Hansen, John H. L.
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3126 - 3130
[26] DNN-Based Score Calibration With Multitask Learning for Noise Robust Speaker Verification
Tan, Zhili
Mak, Man-Wai
Mak, Brian Kan-Wing
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (04) : 700 - 712
[27] Noise Robust Speaker Verification Using Sub-Band Weighting
Kim, Sungtak
Ji, Mikyong
Kim, Hoirin
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2009, 28 (03): : 279 - 284
[28] Robust Features Fusion for Text Independent Speaker Verification Enhancement in Noisy Environments
Mohammadi, Mohsen
Mohammadi, Hamid Reza Sadegh
2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 1863 - 1868
[29] Robust model for speaker verification against session-dependent utterance variation
Matsui, T
Aikawa, K
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (04): : 712 - 718
[30] Conditional Generative Adversarial Networks for Speech Enhancement and Noise-Robust Speaker Verification
Michelsanti, Daniel
Tan, Zheng-Hua
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2008 - 2012

← 1 2 3 4 5 →