Fusion of SNR-Dependent PLDA Models for Noise Robust Speaker Verification

Cited by: 0

Authors:

Pang, Xiaomin [1]

Mak, Man-Wai [1]

Affiliations:

[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Hong Kong, Hong Kong, Peoples R China

Source:

2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP) | 2014

Keywords:

Speaker verification; i-vectors; probabilistic LDA; NIST 2012 SRE; noise robustness; ACOUSTIC FACTOR-ANALYSIS

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

The i-vector representation and probabilistic linear discriminant analysis (PLDA) have shown state-of-the-art performance in many speaker verification systems. However, in real-world environments, additive and convolutive noise cause mismatches between training and recognition conditions, degrading the performance. In this paper, a fusion system that combines a multi-condition PLDA model and a mixture of SNR-dependent PLDA models is proposed to make the verification system noise robust. The SNR of test utterances is used to determine the best SNR-dependent PLDA model to score against the target-speaker's i-vectors. The performance of the fusion system is demonstrated on NIST 2012 SRE. Results show that the SNR-dependent PLDA models can reduce EER and that the fusion system is more robust than the conventional i-vector/PLDA systems under noisy conditions. It is also found that the SNR-dependent PLDA models are insensitive to Z-norm parameters.
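
The selection-and-fusion idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not say how the best SNR-dependent model is chosen or how the two systems are fused, so the nearest-SNR hard selection, the linear fusion with a `weight` parameter, and every name below (`select_snr_model`, `fused_score`, `make_dot_scorer`) are assumptions made for illustration. The scaled dot product is only a stand-in for a real PLDA likelihood-ratio scorer.

```python
from typing import Callable, Dict, Sequence

# A scorer maps (target i-vector, test i-vector) to a verification score.
Scorer = Callable[[Sequence[float], Sequence[float]], float]


def make_dot_scorer(scale: float) -> Scorer:
    """Hypothetical stand-in for a trained PLDA scorer (NOT real PLDA)."""
    def score(target: Sequence[float], test: Sequence[float]) -> float:
        return scale * sum(a * b for a, b in zip(target, test))
    return score


def select_snr_model(test_snr_db: float, snr_models: Dict[float, Scorer]) -> float:
    """Return the nominal SNR (dB) of the model closest to the test SNR.

    Assumption: hard nearest-SNR selection; the paper may use a softer rule.
    """
    return min(snr_models, key=lambda snr: abs(snr - test_snr_db))


def fused_score(
    target_ivec: Sequence[float],
    test_ivec: Sequence[float],
    test_snr_db: float,
    snr_models: Dict[float, Scorer],
    multi_condition_model: Scorer,
    weight: float = 0.5,
) -> float:
    """Linearly fuse the selected SNR-dependent score with the multi-condition score.

    Assumption: linear fusion with a fixed weight, chosen here for simplicity.
    """
    chosen = select_snr_model(test_snr_db, snr_models)
    snr_score = snr_models[chosen](target_ivec, test_ivec)
    mc_score = multi_condition_model(target_ivec, test_ivec)
    return weight * snr_score + (1.0 - weight) * mc_score


# Toy usage: two SNR-dependent models and one multi-condition model.
snr_models = {6.0: make_dot_scorer(1.0), 15.0: make_dot_scorer(2.0)}
multi_condition = make_dot_scorer(3.0)
score = fused_score([1.0, 0.0], [2.0, 0.0], 14.0, snr_models, multi_condition)
```

In this toy setup a test utterance at 14 dB is routed to the 15 dB model. In practice the SNR groups, the scorers, and the fusion weight would all be estimated from development data.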

Pages: 619-623

Number of pages: 5
