Fusion of SNR-Dependent PLDA Models for Noise Robust Speaker Verification

被引:0
|
作者
Pang, Xiaomin [1 ]
Mak, Man-Wai [1 ]
机构
[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Hong Kong, Hong Kong, Peoples R China
来源
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP) | 2014年
关键词
Speaker verification; i-vectors; probabilistic LDA; NIST; 2012; SRE; noise robustness; ACOUSTIC FACTOR-ANALYSIS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The i-vector representation and probabilistic linear discriminant analysis (PLDA) have shown state-of-the-art performance in many speaker verification systems. However, in real-world environments, additive and convolutive noise cause mismatches between training and recognition conditions, degrading the performance. In this paper, a fusion system that combines a multi-condition PLDA model and a mixture of SNR-dependent PLDA models is proposed to make the verification system noise robust. The SNR of test utterances is used to determine the best SNR-dependent PLDA model to score against the target-speaker's i-vectors. The performance of the fusion system is demonstrated on NIST 2012 SRE. Results show that the SNR-dependent PLDA models can reduce EER and that the fusion system is more robust than the conventional i-vector/PLDA systems under noisy conditions. It is also found that the SNR-dependent PLDA models are insensitive to Z-norm parameters.
引用
收藏
页码:619 / 623
页数:5
相关论文
共 50 条
  • [41] Improving Robustness of Speaker Verification by Fusion of Prompted Text-Dependent and Text-Independent Operation Modalities
    Mporas, Iosif
    Safavi, Saeid
    Sotudeh, Reza
    SPEECH AND COMPUTER, 2016, 9811 : 378 - 385
  • [42] Evaluation of a noise-robust multi-stream speaker verification method using F0 information
    Asami, Taichi
    Iwano, Koji
    Furui, Sadaoki
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (03) : 549 - 557
  • [43] Robust Text-Dependent Speaker Verification via Character-Level Information Preservation for the SdSV Challenge 2020
    Mun, Sung Hwan
    Kang, Woo Hyun
    Han, Min Hyun
    Kim, Nam Soo
    INTERSPEECH 2020, 2020, : 741 - 745
  • [44] SPEECH ENHANCEMENT USING LONG SHORT-TERM MEMORY BASED RECURRENT NEURAL NETWORKS FOR NOISE ROBUST SPEAKER VERIFICATION
    Kolbaek, Morten
    Tan, Zheng-Hua
    Jensen, Jesper
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 305 - 311
  • [45] Robust Speaker Verification Using Short-Time Frequency with Long-Time Window and Fusion of Multi-Resolutions
    Huang, Chien-Lin
    Ma, Bin
    Wu, Chung-Hsien
    Mak, Brian
    Li, Haizhou
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1897 - +
  • [46] A speaker identification-verification approach for noise-corrupted and improved speech using fusion features and a convolutional neural network
    Nisa R.
    Baba A.M.
    International Journal of Information Technology, 2024, 16 (6) : 3493 - 3501
  • [47] Text-dependent speaker verification based on i-vectors, Neural Networks and Hidden Markov Models
    Zeinali, Hossein
    Sameti, Hossein
    Burget, Lukas
    Cernocky, Jan Honza
    COMPUTER SPEECH AND LANGUAGE, 2017, 46 : 53 - 71
  • [48] Noise-Aware Extended U-Net With Split Encoder and Feature Refinement Module for Robust Speaker Verification in Noisy Environments
    Lim, Chan-Yeong
    Heo, Jungwoo
    Kim, Ju-Ho
    Shin, Hyun-Seo
    Yu, Ha-Jin
    IEEE ACCESS, 2024, 12 : 111673 - 111682
  • [49] I4U submission to NIST SRE 2012: A large-scale collaborative effort for noise-robust speaker verification
    Saeidi, R.
    Lee, K. A.
    Kinnunen, T.
    Hasan, T.
    Fauve, B.
    Bousquet, P-M.
    Khoury, E.
    Martinez, P. L. Sordo
    Kua, J. M. K.
    You, C. H.
    Sun, H.
    Larcher, A.
    Rajan, P.
    Hautamaki, V.
    Hanilci, C.
    Braithwaite, B.
    Gonzales-Hautamaki, R.
    Sadjadi, S. O.
    Liu, G.
    Boril, H.
    Shokouhi, N.
    Matrouf, D.
    El Shafey, L.
    Mowlaee, P.
    Epps, J.
    Thiruvaran, T.
    van Leeuwen, D. A.
    Ma, B.
    Li, H.
    Hansen, J. H. L.
    Bonastre, J-F.
    Marcel, S.
    Mason, J.
    Ambikairajah, E.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1985 - 1989
  • [50] Multi-Task Network for Noise-Robust Keyword Spotting and Speaker Verification using CTC-based Soft VAD and Global Query Attention
    Jung, Myunghun
    Jung, Youngmoon
    Goo, Jahyun
    Kim, Hoirin
    INTERSPEECH 2020, 2020, : 931 - 935