LEARNABLE NONLINEAR COMPRESSION FOR ROBUST SPEAKER VERIFICATION

Cited by: 2
Authors
Liu, Xuechen [1, 2]
Sahidullah, Md [2]
Kinnunen, Tomi [1]
Affiliations
[1] Univ Eastern Finland, Sch Comp, Joensuu, Finland
[2] Univ Lorraine, INRIA, CNRS, LORIA, F-54000 Nancy, France
Source
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022
Keywords
Speaker Verification; Nonlinear Compression; Multi-Regime Compression; RECOGNITION;
DOI
10.1109/ICASSP43922.2022.9747185
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
In this study, we focus on nonlinear compression methods for spectral features in speaker verification based on deep neural networks. We consider different kinds of channel-dependent (CD) nonlinear compression methods optimized in a data-driven manner. Our methods are based on power nonlinearities and dynamic range compression (DRC). We also propose a multi-regime (MR) design for the nonlinearities, aimed at improving robustness. Results on VoxCeleb1 and VoxMovies data demonstrate improvements brought by the proposed compression methods over both the commonly used logarithm and their static counterparts, especially for those based on the power function. While CD generalization improves performance on VoxCeleb1, MR provides more robustness on VoxMovies, with a maximum relative equal error rate reduction of 21.6%.
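The contrast the abstract draws between the logarithm, a static power function, and a channel-dependent power function can be sketched as follows. This is an illustration only, not the authors' implementation: the function names, the toy spectrogram values, and the exponents are all assumptions, and the DRC and multi-regime variants are not covered because the abstract does not specify them.

```python
import math

def log_compress(spec, eps=1e-10):
    """Static logarithmic compression, the commonly used baseline."""
    return [[math.log(v + eps) for v in frame] for frame in spec]

def power_compress(spec, alphas):
    """Channel-dependent power compression: y[t][c] = x[t][c] ** alphas[c].

    `alphas` holds one exponent per frequency channel. In a learnable
    setup these exponents would be trainable parameters (assumption).
    """
    return [[v ** a for v, a in zip(frame, alphas)] for frame in spec]

def power_grad_alpha(spec, alphas):
    """Derivative of each output w.r.t. its channel exponent: x**a * ln(x).

    Requires strictly positive inputs. This is the quantity a
    gradient-based optimizer would use to update each exponent.
    """
    return [[(v ** a) * math.log(v) for v, a in zip(frame, alphas)]
            for frame in spec]

# Toy spectrogram: 2 frames x 3 channels, hypothetical positive values.
spec = [[1.0, 4.0, 9.0], [16.0, 25.0, 36.0]]

# Static counterpart: one exponent shared by every channel.
static = power_compress(spec, [0.5, 0.5, 0.5])

# Channel-dependent variant: each channel has its own exponent.
per_channel = power_compress(spec, [1.0, 0.5, 0.5])
```

With a shared exponent of 0.5 every channel is square-root compressed, whereas the channel-dependent call leaves the first channel uncompressed. Optimizing the exponents in a data-driven manner would amount to updating `alphas` along the gradient given by `power_grad_alpha`, chained with the loss of the downstream network.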
Pages: 7962-7966
Page count: 5