LEARNABLE NONLINEAR COMPRESSION FOR ROBUST SPEAKER VERIFICATION

被引:2
|
作者
Liu, Xuechen [1 ,2 ]
Sahidullah, Md [2 ]
Kinnunen, Tomi [1 ]
机构
[1] Univ Eastern Finland, Sch Comp, Joensuu, Finland
[2] Univ Lorraine, INRIA, CNRS, LORIA, F-54000 Nancy, France
来源
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年
关键词
Speaker Verification; Nonlinear Compression; Multi-Regime Compression; RECOGNITION;
D O I
10.1109/ICASSP43922.2022.9747185
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this study, we focus on nonlinear compression methods in spectral features for speaker verification based on deep neural network. We consider different kinds of channel-dependent (CD) nonlinear compression methods optimized in a data-driven manner. Our methods are based on power nonlinearities and dynamic range compression (DRC). We also propose multi-regime (MR) design on the nonlinearities, at improving robustness. Results on VoxCeleb1 and VoxMovies data demonstrate improvements brought by proposed compression methods over both the commonly-used logarithm and their static counterparts, especially for ones based on power function. While CD generalization improves performance on VoxCeleb1, MR provides more robustness on VoxMovies, with a maximum relative equal error rate reduction of 21.6%.
引用
收藏
页码:7962 / 7966
页数:5
相关论文
共 50 条
  • [21] Model Compression for DNN-based Speaker Verification Using Weight Quantization
    Li, Jingyu
    Liu, Wei
    Zhang, Zhaoyang
    Wang, Jiong
    Lee, Tan
    INTERSPEECH 2023, 2023, : 1988 - 1992
  • [22] Robust Speaker Verification using Self Organizing Map
    Das, Pranab
    Bhatacharjee, Utpal
    2014 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT, 2014,
  • [23] Robust Methods for Text-Dependent Speaker Verification
    Bhukya, Ramesh K.
    Prasanna, S. R. Mahadeva
    Sarma, Biswajit Dev
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (11) : 5253 - 5288
  • [24] Modified Segmental Histogram Equalization for robust speaker verification
    Skosan, M
    Mashao, D
    PATTERN RECOGNITION LETTERS, 2006, 27 (05) : 479 - 486
  • [25] DNN FEATURE COMPENSATION FOR NOISE ROBUST SPEAKER VERIFICATION
    Du, Steven
    Xiao, Xiong
    Chng, Eng Siong
    2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 871 - 875
  • [26] Robust Session Variability Compensation for SVM Speaker Verification
    Seo, Hyunson
    Jung, Chi-Sang
    Kang, Hong-Goo
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06): : 1631 - 1641
  • [27] Improved Multitaper PNCC Feature for Robust Speaker Verification
    Liu, Yi
    He, Liang
    Liu, Jia
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 168 - 172
  • [28] Robust Speaker Verification Under Additive Noise Condition
    Zhang E.-H.
    Wang M.-H.
    Tang Z.-M.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (06): : 1244 - 1250
  • [29] Refining Cosine Distance Features for Robust Speaker Verification
    Balasingam, M. D.
    Kumar, C. Santhosh
    PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2018, : 152 - 155
  • [30] Senone I-Vectors for Robust Speaker Verification
    Tan, Zhili
    Zhu, Yingke
    Mak, Man-Wai
    Mak, Brian Kan-Wing
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,