LEARNABLE NONLINEAR COMPRESSION FOR ROBUST SPEAKER VERIFICATION

被引：2

作者：

Liu, Xuechen ^{[1
,2
]}

Sahidullah, Md ^{[2
]}

Kinnunen, Tomi ^{[1
]}

机构：

[1] Univ Eastern Finland, Sch Comp, Joensuu, Finland

[2] Univ Lorraine, INRIA, CNRS, LORIA, F-54000 Nancy, France

来源：

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年

关键词：

Speaker Verification; Nonlinear Compression; Multi-Regime Compression; RECOGNITION;

D O I：

10.1109/ICASSP43922.2022.9747185

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this study, we focus on nonlinear compression methods in spectral features for speaker verification based on deep neural network. We consider different kinds of channel-dependent (CD) nonlinear compression methods optimized in a data-driven manner. Our methods are based on power nonlinearities and dynamic range compression (DRC). We also propose multi-regime (MR) design on the nonlinearities, at improving robustness. Results on VoxCeleb1 and VoxMovies data demonstrate improvements brought by proposed compression methods over both the commonly-used logarithm and their static counterparts, especially for ones based on power function. While CD generalization improves performance on VoxCeleb1, MR provides more robustness on VoxMovies, with a maximum relative equal error rate reduction of 21.6%.

引用

页码：7962 / 7966

页数：5

共 50 条

[1] Learnable MFCCs for Speaker Verification
Liu, Xuechen
Sahidullah, Md
Kinnunen, Tomi
2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
[2] Learnable Sparse Filterbank for Speaker Verification
Peng, Junyi
Gu, Rongzhi
Mosner, Ladislav
Plchot, Oldrich
Burget, Lukas
Cernocky, Jan
INTERSPEECH 2022, 2022, : 5110 - 5114
[3] Disentangled Speaker and Nuisance Attribute Embedding for Robust Speaker Verification
Kang, Woo Hyun
Mun, Sung Hwan
Han, Min Hyun
Kim, Nam Soo
IEEE ACCESS, 2020, 8 : 141838 - 141849
[4] DISENTANGLED SPEAKER EMBEDDING FOR ROBUST SPEAKER VERIFICATION
Yi, Lu
Mak, Man-Wai
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7662 - 7666
[5] Acoustic Factor Analysis for Robust Speaker Verification
Hasan, Taufiq
Hansen, John H. L.
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (04): : 842 - 853
[6] Attentive Feature Fusion for Robust Speaker Verification
Liu, Bei
Chen, Zhengyang
Qian, Yanmin
INTERSPEECH 2022, 2022, : 286 - 290
[7] A Robust SVM/GMM Classifier for Speaker Verification
Cirovic, Zoran
Cirovic, Natasa
SPEECH AND COMPUTER, 2014, 8773 : 74 - 80
[8] Gradient Regularization for Noise-Robust Speaker Verification
Li, Jianchen
Han, Jiqing
Song, Hongwei
INTERSPEECH 2021, 2021, : 1074 - 1078
[9] SNR-Invariant PLDA Modeling for Robust Speaker Verification
Li, Na
Mak, Man-Wai
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2317 - 2321
[10] A nonlinear autoregressive model for speaker verification
Srinivasan, Sundararajan
Ma, Tao
Lazarou, Georgios
Picone, Joseph
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (01) : 17 - 25

← 1 2 3 4 5 →