IMPROVING SPEAKER VERIFICATION IN REVERBERANT ENVIRONMENTS

Cited by: 1

Authors:

Chen, Xiao [1]

Zahorian, Stephen A. [1]

Affiliations:

[1] SUNY Binghamton, Dept Elect & Comp Engn, Binghamton, NY 13902 USA

Source:

2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021

Keywords:

CNN; dereverberation; speaker verification; pitch; frontend

DOI:

10.1109/ICASSP39728.2021.9413731

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

Speaker verification technology has been successfully adopted and integrated into many applications. However, most of these applications require a microphone located near the talker. With distant microphones, speech signals are corrupted by reverberation caused by the large speaker-to-microphone distance. In this paper, we first introduce a new feature set that gives more detail in the frequency dimension of the 2-D time-frequency space used to represent speech. These features are computed using two sets of basis vectors, both of which are applied directly to the amplitude-compressed FFT spectrum. One set of basis vectors accounts for the spectral envelope, while the second set accounts for pitch. These features are used to train a Convolutional Neural Network (CNN), with the goal of reducing the negative effects of reverberation. The proposed frontend is shown to be robust for speaker verification in reverberant environments.
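
The feature computation described in the abstract could be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not specify the compression function, the form of the basis vectors, or any sizes, so the power-law compression, the cosine basis vectors, and every name and parameter value below (`compressed_spectrum`, `cosine_basis`, `frame_features`, `power=0.3`, `n_env=13`, the pitch order range) are hypothetical choices made for the example.

```python
import numpy as np

def compressed_spectrum(frame, n_fft=512, power=0.3):
    """Magnitude FFT spectrum with power-law amplitude compression.

    The power-law form and the exponent are assumptions; the abstract
    only says the spectrum is amplitude compressed.
    """
    windowed = frame * np.hanning(len(frame))
    magnitude = np.abs(np.fft.rfft(windowed, n=n_fft))
    return magnitude ** power

def cosine_basis(orders, n_bins):
    """Hypothetical basis: one cosine over the frequency axis per order.

    Low orders vary slowly across frequency (smooth spectral envelope);
    high orders vary quickly (harmonic ripple related to pitch).
    """
    positions = (np.arange(n_bins) + 0.5) / n_bins
    return np.cos(np.pi * np.outer(orders, positions))

def frame_features(frame, n_fft=512, n_env=13, pitch_orders=range(40, 72)):
    """Project one compressed spectrum onto both basis sets and concatenate."""
    spec = compressed_spectrum(frame, n_fft)
    n_bins = spec.shape[0]
    env_basis = cosine_basis(np.arange(n_env), n_bins)            # envelope set
    pitch_basis = cosine_basis(np.array(list(pitch_orders)), n_bins)  # pitch set
    return np.concatenate([env_basis @ spec, pitch_basis @ spec])

# Usage: one 400-sample frame of synthetic noise stands in for real speech.
rng = np.random.default_rng(0)
frame = rng.standard_normal(400)
features = frame_features(frame)
print(features.shape)
```

Stacking such per-frame vectors over time would give a 2-D time-frequency array of the kind a CNN frontend could take as input; how the paper actually arranges its features for the network is not stated in the abstract.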

Citation

Pages: 5854-5858

Page count: 5

50 items in total

[1] Geometric contamination for GMM/UBM speaker verification in reverberant environments
Brutti, Alessio
Omologo, Maurizio
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3632 - 3636
[2] Improving Speaker Verification for Reverberant Conditions with Deep Neural Network Dereverberation Processing
Guzewich, Peter
Zahorian, Stephen
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 171 - 175
[3] Ensemble Based Speaker Verification Using Adapted Score Fusion in Noisy Reverberant Environments
Nakanishi, Ryosuke
Shiota, Sayaka
Kiya, Hitoshi
2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
[4] Speaker Localization by Humanoid Robots in Reverberant Environments
Tourbabin, Vladimir
Rafaely, Boaz
2014 IEEE 28TH CONVENTION OF ELECTRICAL & ELECTRONICS ENGINEERS IN ISRAEL (IEEEI), 2014,
[5] Two-space variability compensation technique for speaker verification in short length and reverberant environments
Reyes-Díaz F.J.
Hernández-Sierra G.
Calvo de Lara J.R.
International Journal of Speech Technology, 2017, 20 (3) : 475 - 485
[6] Improving ASR in Reverberant Environments
Liao, Yen-Lun
Lin, Chi-Han
Lyu, Ren-Yuan
Jang, Jyh-Shing Roger
2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 165 - 169
[7] Nonlinear filtering for speaker tracking in noisy and reverberant environments
Vermaak, J
Blake, A
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 3021 - 3024
[8] Time-Varying Autoregressions for Speaker Verification in Reverberant Conditions
Vestman, Ville
Gowda, Dhananjaya
Sahidullah, Md
Alku, Paavo
Kinnunen, Tomi
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1512 - 1516
[9] Model selection toward robustness speaker verification in reverberant conditions
Al-Karawi, Khamis A.
Ahmed, Shaymaa T.
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (30) : 36549 - 36566
