IMPROVING SPEAKER VERIFICATION IN REVERBERANT ENVIRONMENTS

Cited by: 1
Authors
Chen, Xiao [1]
Zahorian, Stephen A. [1]
Affiliation
[1] SUNY Binghamton, Dept Elect & Comp Engn, Binghamton, NY 13902 USA
Source
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021
Keywords
CNN; dereverberation; speaker verification; pitch; frontend
DOI
10.1109/ICASSP39728.2021.9413731
Chinese Library Classification
O42 [Acoustics];
Subject classification code
070206 ; 082403 ;
Abstract
Speaker verification technology has been successfully adopted and integrated into many applications. However, most of these applications require a microphone located near the talker. With distant microphones, speech signals are corrupted by reverberation caused by the large speaker-to-microphone distance. In this paper, we first introduce a new feature set that provides more detail along the frequency dimension of the 2-D time-frequency space used to represent speech. These features are computed using two sets of basis vectors, both of which are applied directly to the amplitude-compressed FFT spectrum. One set of basis vectors accounts for the spectral envelope, while the second set accounts for pitch. These features are used to train a Convolutional Neural Network (CNN), with the goal of reducing the negative effects of reverberation. The proposed frontend is shown to be robust for speaker verification in reverberant environments.
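As a rough illustration of the feature computation the abstract describes, the sketch below projects an amplitude-compressed FFT magnitude spectrum onto two sets of basis vectors. The abstract does not specify the basis vectors, the compression function, or any parameter values, so everything here is an assumption made for illustration: the cosine bases, the power-law compression with exponent 0.3, and the names `cosine_basis` and `frame_features` are hypothetical and are not the authors' implementation.

```python
import numpy as np


def cosine_basis(n_bins, orders):
    """Return cosine basis vectors (one per row) sampled over n_bins frequency bins.

    Assumption: cosines are used only as a stand-in; the paper's actual
    basis vectors are not given in the abstract.
    """
    k = (np.arange(n_bins) + 0.5) / n_bins  # normalized frequency in (0, 1)
    return np.cos(np.pi * np.outer(orders, k))


def frame_features(frame, n_fft=512, n_env=16, pitch_orders=range(32, 64), power=0.3):
    """Project one speech frame onto two hypothetical sets of basis vectors."""
    windowed = frame * np.hanning(len(frame))
    magnitude = np.abs(np.fft.rfft(windowed, n=n_fft))
    compressed = magnitude ** power  # assumed power-law amplitude compression
    n_bins = compressed.shape[0]

    # Low-order cosines vary slowly with frequency: a proxy for the spectral envelope.
    env_basis = cosine_basis(n_bins, np.arange(n_env))
    # Higher-order cosines respond to fine harmonic ripple: a proxy for pitch.
    pitch_basis = cosine_basis(n_bins, np.asarray(list(pitch_orders)))

    envelope_feats = env_basis @ compressed / n_bins
    pitch_feats = pitch_basis @ compressed / n_bins
    return np.concatenate([envelope_feats, pitch_feats])


if __name__ == "__main__":
    fs = 16000
    t = np.arange(400) / fs  # one 25 ms frame
    # Synthetic harmonic signal with a 150 Hz fundamental.
    frame = sum(np.sin(2 * np.pi * 150 * h * t) for h in range(1, 6))
    print(frame_features(frame).shape)
```

Stacking such per-frame vectors over time would give a 2-D time-frequency representation of the kind a CNN frontend could take as input; how the authors actually arrange and train on these features is not stated in the abstract.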
Pages: 5854-5858
Number of pages: 5