Speaker Re-identification with Speaker Dependent Speech Enhancement

被引:3
|
作者
Shi, Yanpei [1 ]
Huang, Qiang [1 ]
Hain, Thomas [1 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Speech & Hearing Res Grp, Sheffield, S Yorkshire, England
来源
基金
“创新英国”项目;
关键词
Speech Enhancement; Speaker Identification; Speaker Verification; Noise Robustness; NOISY;
D O I
10.21437/Interspeech.2020-1772
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
While the use of deep neural networks has significantly boosted speaker recognition performance, it is still challenging to separate speakers in poor acoustic environments. Here speech enhancement methods have traditionally allowed improved performance. The recent works have shown that adapting speech enhancement can lead to further gains. This paper introduces a novel approach that cascades speech enhancement and speaker recognition. In the first step, a speaker embedding vector is generated, which is used in the second step to enhance the speech quality and re-identify the speakers. Models are trained in an integrated framework with joint optimisation. The proposed approach is evaluated using the Voxceleb1 dataset, which aims to assess speaker recognition in real world situations. In addition three types of noise at different signal-noise-ratios were added for this work. The obtained results show that the proposed approach using speaker dependent speech enhancement can yield better speaker recognition and speech enhancement performances than two baselines in various noise conditions.
引用
收藏
页码:1530 / 1534
页数:5
相关论文
共 50 条
  • [21] Text─Dependent Speaker Identification
    CHEN Ke XIE Dahong CHI Huisheng National Lab of Machine Perception and Center for Information Science Peking University Beijing
    北京大学学报(自然科学版), 1996, (03) : 128 - 137
  • [22] Speaker identification utilizing noncontemporary speech
    Hollien, H
    Schwartz, R
    JOURNAL OF FORENSIC SCIENCES, 2001, 46 (01) : 63 - 67
  • [23] SPEAKER IDENTIFICATION WITH DISTANT MICROPHONE SPEECH
    Jin, Qin
    Li, Runxin
    Yang, Qian
    Laskowski, Kornel
    Schultz, Tanja
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4518 - 4521
  • [24] LOG SPECTRA ENHANCEMENT USING SPEAKER DEPENDENT PRIORS FOR SPEAKER VERIFICATION
    Maina, Ciira Wa
    Walsh, John MacLaren
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4540 - 4543
  • [25] Speaker Identification using Whispered Speech
    Jawarkar, Naresh P.
    Holambe, Raghunath S.
    Basu, Tapan Kumar
    2013 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT 2013), 2013, : 778 - 781
  • [26] Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
    Kanda, Naoyuki
    Gaur, Yashesh
    Wang, Xiaofei
    Meng, Zhong
    Chen, Zhuo
    Zhou, Tianyan
    Yoshioka, Takuya
    INTERSPEECH 2020, 2020, : 36 - 40
  • [27] Speech Enhancement for Multimodal Speaker Diarization System
    Ahmad, Rehan
    Zubair, Syed
    Alquhayz, Hani
    IEEE ACCESS, 2020, 8 : 126671 - 126680
  • [28] Speech Enhancement Regularized by a Speaker Verification Model
    Lay, Bunlong
    Gerkmann, Timo
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [29] VoiceID Loss: Speech Enhancement for Speaker Verification
    Shon, Suwon
    Tang, Hao
    Glass, James
    INTERSPEECH 2019, 2019, : 2888 - 2892
  • [30] Gradual Enhancement of GMM for Speaker Identification
    Kacur, Juraj
    PROCEEDINGS OF 2017 INTERNATIONAL SYMPOSIUM ELMAR, 2017, : 145 - 148