Speaker Re-identification with Speaker Dependent Speech Enhancement

被引:3
|
作者
Shi, Yanpei [1 ]
Huang, Qiang [1 ]
Hain, Thomas [1 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Speech & Hearing Res Grp, Sheffield, S Yorkshire, England
来源
基金
“创新英国”项目;
关键词
Speech Enhancement; Speaker Identification; Speaker Verification; Noise Robustness; NOISY;
D O I
10.21437/Interspeech.2020-1772
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
While the use of deep neural networks has significantly boosted speaker recognition performance, it is still challenging to separate speakers in poor acoustic environments. Here speech enhancement methods have traditionally allowed improved performance. The recent works have shown that adapting speech enhancement can lead to further gains. This paper introduces a novel approach that cascades speech enhancement and speaker recognition. In the first step, a speaker embedding vector is generated, which is used in the second step to enhance the speech quality and re-identify the speakers. Models are trained in an integrated framework with joint optimisation. The proposed approach is evaluated using the Voxceleb1 dataset, which aims to assess speaker recognition in real world situations. In addition three types of noise at different signal-noise-ratios were added for this work. The obtained results show that the proposed approach using speaker dependent speech enhancement can yield better speaker recognition and speech enhancement performances than two baselines in various noise conditions.
引用
收藏
页码:1530 / 1534
页数:5
相关论文
共 50 条
  • [41] Performance of selective speech features for speaker identification
    Department of Electronics and Communication Engineering, Indian Institute of Technology, Guwahati 781039, India
    J Inst Eng India Part CP, 2008, MAY (38-46):
  • [42] Text Dependent Speaker Identification and Speech Recognition Using Artificial Neural Network
    Swamy, Suma
    Shalini, T.
    Nagabhushan, Sindhu P.
    Nawaz, Sumaiah
    Ramakrishnan, K. V.
    GLOBAL TRENDS IN COMPUTING AND COMMUNICATION SYSTEMS, PT 1, 2012, 269 : 160 - +
  • [43] VOWEL AND SPEAKER IDENTIFICATION IN NATURAL AND SYNTHETIC SPEECH
    LEHISTE, I
    MELTZER, D
    LANGUAGE AND SPEECH, 1973, 16 (OCT-D) : 356 - 364
  • [44] VOWEL AND SPEAKER IDENTIFICATION IN NATURAL AND SYNTHETIC SPEECH
    MELTZER, D
    LEHISTE, I
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1972, 51 (01): : 131 - &
  • [45] ACOUSTIC ANALYSIS FOR SPEAKER IDENTIFICATION OF WHISPERED SPEECH
    Fan, Xing
    Hansen, John H. L.
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5046 - 5049
  • [46] Low-SNR, Speaker-Dependent Speech Enhancement using GMMs and MFCCs
    Boucheron, Laura E.
    De Leon, Phillip L.
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 574 - 577
  • [47] Speaker-dependent Mapping of Source and System Features for Enhancement of Throat Microphone Speech
    Joseph, Anand M.
    Reddy, Harish M.
    Yegnanarayana, B.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 985 - 988
  • [48] Speaker indexing and speech enhancement in real meetings/conversations
    Araki, Shoko
    Fujimoto, Masakiyo
    Ishizuka, Kentaro
    Sawada, Hiroshi
    Makino, Shoji
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 93 - 96
  • [49] TWO MICROPHONES SPEECH ENHANCEMENT SYSTEMS BASED ON INSTRUMENTAL VARIABLE ALGORITHM FOR SPEAKER IDENTIFICATION
    Gabrea, Marcel
    2011 24TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2011, : 569 - 572
  • [50] REINFORCING SPEAKER - EFFECTS OF SPEECH, SPEAKER, AND LISTENER
    JOHNSON, RC
    DANKO, GP
    PSYCHOLOGICAL RECORD, 1977, 27 (02): : 489 - 492