Assessment of Single-Channel Speech Enhancement Techniques for Speaker Identification under Mismatched Conditions

Cited by: 0
Authors
Sadjadi, Seyed Omid [1 ]
Hansen, John H. L. [1 ]
Affiliations
[1] Univ Texas Dallas, CRSS, Dallas, TX 75230 USA
Keywords
feature extraction; gammatone filterbank; Hilbert envelope; speaker identification; speech enhancement; RECOGNITION; SIGNAL;
DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808 ; 0809 ;
Abstract
It is well known that MFCC-based speaker identification (SID) systems easily break down under mismatched training and test conditions. In this paper, we report on a study that considers four different single-channel speech enhancement front-ends for robust SID under such conditions. Speech files from the YOHO database are corrupted with four types of noise (babble, car, factory, and white Gaussian) at five SNR levels (0-20 dB), and processed using four speech enhancement techniques representing distinct classes of algorithms: spectral subtraction, statistical model-based, subspace, and Wiener filtering. Both processed and unprocessed files are submitted to a SID system trained on clean data. In addition, a new set of acoustic feature parameters based on the Hilbert envelope of gammatone filterbank outputs is proposed and evaluated for the SID task. Experimental results indicate that: (i) depending on the noise type and SNR level, the enhancement front-ends may help or hurt SID performance, and (ii) the proposed feature achieves significantly higher SID accuracy than MFCCs under mismatched conditions.
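The noise-corruption step described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the function names (`power`, `mix_at_snr`, `measured_snr_db`), the synthetic 200 Hz tone standing in for a YOHO utterance, the 8 kHz sampling rate, and the 5 dB spacing of the five SNR levels are all assumptions made for illustration. The sketch scales a noise signal so that the speech-to-noise power ratio matches a target SNR before adding it to the speech.

```python
import math
import random


def power(signal):
    """Mean squared amplitude of a sequence of samples."""
    return sum(x * x for x in signal) / len(signal)


def mix_at_snr(speech, noise, target_db):
    """Scale `noise` to reach `target_db` SNR against `speech`, then add it.

    Returns (noisy_signal, scaled_noise). Hypothetical helper, not from the paper.
    """
    if len(noise) < len(speech):
        raise ValueError("noise must be at least as long as speech")
    noise = noise[:len(speech)]
    p_noise = power(noise)
    if p_noise == 0:
        raise ValueError("noise has zero power")
    # Noise power needed for the target SNR: P_speech / 10^(SNR/10).
    target_noise_power = power(speech) / (10 ** (target_db / 10))
    gain = math.sqrt(target_noise_power / p_noise)
    scaled = [gain * n for n in noise]
    noisy = [s + n for s, n in zip(speech, scaled)]
    return noisy, scaled


def measured_snr_db(speech, noise):
    """SNR in dB computed from the clean speech and the added noise."""
    return 10 * math.log10(power(speech) / power(noise))


rng = random.Random(0)
fs = 8000  # assumed sampling rate
# Stand-in "speech": a 1-second 200 Hz tone (assumption, not YOHO data).
speech = [math.sin(2 * math.pi * 200 * t / fs) for t in range(fs)]
# White Gaussian noise, one of the four noise types named in the abstract.
noise = [rng.gauss(0.0, 1.0) for _ in range(fs)]

# Five levels spanning 0-20 dB; the 5 dB step is an assumption.
for target in (0, 5, 10, 15, 20):
    noisy, scaled = mix_at_snr(speech, noise, target)
    print(target, round(measured_snr_db(speech, scaled), 6))
```

Computing the gain from signal powers, rather than from peak amplitudes, keeps the achieved SNR equal to the target regardless of the noise type, which matters when comparing babble, car, factory, and white noise at the same nominal level.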
Pages: 2138 / 2141
Number of pages: 4
Related papers
50 items in total
  • [41] Effectiveness of Single-Channel BLSTM Enhancement for Language Identification
    Frederiksen, Peter Sibbern
    Villalba, Jesus
    Watanabe, Shinji
    Tan, Zheng-Hua
    Dehak, Najim
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1823 - 1827
  • [42] On Speech Intelligibility Estimation of Phase-Aware Single-Channel Speech Enhancement
    Gaich, Andreas
    Mowlaee, Pejman
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2553 - 2557
  • [43] STFT Phase Reconstruction in Voiced Speech for an Improved Single-Channel Speech Enhancement
    Krawczyk, Martin
    Gerkmann, Timo
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (12) : 1931 - 1940
  • [44] SINGLE-CHANNEL SPEECH ENHANCEMENT IN A TRANSIENT NOISE ENVIRONMENT BY EXPLOITING SPEECH HARMONICITY
    Wu, Kai
    Reju, V. G.
    Khong, Andy W. H.
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5088 - 5092
  • [45] Robust Far-Field Speaker Identification under Mismatched Conditions
    Jin, Qin
    Schultz, Tanja
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1893 - 1896
  • [46] Parametric modelling for single-channel blind dereverberation of speech from a moving speaker
    Evers, C.
    Hopgood, J. R.
    IET SIGNAL PROCESSING, 2008, 2 (02) : 59 - 74
  • [47] A SPECTRAL CONVERSION BASED SINGLE-CHANNEL SINGLE-MICROPHONE SPEECH ENHANCEMENT
    Huy-Khoi Do
    Quang Vinh Thai
    FOURTH INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING (ICCEE 2011), 2011, : 583 - +
  • [48] ON SPEECH QUALITY ESTIMATION OF PHASE-AWARE SINGLE-CHANNEL SPEECH ENHANCEMENT
    Gaich, Andreas
    Mowlaee, Pejman
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 216 - 220
  • [49] Phase Estimation in Single-Channel Speech Enhancement: Limits-Potential
    Mowlaee, Pejman
    Kulmer, Josef
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (08) : 1283 - 1294
  • [50] Glance and gaze: A collaborative learning framework for single-channel speech enhancement
    Li, Andong
    Zheng, Chengshi
    Zhang, Lu
    Li, Xiaodong
    APPLIED ACOUSTICS, 2022, 187