Noise robust estimate of speech dynamics for speaker recognition

被引:0
|
作者
Openshaw, JP
Mason, JS
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper investigates the robustness of cepstral based features with respect to additive noise, and details two methods of increasing the robustness with minimal need for o-priori knowledge of the noise statistics. The first approach is a form of noise masking which adds a fixed offset to the linear spectral estimate. The second is a form of sub-band filtering, again in the linear domain, which estimates the dynamic content of the speech using Fourier transforms. This avoids negative values normally inherent in such filtering and which presents difficulties in deriving log estimates. Both methods are shown to provide useful levels of robustness to additive noise, for example, speaker identification error rates in SNR mis-matched conditions of 15 dB are reduced from 60.5% for standard mel cepstra to 13.8% and 24.1% for the two approaches respectively.
引用
收藏
页码:925 / 928
页数:4
相关论文
共 50 条
  • [11] Noise robust speaker identification for spontaneous Arabic speech
    Graciarena, Martin
    Kajarekar, Sachin
    Stolcke, Andreas
    Shriberg, Elizabeth
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 245 - +
  • [12] Channel and speaker adaptation techniques for robust speech recognition
    Chen, Jingdong
    Yao, Lei
    Huang, Taiyi
    Shengxue Xuebao/Acta Acustica, 1998, 23 (06): : 537 - 544
  • [13] Robust Digital Speech Watermarking For Online Speaker Recognition
    Nematollahi, Mohammad Ali
    Gamboa-Rosales, Hamurabi
    Akhaee, Mohammad Ali
    Al-Haddad, S. A. R.
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [14] Channel Robust MFCCs for Continuous Speech Speaker Recognition
    Chougule, Sharada Vikram
    Chavan, Mahesh S.
    ADVANCES IN SIGNAL PROCESSING AND INTELLIGENT RECOGNITION SYSTEMS, 2014, 264 : 557 - 568
  • [15] Robust speech recognition with speaker localization by a microphone array
    Yamada, T
    Nakamura, S
    Shikano, K
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1317 - 1320
  • [16] An Integrated Approach to Robust Speaker Identification and Speech Recognition
    Kwan, C.
    Yin, J.
    Ayhan, B.
    Chu, S.
    Liu, X.
    Puckett, K.
    Zhao, Y.
    Ho, K. C.
    Kruger, M.
    Sityar, I.
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 1635 - +
  • [17] Adaptive wavelet shrinkage for noise robust speaker recognition
    Govindan, Sumithra Manimegalai
    Duraisamy, Prakash
    Yuan, Xiaohui
    DIGITAL SIGNAL PROCESSING, 2014, 33 : 180 - 190
  • [18] Noise Robust Speaker Recognition with Convolutive Sparse Coding
    Hurmalainen, Antti
    Saeidi, Rahim
    Virtanen, Tuomas
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 244 - 248
  • [19] SHORT-TIMED SPEECH DYNAMICS FOR SPEAKER RECOGNITION
    LI, H
    HATON, JP
    SU, J
    ELECTRONICS LETTERS, 1995, 31 (17) : 1416 - 1418
  • [20] Unsupervised speaker adaptation for robust speech recognition in real environments
    Yamade, S
    Baba, A
    Yoshikawa, S
    Lee, A
    Saruwatari, H
    Shikano, K
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2005, 88 (08): : 30 - 41