A PITCH BASED NOISE ESTIMATION TECHNIQUE FOR ROBUST SPEECH RECOGNITION WITH MISSING DATA

被引:0
作者
Morales-Cordovilla, Juan A. [1 ]
Ma, Ning [2 ]
Sanchez, Victoria [1 ]
Carmona, Jose L. [1 ]
Peinado, Antonio M. [1 ]
Barker, Jon [2 ]
机构
[1] Univ Granada, Dept Teoria Senal Telemat & Comunicac, E-18071 Granada, Spain
[2] Univ Sheffield, Dept Comp Sci, Sheffield, South Yorkshire, England
来源
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011年
基金
英国工程与自然科学研究理事会;
关键词
Robust speech recognition; missing data; noise estimation; VAD; harmonic tunnelling;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a noise estimation technique based on knowledge of pitch information for robust speech recognition. In the first stage the noise is estimated by means of extrapolating the noise from frames where speech is believed to be absent. These frames are detected with a proposed pitch based VAD (Voice Activity Detector). In the second stage the noise estimation is revised in voiced frames using harmonic tunnelling thechnique. The tunnelling noise estimation is used at high SNRs as an upper bound of the noise rather than a suitable estimation. A spectrogram MD (Missing Data) recognition system is chosen to evaluate the proposed noise estimation. The proposed system is compared in Aurora-2 with other similar techniques like cepstral SS (Spectral Subtraction).
引用
收藏
页码:4808 / 4811
页数:4
相关论文
共 50 条
  • [21] Feature Extraction Based on Pitch-Synchronous Averaging for Robust Speech Recognition
    Morales-Cordovilla, Juan A.
    Peinado, Antonio M.
    Sanchez, Victoria
    Gonzalez, Jose A.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (03): : 640 - 651
  • [22] Mask classification for missing-feature reconstruction for robust speech recognition in unknown background noise
    Kim, Wooil
    Stern, Richard M.
    SPEECH COMMUNICATION, 2011, 53 (01) : 1 - 11
  • [23] Robust nonparametric estimation with missing data
    Boente, Graciela
    Gonzalez-Manteiga, Wenceslao
    Perez-Gonzalez, Ana
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2009, 139 (02) : 571 - 592
  • [24] NOISE ADAPTATION ALGORITHMS FOR ROBUST SPEECH RECOGNITION
    CUNG, HM
    NORMANDIN, Y
    SPEECH COMMUNICATION, 1993, 12 (03) : 267 - 276
  • [25] A Robust Pitch Extractor Based on DTW Lines and CASA with Application in Noisy Speech Recognition
    Morales-Cordovilla, Juan A.
    Cabanas-Molero, Pablo
    Peinado, Antonio M.
    Sanchez, Victoria
    ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, 2012, 328 : 197 - 206
  • [26] Mask estimation and imputation methods for missing data speech recognition in a multisource reverberant environment
    Keronen, Sami
    Kallasjoki, Heikki
    Remes, Ulpu
    Brown, Guy J.
    Gemmeke, Jort F.
    Palomaki, Kalle J.
    COMPUTER SPEECH AND LANGUAGE, 2013, 27 (03) : 798 - 819
  • [27] A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation
    Palomäki, KJ
    Brown, GJ
    Wang, DL
    SPEECH COMMUNICATION, 2004, 43 (04) : 361 - 378
  • [28] Speech extraction based on AuxIVA with weighted source variance and noise dependence for robust speech recognition
    Shin, Ui-Hyeop
    Park, Hyung-Min
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2022, 41 (03): : 326 - 334
  • [29] IPW-based robust estimation of the SAR model with missing data
    Luo, Guowang
    Wu, Mixia
    Xu, Liwen
    STATISTICS & PROBABILITY LETTERS, 2021, 172
  • [30] Issues with Uncertainty Decoding for Noise Robust Speech Recognition
    Liao, H.
    Gales, M. J. F.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1121 - 1124