A PITCH BASED NOISE ESTIMATION TECHNIQUE FOR ROBUST SPEECH RECOGNITION WITH MISSING DATA

Citations: 0
Authors
Morales-Cordovilla, Juan A. [1 ]
Ma, Ning [2 ]
Sanchez, Victoria [1 ]
Carmona, Jose L. [1 ]
Peinado, Antonio M. [1 ]
Barker, Jon [2 ]
Affiliations
[1] Univ Granada, Dept Teoria Senal Telemat & Comunicac, E-18071 Granada, Spain
[2] Univ Sheffield, Dept Comp Sci, Sheffield, South Yorkshire, England
Source
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK;
Keywords
Robust speech recognition; missing data; noise estimation; VAD; harmonic tunnelling;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
This paper presents a noise estimation technique for robust speech recognition that exploits pitch information. In the first stage, the noise is estimated by extrapolating it from frames where speech is believed to be absent; these frames are detected with a proposed pitch-based VAD (Voice Activity Detector). In the second stage, the noise estimate is revised in voiced frames using the harmonic tunnelling technique. At high SNRs, the tunnelling estimate is used as an upper bound on the noise rather than as the estimate itself. A spectrogram MD (Missing Data) recognition system is used to evaluate the proposed noise estimation, and the system is compared on Aurora-2 with similar techniques such as cepstral SS (Spectral Subtraction).
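The two stages described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names (`extrapolate_noise`, `tunnelling_estimate`, `estimate_noise`) are hypothetical, and the pitch is assumed to be given per frame in units of frequency bins (0 meaning no pitch found). The sketch also treats every frame without pitch as speech-absent, which is a simplification of a pitch-based VAD, and it assumes the upper bound is applied as an element-wise minimum; the abstract does not specify either detail.

```python
import numpy as np

def extrapolate_noise(spec, speech_absent):
    """Stage 1 (assumed form): per-bin linear interpolation of the noise
    magnitude over time, using only frames flagged as speech-absent."""
    frames = np.arange(spec.shape[0])
    idx = frames[speech_absent]
    if idx.size == 0:
        raise ValueError("need at least one speech-absent frame")
    noise = np.empty_like(spec, dtype=float)
    for k in range(spec.shape[1]):
        # Outside the first/last speech-absent frame, np.interp holds the edge value.
        noise[:, k] = np.interp(frames, idx, spec[idx, k])
    return noise

def tunnelling_estimate(frame, f0_bins):
    """Stage 2 (assumed form): sample the spectrum midway between harmonics
    and interpolate those valley samples across all frequency bins."""
    n_bins = frame.shape[0]
    valleys = np.arange(0.5 * f0_bins, n_bins - 1, f0_bins)
    valleys = np.unique(np.round(valleys).astype(int))
    if valleys.size == 0:
        # No valley inside the spectrum: leave the bound inactive.
        return np.full(n_bins, np.inf)
    return np.interp(np.arange(n_bins), valleys, frame[valleys])

def estimate_noise(spec, f0_bins):
    """Combine both stages: the tunnelling estimate caps the stage-1
    estimate in voiced frames (assumed combination rule)."""
    f0_bins = np.asarray(f0_bins, dtype=float)
    voiced = f0_bins > 0
    noise = extrapolate_noise(spec, ~voiced)
    for t in np.flatnonzero(voiced):
        bound = tunnelling_estimate(spec[t], f0_bins[t])
        noise[t] = np.minimum(noise[t], bound)
    return noise
```

Here `spec` is a magnitude spectrogram of shape (frames, bins). In a missing-data system, a noise estimate of this kind would typically be compared against the noisy spectrogram to decide which time-frequency cells are reliable.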
Pages: 4808-4811
Page count: 4
Related papers
50 in total
  • [41] Maximum Confidence Measure Based Interaural Phase Difference Estimation for Noise Masking in Dual-Microphone Robust Speech Recognition
    Liao, Hsien-Cheng
    Liao, Yuan-Fu
    Lee, Chin-Hui
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 480 - +
  • [42] MODIFIED SPLICE AND ITS EXTENSION TO NON-STEREO DATA FOR NOISE ROBUST SPEECH RECOGNITION
    Kumar, D. S. Pavan
    Prasad, N. Vishnu
    Joshi, Vikas
    Umesh, S.
    2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 174 - 179
  • [43] Robust Noise Estimation Based on Noise Injection
    Tang, Chongwu
    Yang, Xiaokang
    Zhai, Guangtao
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2014, 74 (01): : 69 - 78
  • [44] An effective subband OSF-based VAD with noise reduction for robust speech recognition
    Ramírez, J
    Segura, JC
    Benítez, C
    de la Torre, A
    Rubio, A
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (06): : 1119 - 1129
  • [45] Speech-enhanced and Noise-aware Networks for Robust Speech Recognition
    Lee, Hung-Shin
    Chen, Pin-Yuan
    Cheng, Yao-Fei
    Tsao, Yu
    Wang, Hsin-Min
    2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 145 - 149
  • [46] Robust Noise Estimation Based on Noise Injection
    Chongwu Tang
    Xiaokang Yang
    Guangtao Zhai
    Journal of Signal Processing Systems, 2014, 74 : 69 - 78
  • [47] Transfer learning for acoustic modeling of noise robust speech recognition
    Yi J.
    Tao J.
    Liu B.
    Wen Z.
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2018, 58 (01): : 55 - 60
  • [48] Feature domain compensation of nonstationary noise for robust speech recognition
    Kim, NS
    SPEECH COMMUNICATION, 2002, 37 (3-4) : 231 - 248
  • [49] Accurate estimation of missing data under noise distribution
    Koh, Sung-Shik
    Zin, Thi Thi
    Hama, Hiromitsu
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2006, 52 (02) : 528 - 535
  • [50] Probabilistic Class Histogram Equalization Based on Posterior Mean Estimation for Robust Speech Recognition
    Suh, Youngjoo
    Kim, Hoirin
    IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (12) : 2421 - 2424