Robust Speech Recognition Using a Harmonic Model

被引:0
作者
许超
曹志刚
机构
[1] China
[2] Tsinghua University
[3] Beijing 100084
[4] Department of Electronic Engineering
基金
中国国家自然科学基金;
关键词
robust speech recognition; speech enhancement; pitch extraction; harmonic model;
D O I
暂无
中图分类号
TN912.3 [语音信号处理];
学科分类号
0711 ;
摘要
Automatic speech recognition under conditions of a noisy environment remains a challenging problem. Traditionally, methods focused on noise structure, such as spectral subtraction, have been em-ployed to address this problem, and thus the performance of such methods depends on the accuracy in noise estimation. In this paper, an alternative method, using a harmonic-based spectral reconstruction algo-rithm, is proposed for the enhancement of robust automatic speech recognition. Neither noise estimation nor noise-model training are required in the proposed approach. A spectral subtraction integrated autocorrela-tion function is proposed to determine the pitch for the harmonic model. Recognition results show that the harmonic-based spectral reconstruction approach outperforms spectral subtraction in the middle- and low-signal noise ratio (SNR) ranges. The advantage of the proposed method is more manifest for non-stationary noise, as the algorithm does not require an assumption of stationary noise.
引用
收藏
页码:202 / 206
页数:5
相关论文
共 50 条
  • [21] Combined speech enhancement and auditory modelling for robust distributed speech recognition
    Flynn, Ronan
    Jones, Edward
    SPEECH COMMUNICATION, 2008, 50 (10) : 797 - 809
  • [22] Speech parameters for the robust emotional speech recognition
    Kim W.-G.
    Journal of Institute of Control, Robotics and Systems, 2010, 16 (12) : 1137 - 1142
  • [23] Combining speech enhancement and auditory feature extraction for robust speech recognition
    Kleinschmidt, M
    Tchorz, J
    Kollmeier, B
    SPEECH COMMUNICATION, 2001, 34 (1-2) : 75 - 91
  • [24] ON USING THE AUDITORY IMAGE MODEL AND INVARIANT-INTEGRATION FOR NOISE ROBUST AUTOMATIC SPEECH RECOGNITION
    Mueller, Florian
    Mertins, Alfred
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4905 - 4908
  • [25] Robust recognition of fast speech
    Lee, Ki-Seung
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (08) : 2456 - 2459
  • [26] ON USING THE AUDITORY IMAGE MODEL AND INVARIANT-INTEGRATION FOR NOISE ROBUST AUTOMATIC SPEECH RECOGNITION
    Mueller, Florian
    Mertins, Alfred
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4905 - 4908
  • [27] Robust Speech Recognition using a Small Power Boosting Algorithm
    Kim, Chanwoo
    Kumar, Kshitiz
    Stern, Richard M.
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 243 - 248
  • [28] Speech recognition using FHMMS robust against nonstationary noise
    Betkowska, Agnieszka
    Shinoda, Koichi
    Furui, Sadaoki
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1029 - +
  • [29] Robust speech recognition using discrete-mixture HMMs
    Kosaka, T
    Katoh, M
    Kohda, M
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (12): : 2811 - 2818
  • [30] Domain Adaptation Using Class Similarity for Robust Speech Recognition
    Zhu, Han
    Zhao, Jiangjiang
    Ren, Yuling
    Wang, Li
    Zhang, Pengyuan
    INTERSPEECH 2020, 2020, : 4367 - 4371