Robust Speech Recognition Using a Harmonic Model

被引：0

作者：

许超

曹志刚

机构：

[1] China

[2] Tsinghua University

[3] Beijing 100084

[4] Department of Electronic Engineering

来源：

TsinghuaScienceandTechnology | 2004年 / 02期

基金：

中国国家自然科学基金;

关键词：

robust speech recognition; speech enhancement; pitch extraction; harmonic model;

D O I：

暂无

中图分类号：

TN912.3 [语音信号处理];

学科分类号：

0711 ;

摘要：

Automatic speech recognition under conditions of a noisy environment remains a challenging problem. Traditionally, methods focused on noise structure, such as spectral subtraction, have been em-ployed to address this problem, and thus the performance of such methods depends on the accuracy in noise estimation. In this paper, an alternative method, using a harmonic-based spectral reconstruction algo-rithm, is proposed for the enhancement of robust automatic speech recognition. Neither noise estimation nor noise-model training are required in the proposed approach. A spectral subtraction integrated autocorrela-tion function is proposed to determine the pitch for the harmonic model. Recognition results show that the harmonic-based spectral reconstruction approach outperforms spectral subtraction in the middle- and low-signal noise ratio (SNR) ranges. The advantage of the proposed method is more manifest for non-stationary noise, as the algorithm does not require an assumption of stationary noise.

引用

页码：202 / 206

页数：5

共 50 条

[21] Combined speech enhancement and auditory modelling for robust distributed speech recognition
Flynn, Ronan
Jones, Edward
SPEECH COMMUNICATION, 2008, 50 (10) : 797 - 809
[22] Speech parameters for the robust emotional speech recognition
Kim W.-G.
Journal of Institute of Control, Robotics and Systems, 2010, 16 (12) : 1137 - 1142
[23] Combining speech enhancement and auditory feature extraction for robust speech recognition
Kleinschmidt, M
Tchorz, J
Kollmeier, B
SPEECH COMMUNICATION, 2001, 34 (1-2) : 75 - 91
[24] ON USING THE AUDITORY IMAGE MODEL AND INVARIANT-INTEGRATION FOR NOISE ROBUST AUTOMATIC SPEECH RECOGNITION
Mueller, Florian
Mertins, Alfred
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4905 - 4908
[25] Robust recognition of fast speech
Lee, Ki-Seung
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (08) : 2456 - 2459
[26] ON USING THE AUDITORY IMAGE MODEL AND INVARIANT-INTEGRATION FOR NOISE ROBUST AUTOMATIC SPEECH RECOGNITION
Mueller, Florian
Mertins, Alfred
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4905 - 4908
[27] Robust Speech Recognition using a Small Power Boosting Algorithm
Kim, Chanwoo
Kumar, Kshitiz
Stern, Richard M.
2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 243 - 248
[28] Speech recognition using FHMMS robust against nonstationary noise
Betkowska, Agnieszka
Shinoda, Koichi
Furui, Sadaoki
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1029 - +
[29] Robust speech recognition using discrete-mixture HMMs
Kosaka, T
Katoh, M
Kohda, M
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (12): : 2811 - 2818
[30] Domain Adaptation Using Class Similarity for Robust Speech Recognition
Zhu, Han
Zhao, Jiangjiang
Ren, Yuling
Wang, Li
Zhang, Pengyuan
INTERSPEECH 2020, 2020, : 4367 - 4371

← 1 2 3 4 5 →