Improvement of the speech recognition in noisy environments using a nonparametric regression

被引:3
作者
Amrouche, A. [1 ]
Taleb-Ahmed, A. [2 ]
Rouvaen, J. M. [3 ]
Yagoub, M. C. E. [4 ]
机构
[1] USTHB, Fac Elect & Comp Sci, Bab Ezzouar, Algeria
[2] Valenciennes Univ, UMR CNRS 8530, LAMIH, Le Mont Houy, France
[3] Valenciennes Univ, UMR CNRS 8520, OAE IEMN, Le Mont Houy, France
[4] Univ Ottawa, SITE, Ottawa, ON, Canada
关键词
speech recognition; general regression neural network; hidden Markov model; nonparametric regression; noisy environment; Arabic digits;
D O I
10.1080/17445760802227054
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, an efficient speech recognition system based on the general regression neural network (GRNN) has been presented. The GRNN has been previously applied for phoneme identification and isolated word recognition in quiet environment. We propose to extend this method to Arabic spoken word recognition in adverse conditions because noise robustness is one of the most challenging problems in automatic speech recognition (ASR). The proposed system has been tested for Arabic digit recognition at different signal-to-noise ratio (SNR) levels in various noisy conditions, including stationary and nonstationary background noises issued from NOISEX-92 database. The proposed scheme is compared with the similar recognisers based on the multilayer perceptron (MLP), the Elman recurrent neural network (RNN) and the discrete hidden Markov model (HMM). The experimental results show that the use of the neural network approach including nonparametric regression improves the global performance of the speech recogniser in noisy environments.
引用
收藏
页码:49 / 67
页数:19
相关论文
共 29 条
[1]  
Alotaibi Y.A., 2005, INFORM COMPUTER SCI, V173, P105
[2]  
Amrouche A., 2003, 46 IEEE MWSCAS
[3]  
Amrouche A., 2004, 9 INT C SPEECH COMP
[4]  
Bhatti M.W., 2004, INT S CIRC SYST ISCA
[5]  
Bolat B., 2004, 12 IEEE SIGN PROC CO
[6]  
Bourlard H., 1994, CONNEXIONNIST SPEECH
[7]  
Bourouba H., 2006, IEEE INT C INF COMM
[8]   ESTIMATION OF A MULTIVARIATE DENSITY [J].
CACOULLOS, T .
ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 1966, 18 (02) :179-+
[9]  
Chung K., 1996, 6 AUSTR C SPEECH SCI
[10]   Noise robust speech recognition using feature compensation based on polynomial fly regression of utterance SNR [J].
Cui, XD ;
Alwan, A .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (06) :1161-1172