Automatic speech recognition using a predictive echo state network classifier

Cited by: 171

Authors:

Skowronski, Mark D. [1]

Harris, John G. [1]

Affiliations:

[1] Univ Florida, Computat NeuroEngn Lab, Gainesville, FL 32611 USA

Source:

NEURAL NETWORKS | 2007 / Vol. 20 / Issue 03

Keywords:

echo state network; automatic speech recognition; mixture of experts; noise robustness; SYSTEMS; MODEL;

DOI:

10.1016/j.neunet.2007.04.006

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We have combined an echo state network (ESN) with a competitive state machine framework to create a classification engine called the predictive ESN classifier. We derive the expressions for training the predictive ESN classifier and show that the model was significantly more noise robust compared to a hidden Markov model in noisy speech classification experiments by 8 +/- 1 dB signal-to-noise ratio. The simple training algorithm and noise robustness of the predictive ESN classifier make it an attractive classification engine for automatic speech recognition. (c) 2007 Elsevier Ltd. All rights reserved.

Pages: 414-423

Page count: 10
