Dynamic programming prediction errors of recurrent neural fuzzy networks for speech recognition

被引：8

作者：

Juang, Chia-Feng ^{[1
]}

Lai, Chun-Lung ^{[1
]}

Tu, Chiu-Chuan ^{[1
]}

机构：

[1] Natl Chung Hsing Univ, Dept Elect Engn, Taichung 402, Taiwan

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2009年 / 36卷 / 03期

关键词：

Phrase recognition; Recurrent fuzzy systems; Fuzzy neural networks; Recurrent neural fuzzy networks; Noisy speech recognition; HIDDEN MARKOV-MODELS; NOISE; ADAPTATION; SYSTEM;

D O I：

10.1016/j.eswa.2008.07.061

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes Mandarin phrase recognition using dynamic programming (DP) prediction errors of singleton-type recurrent neural fuzzy networks (SRNFNs). This method is called DP-SRNFN. The recurrent property of SRNFN makes it suitable for processing temporal speech patterns. A Mandarin phrase comprises monosyllabic words. SRNFN training is based on the word unit. There are N-w SRNFNs for modeling Nw words, and each SRNFN receives the current frame feature and predicts the next one of its modeling word. In recognizing N-P phrases, the prediction error of each trained SRNFN is computed, and DP is used to find the optimal path that maps the input frames to the best matched SRNFNs (words) for each of the N-P phrases. The accumulated error of each phrase model is computed from its optimal path and the one with the minimum error is the recognition result. To verify DP-SRNFN performance, this study conducted experiments on recognizing 30 Mandarin phrases. SRNFN training with noisy features for phrase recognition under different noisy environments was also conducted. DP-SRNFN performance is compared with the hidden Markov models (HMMs). Results show that DP-SRNFN achieves higher recognition rates than HMM in both clean and noisy environments. (C) 2008 Elsevier Ltd. All rights reserved.

引用

页码：6368 / 6374

页数：7

共 23 条

[1]

Ahmad AM, 2004, IEEE INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2004 (ISCIT 2004), PROCEEDINGS, VOLS 1 AND 2, P98

[2]

BREMS DJ, 1994, P 2 IEEE WORKSH INT, P117

[3] Adaptation of hidden Markov models using maximum model distance algorithm [J].

He, QH ;

Kwong, S ;

Hong, QY .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2004, 34 (02) :270-276

[4]

Hunt MJ, 2001, ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, P455, DOI 10.1109/ASRU.2001.1034683

[5]

ISO K, 1990, P IEEE INT C SIGNAL, V1, P441

[6]

JIN GZ, 2006, P INT JOINT C NEUR N, P653

[7] A TSK-type recurrent fuzzy network for dynamic systems processing by neural network and genetic algorithms [J].

Juang, CF .

IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2002, 10 (02) :155-170

[8] Hierarchical singleton-type recurrent neural fuzzy networks for noisy speech recognition [J].

Juang, Chia-Feng ;

Chiou, Chyi-Tian ;

Lai, Chun-Lung .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 2007, 18 (03) :833-843

[9] Identification and prediction using recurrent compensatory neuro-fuzzy systems [J].

Lin, CJ ;

Chen, CH .

FUZZY SETS AND SYSTEMS, 2005, 150 (02) :307-330

[10] A robust word boundary detection algorithm for variable noise-level environment in cars [J].

Lin, CT ;

Lin, JY ;

Wu, GD .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2002, 3 (01) :89-101

← 1 2 3 →