A new parameter smoothing method in the hybrid TDNN/HMM architecture for speech recognition

被引:3
作者
Jang, CS [1 ]
Un, CK [1 ]
机构
[1] KOREA ADV INST SCI & TECHNOL, DEPT ELECT ENGN, COMMUN RES LAB, TAEJON 305701, SOUTH KOREA
关键词
D O I
10.1016/S0167-6393(96)00052-0
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a new parameter smoothing method in the hybrid time-delay neural network (TDNN)/hidden Markov model (HMM) architecture for speech recognition. In the hybrid architecture, the TDNN and the HMM are combined using the activations from the second hidden layer of TDNN as the outputs of a fuzzy vector quantizer (FVQ). The HMM algorithm is modified to accommodate these FVQ outputs. in our modular construction of TDNN, the input layer is divided into two states to deal with the temporal structure of phonemic features, and the second hidden layer consists of two states in a time sequence. To improve the performance of the hybrid architecture, a new smoothing method has been proposed. The average values of the activation vectors from the second hidden layer of the modular TDNN are used to generate the smoothing matrix from which smoothed output symbol observation probability is obtained. With this proposed approach, our simulation results performed on speaker-independent Korean isolated words show the reduction of the error rate by 44.9% as compared to the floor smoothing method.
引用
收藏
页码:317 / 324
页数:8
相关论文
共 50 条
[21]   Comparison and combination of features in a hybrid HMM/MLP and a HMM/GMM speech recognition system [J].
Pujol, P ;
Pol, S ;
Nadeu, C ;
Hagen, A ;
Bourlard, H .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (01) :14-22
[22]   A Hybrid HMM/ANN Approach for Automatic Gujarati Speech Recognition [J].
Valaki, Sanjay ;
Jethva, Harikrishna .
2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2017,
[23]   An HMM/MLP hybrid approach for improving discrimination in speech recognition [J].
Na, K ;
Chae, SI .
IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE, 1998, :156-159
[24]   Recognition of Chinese speech using hybrid HMM/HNN models [J].
Jia, Ying ;
Du, Limin ;
Hou, Ziqiang .
International Conference on Signal Processing Proceedings, ICSP, 1998, 1 :726-729
[25]   Hybrid SVM/HMM Method for Face Recognition [J].
刘江华 ;
陈佳品 ;
程君实 .
Journal of DongHua University, 2004, (01) :34-38
[26]   Recognition of Chinese speech using hybrid HMM HNN models [J].
Jia, Y ;
Du, LM ;
Hou, ZQ .
ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, :726-729
[27]   A survey of hybrid ANN/HMM models for automatic speech recognition [J].
Trentin, E ;
Gori, M .
NEUROCOMPUTING, 2001, 37 :91-126
[28]   Hybrid HMM/BLSTM-RNN for Robust Speech Recognition [J].
Sun, Yang ;
ten Bosch, Louis ;
Boves, Lou .
TEXT, SPEECH AND DIALOGUE, 2010, 6231 :400-407
[29]   Speech/speaker recognition using a HMM/GMM hybrid model [J].
Rodriguez, E ;
Ruiz, B ;
Garcia-Crespo, A ;
Garcia, F .
AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 :227-234
[30]   Improved hybrid ANN/HMM model applies in speech recognition [J].
Xi, XJ ;
Lin, KH .
ISTM/2005: 6th International Symposium on Test and Measurement, Vols 1-9, Conference Proceedings, 2005, :1373-1376