A new parameter smoothing method in the hybrid TDNN/HMM architecture for speech recognition

被引:3
|
作者
Jang, CS [1 ]
Un, CK [1 ]
机构
[1] KOREA ADV INST SCI & TECHNOL, DEPT ELECT ENGN, COMMUN RES LAB, TAEJON 305701, SOUTH KOREA
关键词
D O I
10.1016/S0167-6393(96)00052-0
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a new parameter smoothing method in the hybrid time-delay neural network (TDNN)/hidden Markov model (HMM) architecture for speech recognition. In the hybrid architecture, the TDNN and the HMM are combined using the activations from the second hidden layer of TDNN as the outputs of a fuzzy vector quantizer (FVQ). The HMM algorithm is modified to accommodate these FVQ outputs. in our modular construction of TDNN, the input layer is divided into two states to deal with the temporal structure of phonemic features, and the second hidden layer consists of two states in a time sequence. To improve the performance of the hybrid architecture, a new smoothing method has been proposed. The average values of the activation vectors from the second hidden layer of the modular TDNN are used to generate the smoothing matrix from which smoothed output symbol observation probability is obtained. With this proposed approach, our simulation results performed on speaker-independent Korean isolated words show the reduction of the error rate by 44.9% as compared to the floor smoothing method.
引用
收藏
页码:317 / 324
页数:8
相关论文
共 50 条
  • [1] Experimenting with Hybrid TDNN/HMM Acoustic Models for Russian Speech Recognition
    Kipyatkova, Irina
    SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 362 - 369
  • [2] Combining TDNN and HMM in a Hybrid System for Improved Continuous-Speech Recognition
    Dugast, Christian
    Devillers, Laurence
    Aubert, Xavier
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01): : 217 - 223
  • [3] Chinese Speech Recognition Based on a Hybrid SVM and HMM Architecture
    Luo, Xingxian
    ADVANCES IN NEURAL NETWORKS - ISNN 2011, PT III, 2011, 6677 : 629 - 635
  • [4] A speech recognition system based on a hybrid HMM/SVM architecture
    Qu Zhi-yi
    Liu Yu
    Zhang Li-hong
    Shao Ming-xin
    ICICIC 2006: FIRST INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING, INFORMATION AND CONTROL, VOL 2, PROCEEDINGS, 2006, : 100 - +
  • [5] Distributed TDNN-Fuzzy Vector Quantization For HMM Speech Recognition
    Debyeche, Mohamed
    Amrouche, Aderrahmane.
    Haton, Jean Paul
    2009 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS 2009), 2009, : 72 - +
  • [6] FUZZY SMOOTHING OF HMM PARAMETERS IN SPEECH RECOGNITION
    KOO, JM
    UN, CK
    ELECTRONICS LETTERS, 1990, 26 (11) : 743 - 744
  • [7] DELETED SMOOTHING OF HMM PARAMETERS IN SPEECH RECOGNITION
    KIM, NS
    UN, CK
    ELECTRONICS LETTERS, 1993, 29 (09) : 735 - 736
  • [8] New feedback method of hybrid HMM/ANN methods for continuous speech recognition
    Lee, TZ
    Chen, DW
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 509 - 512
  • [9] A new hybrid HMM/ANN model for speech recognition
    Xi, XJ
    Lin, KH
    Zhou, CL
    Cai, J
    Artificial Intelligence Applications and Innovations II, 2005, 187 : 223 - 230
  • [10] USE OF KOHONEN SELF-ORGANIZING FEATURE MAPS FOR HMM PARAMETER SMOOTHING IN SPEECH RECOGNITION
    ZHAO, Z
    ROWDEN, CG
    IEE PROCEEDINGS-F RADAR AND SIGNAL PROCESSING, 1992, 139 (06) : 385 - 390