A new parameter smoothing method in the hybrid TDNN/HMM architecture for speech recognition

被引:3
作者
Jang, CS [1 ]
Un, CK [1 ]
机构
[1] KOREA ADV INST SCI & TECHNOL, DEPT ELECT ENGN, COMMUN RES LAB, TAEJON 305701, SOUTH KOREA
关键词
D O I
10.1016/S0167-6393(96)00052-0
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a new parameter smoothing method in the hybrid time-delay neural network (TDNN)/hidden Markov model (HMM) architecture for speech recognition. In the hybrid architecture, the TDNN and the HMM are combined using the activations from the second hidden layer of TDNN as the outputs of a fuzzy vector quantizer (FVQ). The HMM algorithm is modified to accommodate these FVQ outputs. in our modular construction of TDNN, the input layer is divided into two states to deal with the temporal structure of phonemic features, and the second hidden layer consists of two states in a time sequence. To improve the performance of the hybrid architecture, a new smoothing method has been proposed. The average values of the activation vectors from the second hidden layer of the modular TDNN are used to generate the smoothing matrix from which smoothed output symbol observation probability is obtained. With this proposed approach, our simulation results performed on speaker-independent Korean isolated words show the reduction of the error rate by 44.9% as compared to the floor smoothing method.
引用
收藏
页码:317 / 324
页数:8
相关论文
共 50 条
  • [31] Improved hybrid ANN/HMM model applies in speech recognition
    Xi, XJ
    Lin, KH
    [J]. ISTM/2005: 6th International Symposium on Test and Measurement, Vols 1-9, Conference Proceedings, 2005, : 1373 - 1376
  • [32] A hybrid speech recognition model based on HMM and fuzzy PPM
    Bao, P
    Sim, A
    [J]. 1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 4148 - 4153
  • [33] HMM/ANN hybrid model for continuous Malayalam speech recognition
    Mohamed, Anuj
    Nair, K. N. Ramachandran
    [J]. INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY AND SYSTEM DESIGN 2011, 2012, 30 : 616 - 622
  • [34] A hybrid HMM/BN acoustic model for automatic speech recognition
    Markov, K
    Nakamura, S
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (03): : 438 - 445
  • [35] Nonspecific Speech Recognition based on HMM/LVQ Hybrid Network
    Liang Shuling
    Wang Chaoli
    Du Jiaming
    [J]. ICICTA: 2009 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION, VOL I, PROCEEDINGS, 2009, : 645 - 648
  • [36] Gas Mixture Recognition Method with New Hybrid Architecture
    Zhang, Shuangyan
    Yu, Jun
    Wei, Guangfen
    Tang, Zhen'an
    Chen, Yi
    Cui, Yuanhui
    [J]. ADVANCES IN SCIENCE AND ENGINEERING, PTS 1 AND 2, 2011, 40-41 : 604 - +
  • [37] A new approach to hybrid HMM/ANN speech recognition using mutual information neural networks
    Rigoll, G
    Neukirchen, C
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 9: PROCEEDINGS OF THE 1996 CONFERENCE, 1997, 9 : 772 - 778
  • [38] A new robust hybrid speech recognition algorithm based on FVQ/HMM and neural nets classification
    Asghar, S
    Cong, L
    [J]. INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-IV, PROCEEDINGS, 1998, : 1810 - 1816
  • [39] A comparison between HMM and hybrid ANN-HMM based systems for continuous speech recognition
    Ynoguti, CA
    Morais, ED
    Violaro, F
    [J]. ITS '98 PROCEEDINGS - SBT/IEEE INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM, VOLS 1 AND 2, 1998, : 135 - 140
  • [40] Hybrid Architecture for Robust Speech Recognition System
    Pasricha, Vishal
    Aggarwal, Rajesh
    [J]. 2016 INTERNATIONAL CONFERENCE ON RECENT ADVANCES AND INNOVATIONS IN ENGINEERING (ICRAIE), 2016,