On the Exploitation of Hidden Markov Models and Linear Dynamic Models in a Hybrid Decoder Architecture for Continuous Speech Recognition

被引:0
作者
Leutnant, Volker [1 ]
Haeb-Umbach, Reinhold [1 ]
机构
[1] Univ Paderborn, Dept Commun Engn, Paderborn, Germany
来源
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 | 2010年
关键词
speech recognition; hybrid decoder architecture; acoustic modeling; linear dynamic models; ALGORITHM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Linear dynamic models (LDMs) have been shown to be a viable alternative to hidden MARKOV models (HMMs) on small-vocabulary recognition tasks, such as phone classification. In this paper we investigate various statistical model combination approaches for a hybrid HMM-LDM recognizer, resulting in a phone classification performance that outperforms the best individual classifier. Further, we report on continuous speech recognition experiments on the AURORA4 corpus, where the model combination is carried out on wordgraph rescoring. While the hybrid system improves the HMM system in the case of monophone HMMs, the performance of the triphone HMM model could not be improved by monophone LDMs, asking for the need to introduce context-dependency also in the LDM model inventory.
引用
收藏
页码:2946 / 2949
页数:4
相关论文
共 50 条
  • [41] Boosted Mixture Learning of Gaussian Mixture Hidden Markov Models Based on Maximum Likelihood for Speech Recognition
    Du, Jun
    Hu, Yu
    Jiang, Hui
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07): : 2091 - 2100
  • [42] Generalized linear latent models for multivariate longitudinal measurements mixed with hidden Markov models
    Xia, Ye-Mao
    Tang, Nian-Sheng
    Gou, Jian-Wei
    JOURNAL OF MULTIVARIATE ANALYSIS, 2016, 152 : 259 - 275
  • [43] HYBRID DNN-LATENT STRUCTURED SVM ACOUSTIC MODELS FOR CONTINUOUS SPEECH RECOGNITION
    Ravuri, Suman
    2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 37 - 44
  • [44] EXPERIMENTS WITH A NONLINEAR SPECTRAL SUBTRACTOR (NSS), HIDDEN MARKOV-MODELS AND THE PROJECTION, FOR ROBUST SPEECH RECOGNITION IN CARS
    LOCKWOOD, P
    BOUDY, J
    SPEECH COMMUNICATION, 1992, 11 (2-3) : 215 - 228
  • [45] Diversified learning for continuous hidden Markov models with application to fault diagnosis
    Li, Zefang
    Fang, Huajing
    Huang, Ming
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (23) : 9165 - 9173
  • [46] THE HIDDEN MARKOV MODEL OF CO-ARTICULATION AND ITS APPLICATION TO THE CONTINUOUS SPEECH RECOGNITION
    Lee Tranzai Zheng Fang Wu Wenhu Chen Daowen(Speech Lab.
    Journal of Electronics(China), 2000, (03) : 242 - 247
  • [47] HAC-models: a Novel Approach to Continuous Speech Recognition
    Van Hamme, Hugo
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2554 - 2557
  • [48] TASK ADAPTATION IN SYLLABLE TRIGRAM MODELS FOR CONTINUOUS SPEECH RECOGNITION
    MATSUNAGA, S
    YAMADA, T
    SHIKANO, K
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1993, E76D (01) : 38 - 43
  • [49] USE OF DIFFERENT NUMBERS OF MIXTURES IN CONTINUOUS DENSITY HIDDEN MARKOV-MODELS
    CHUNG, YJ
    UN, CK
    ELECTRONICS LETTERS, 1993, 29 (09) : 824 - 825
  • [50] Unsupervised Parameter Selection for Gesture Recognition with Vector Quantization and Hidden Markov Models
    Glomb, Przemyslaw
    Romaszewski, Michal
    Sochan, Arkadiusz
    Opozda, Sebastian
    HUMAN-COMPUTER INTERACTION - INTERACT 2011, PT IV, 2011, 6949 : 170 - 177