On the Exploitation of Hidden Markov Models and Linear Dynamic Models in a Hybrid Decoder Architecture for Continuous Speech Recognition

被引:0
作者
Leutnant, Volker [1 ]
Haeb-Umbach, Reinhold [1 ]
机构
[1] Univ Paderborn, Dept Commun Engn, Paderborn, Germany
来源
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 | 2010年
关键词
speech recognition; hybrid decoder architecture; acoustic modeling; linear dynamic models; ALGORITHM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Linear dynamic models (LDMs) have been shown to be a viable alternative to hidden MARKOV models (HMMs) on small-vocabulary recognition tasks, such as phone classification. In this paper we investigate various statistical model combination approaches for a hybrid HMM-LDM recognizer, resulting in a phone classification performance that outperforms the best individual classifier. Further, we report on continuous speech recognition experiments on the AURORA4 corpus, where the model combination is carried out on wordgraph rescoring. While the hybrid system improves the HMM system in the case of monophone HMMs, the performance of the triphone HMM model could not be improved by monophone LDMs, asking for the need to introduce context-dependency also in the LDM model inventory.
引用
收藏
页码:2946 / 2949
页数:4
相关论文
共 50 条
  • [1] Continuous speech recognition using linear dynamic models
    Ma, Tao
    Srinivasan, Sundararajan
    Lazarou, Georgios
    Picone, Joseph
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (01) : 11 - 16
  • [2] Noisy Hidden Markov Models for Speech Recognition
    Audhkhasi, Kartik
    Osoba, Osonde
    Kosko, Bart
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [3] Application of continuous state Hidden Markov Models to a classical problem in speech recognition
    Champion, Colin
    Houghton, S. M.
    COMPUTER SPEECH AND LANGUAGE, 2016, 36 : 347 - 364
  • [4] A speech recognition IC using hidden markov models with continuous observation densities
    Han, Wei
    Hon, Kwok-Wai
    Chan, Cheong-Fat
    Choy, Chiu-Sing
    Pun, Kong-Pang
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2007, 47 (03): : 223 - 232
  • [5] A Speech Recognition IC Using Hidden Markov Models with Continuous Observation Densities
    Wei Han
    Kwok-Wai Hon
    Cheong-Fat Chan
    Chiu-Sing Choy
    Kong-Pang Pun
    The Journal of VLSI Signal Processing Systems for Signal, Image, and Video Technology, 2007, 47 : 223 - 232
  • [6] A configurable logic based architecture for real-time continuous speech recognition using hidden Markov models
    Stogiannos, P
    Dollas, A
    Digalakis, V
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2000, 24 (2-3): : 223 - 240
  • [7] A Configurable Logic Based Architecture for Real-Time Continuous Speech Recognition Using Hidden Markov Models
    Panagiotis Stogiannos
    Apostolos Dollas
    Vassilis Digalakis
    Journal of VLSI signal processing systems for signal, image and video technology, 2000, 24 : 223 - 240
  • [8] HYBRID APPROACH TO SPEECH RECOGNITION USING HIDDEN MARKOV-MODELS AND MARKOV-CHAINS
    DAI, J
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1994, 141 (05): : 273 - 279
  • [9] BAYESIAN SENSING HIDDEN MARKOV MODELS FOR SPEECH RECOGNITION
    Saon, George
    Chien, Jen-Tzung
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5056 - 5059
  • [10] Hidden-articulator Markov models for speech recognition
    Richardson, M
    Bilmes, J
    Diorio, C
    SPEECH COMMUNICATION, 2003, 41 (2-3) : 511 - 529