Online hierarchical transformation of hidden Markov models for speech recognition

被引:31
|
作者
Chien, JT [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1999年 / 7卷 / 06期
关键词
approximate Bayesian estimate; EM algorithm; hidden Markov models; online hierarchical transformation; speaker adaptation; speech recognition;
D O I
10.1109/89.799691
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes a novel framework of online hierarchical transformation of hidden Markov model (HMM) parameters for adaptive speech recognition. Our goal is to incrementally transform (or adapt) all the HMM parameters to a new acoustical environment even though most of HMM units are unseen in observed adaptation data. We establish a hierarchical tree of HMM units and apply the tree to dynamically search the transformation parameters for individual HMM mixture components. In this paper, the transformation framework is formulated according to the approximate Bayesian estimate, which the prior statistics and the transformation parameters can be jointly and incrementally refreshed after each consecutive adaptation data is presented. Using this formulation, only the refreshed prior statistics and the current block of data are needed for online transformation. In a series of speaker adaptation experiments on the recognition of 408 Mandarin syllables, we examine the effects on constructing various types of hierarchical trees. The efficiency and effectiveness of proposed method on incremental adaptation of overall HMM units are also confirmed. Besides, we demonstrate the superiority of proposed online transformation to Hue's on-line adaptation [16] for a wide range of adaptation data.
引用
收藏
页码:656 / 667
页数:12
相关论文
共 50 条
  • [1] Online human activity recognition employing hierarchical hidden Markov models
    Parviz Asghari
    Elnaz Soleimani
    Ehsan Nazerfard
    Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 1141 - 1152
  • [2] Online human activity recognition employing hierarchical hidden Markov models
    Asghari, Parviz
    Soleimani, Elnaz
    Nazerfard, Ehsan
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (03) : 1141 - 1152
  • [3] HIDDEN MARKOV MODELS IN SPEECH RECOGNITION
    Krajcovic, J.
    Hrncar, M.
    Muzikarova, E.
    ADVANCES IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2008, 7 (1-2) : 250 - 252
  • [4] Online unsupervised learning of hidden Markov models for adaptive speech recognition
    Chien, JT
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2001, 148 (05): : 315 - 324
  • [5] The Application of Hidden Markov Models in Speech Recognition
    Gales, Mark
    Young, Steve
    FOUNDATIONS AND TRENDS IN SIGNAL PROCESSING, 2007, 1 (03): : 195 - 304
  • [6] Noisy Hidden Markov Models for Speech Recognition
    Audhkhasi, Kartik
    Osoba, Osonde
    Kosko, Bart
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [7] Hidden Markov models for speech and signal recognition
    Rose, RC
    Juang, BH
    CONTINUOUS WAVE-FORM ANALYSIS, 1996, (45): : 137 - 152
  • [8] HIDDEN MARKOV-MODELS FOR SPEECH RECOGNITION
    JUANG, BH
    RABINER, LR
    TECHNOMETRICS, 1991, 33 (03) : 251 - 272
  • [9] Graphical Models for Discrete Hidden Markov Models in Speech Recognition
    Miguel, Antonio
    Ortega, Alfonso
    Buera, Luis
    Lleida, Eduardo
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1387 - 1390
  • [10] Hidden-articulator Markov models for speech recognition
    Richardson, M
    Bilmes, J
    Diorio, C
    SPEECH COMMUNICATION, 2003, 41 (2-3) : 511 - 529