Efficient training algorithms for HMM's using incremental estimation

Cited by: 18
Authors
Gotoh, Y [1]
Hochberg, MM [1]
Silverman, HF [1]
Affiliation
[1] Univ Sheffield, Dept Comp Sci, Sheffield S1 4DP, S Yorkshire, England
Source
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1998 / Vol. 6 / No. 6
Funding
National Science Foundation (USA);
Keywords
HMM training algorithm; incremental estimation; MAP estimation;
DOI
10.1109/89.725320
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Typically, parameter estimation for a hidden Markov model (HMM) is performed using an expectation-maximization (EM) algorithm with the maximum-likelihood (ML) criterion. The EM algorithm is an iterative scheme that is well-defined and numerically stable, but convergence may require a large number of iterations. For speech recognition systems utilizing large amounts of training material, this results in long training times. This paper presents an incremental estimation approach to speed up the training of HMM's without any loss of recognition performance. The algorithm selects a subset of data from the training set, updates the model parameters based on the subset, and then iterates the process until convergence of the parameters. The advantage of this approach is a substantial increase in the number of iterations of the EM algorithm per training token, which leads to faster training. In order to achieve reliable estimation from a small fraction of the complete data set at each iteration, two training criteria are studied: ML and maximum a posteriori (MAP) estimation. Experimental results show that the training of the incremental algorithms is substantially faster than the conventional (batch) method and suffers no loss of recognition performance. Furthermore, the incremental MAP-based training algorithm improves performance over the batch version.
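The select-a-subset, update, repeat loop described in the abstract can be illustrated with a toy sketch. This is not the paper's algorithm: the abstract does not give its update rules, so the sketch swaps in a one-dimensional two-component Gaussian mixture in place of an HMM and uses a generic block-wise incremental EM that caches per-block sufficient statistics. The function names (`block_stats`, `m_step`, `incremental_em`) and all parameter values are hypothetical choices made for illustration.

```python
import numpy as np

def block_stats(x, weights, means, variances):
    """E-step on one block: expected sufficient statistics, shape (3, k)."""
    # Log of weight * Gaussian density for every point and component, shape (n, k).
    log_p = (np.log(weights)
             - 0.5 * np.log(2.0 * np.pi * variances)
             - 0.5 * (x[:, None] - means) ** 2 / variances)
    log_p -= log_p.max(axis=1, keepdims=True)  # stabilise before exponentiating
    resp = np.exp(log_p)
    resp /= resp.sum(axis=1, keepdims=True)    # posterior responsibilities
    s0 = resp.sum(axis=0)                      # soft counts
    s1 = (resp * x[:, None]).sum(axis=0)       # weighted sums
    s2 = (resp * x[:, None] ** 2).sum(axis=0)  # weighted sums of squares
    return np.stack([s0, s1, s2])

def m_step(stats, var_floor=1e-3):
    """M-step: ML parameters from accumulated sufficient statistics."""
    s0, s1, s2 = stats
    weights = s0 / s0.sum()
    means = s1 / s0
    variances = np.maximum(s2 / s0 - means ** 2, var_floor)
    return weights, means, variances

def incremental_em(x, n_blocks=10, n_passes=5, seed=0):
    """Update parameters after every block instead of after the full data set."""
    rng = np.random.default_rng(seed)
    blocks = np.array_split(rng.permutation(x), n_blocks)
    # Crude initialisation (an assumption of this sketch).
    weights = np.array([0.5, 0.5])
    means = np.array([x.min(), x.max()])
    variances = np.full(2, x.var())
    # One initial pass fills the per-block cache of statistics.
    cached = [block_stats(b, weights, means, variances) for b in blocks]
    total = sum(cached)
    for _ in range(n_passes):
        for i, b in enumerate(blocks):
            new = block_stats(b, weights, means, variances)
            total = total + new - cached[i]  # swap in this block's fresh statistics
            cached[i] = new
            weights, means, variances = m_step(total)  # update after each block
    return weights, means, variances
```

With `n_blocks=10`, each pass over the data yields ten parameter updates rather than one, which is the kind of gain in updates per training token that the abstract points to. A MAP variant would add prior terms to the statistics in `m_step`; that is omitted here because the abstract does not specify the prior.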
Pages: 539-548
Page count: 10