Speech Recognition Using Hidden Markov Models with Polynomial Regression Functions as Nonstationary States

被引:84
作者
Deng, Li [1 ]
Aksmanovic, Mike [1 ]
Sun, Xiaodong [2 ]
Wu, C. F. Jeff [2 ]
机构
[1] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
[2] Univ Waterloo, Dept Stat & Actuarial Sci, Waterloo, ON N2L 3G1, Canada
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1994年 / 2卷 / 04期
基金
加拿大自然科学与工程研究理事会;
关键词
11;
D O I
10.1109/89.326610
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose, implement, and evaluate a class of nonstationary-state hidden Markov models (HMM's) having each state associated with a distinct polynomial regression function of time plus white Gaussian noise. The model represents the transitional acoustic trajectories of speech in a parametric manner, and includes the standard stationary-state HMM as a special, degenerated case. We develop an efficient dynamic programming technique which includes the state sojourn time as an optimization variable, in conjunction with a state-dependent orthogonal polynomial regression method, for estimating the model parameters. Experiments on fitting models to speech data and on limited-vocabulary speech recognition demonstrate consistent superiority of these nonstationary-state HMM's over the traditional stationary-state HMM's.
引用
收藏
页码:507 / 520
页数:14
相关论文
共 11 条