Adaptive Compensation Algorithm in Open Vocabulary Mandarin Speaker-Independent Speech Recognition

被引:0
作者
FadhilH.T.Al-dulaimy
王作英
田野
机构
关键词
mel-frequency; cepstrum coefficients; speech recognition; duration distribution based hidden Markov model;
D O I
暂无
中图分类号
TN912 [电声技术和语音信号处理];
学科分类号
081002 ;
摘要
In speech recognition systems, the physiological characteristics of the speech production model cause the voiced sections of the speech signal to have an attenuation of approximately 20 dB per decade. Many speech recognition algorithms have been developed to solve this problem by filtering the input signal with a single-zero high pass filter. Unfortunately, this technique increases the noise energy at high frequencies above 4 kHz, which in some cases degrades the recognition accuracy. This paper solves the problem using a pre-emphasis filter in the front end of the recognizer. The aim is to develop a modified parameterization approach taking into account the whole energy zone in the spectrum to improve the performance of the existing baseline recognition system in the acoustic phase. The results show that a large vocabulary speaker-independent continuous speech recognition system using this approach has a greatly improved recognition rate.
引用
收藏
页码:521 / 526
页数:6
相关论文
共 4 条
  • [1] Vergin R,O’ Shaughnessy D.Generalized mel frequency cepstral coefficients for large vocabulary speaker -independent continuous speech recognition. I EEE Transaction on Speech and Audio Processing . 1999
  • [2] Allen J B.How do humans process and recognize speech. I EEE Transaction on Speech and Audio Processing . 1994
  • [3] Wang Zuoying.An inhomogeneous HMM speech recognition algorithm. C hinese Journal of Electronics . 1998
  • [4] Pcone J W.Signal modeling techniques in speech recognition. Proceedings of the IEEE . 1993