A study on continuous Chinese speech recognition based on stochastic trajectory models

Cited by: 0

Authors:

Ma, XH

Gong, YF

Fu, YQ

Lu, J

Haton, JP

Affiliations:

Source:

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

This paper first introduces the theory of Stochastic Trajectory Models (STMs). An STM represents the acoustic observations of a speech unit as clusters of trajectories in a parameter space. The trajectories are modeled by a mixture of probability density functions of random sequences of states. Each state is associated with a multivariate Gaussian density function, optimized at the state-sequence level. Because STM does not rely on the HMM assumptions, it can exploit information, such as time correlation within an observation sequence, that those assumptions hide. After analyzing the characteristics of Chinese speech, the paper discusses acoustic units for recognizing continuous Chinese speech with Stochastic Trajectory Models and suggests phone-like units, which are similar to or smaller than Initial-Final-like units. The total number of phone-like units (about 50) is smaller than that of almost all other Chinese speech recognition systems. Consequently, the training database can be very small. The performance of continuous Chinese speech recognition based on STM is studied using the VINICS system. The experimental results demonstrate the efficiency of STM and the consistency of phone-like units.
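
The mixture-of-trajectories idea in the abstract can be illustrated with a small sketch. It is not taken from the paper: the fixed-length linear resampling, the diagonal covariances, the function names (`resample`, `log_gauss_diag`, `stm_log_likelihood`), and the toy one-dimensional data are all assumptions made for illustration. Under those assumptions, each mixture component is one trajectory cluster, described by a sequence of per-state Gaussians, and a whole observation sequence is scored against the full state sequence rather than frame by frame.

```python
import math

def resample(frames, q):
    """Linearly resample a list of feature vectors to exactly q points (assumed preprocessing)."""
    n = len(frames)
    if n == 1:
        return [list(frames[0]) for _ in range(q)]
    out = []
    for i in range(q):
        pos = i * (n - 1) / (q - 1) if q > 1 else 0.0
        lo = int(math.floor(pos))
        hi = min(lo + 1, n - 1)
        w = pos - lo
        out.append([(1 - w) * a + w * b for a, b in zip(frames[lo], frames[hi])])
    return out

def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at x."""
    return sum(-0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
               for xi, m, v in zip(x, mean, var))

def stm_log_likelihood(frames, components):
    """Log of sum_k w_k * prod_i N(x_i; mean[k][i], var[k][i]).

    components: list of (weight, means, variances); means and variances
    each hold one vector per state. This layout is hypothetical.
    """
    q = len(components[0][1])
    points = resample(frames, q)
    logs = []
    for weight, means, variances in components:
        ll = math.log(weight)
        for x, m, v in zip(points, means, variances):
            ll += log_gauss_diag(x, m, v)
        logs.append(ll)
    top = max(logs)  # log-sum-exp for numerical stability
    return top + math.log(sum(math.exp(l - top) for l in logs))

# Toy data: two trajectory clusters that visit the same values in opposite order.
rising = (0.5, [[0.0], [1.0], [2.0]], [[1.0], [1.0], [1.0]])
falling = (0.5, [[2.0], [1.0], [0.0]], [[1.0], [1.0], [1.0]])
obs_up = [[0.0], [0.5], [1.0], [1.5], [2.0]]

score_rising = stm_log_likelihood(obs_up, [rising])
score_falling = stm_log_likelihood(obs_up, [falling])
score_mixture = stm_log_likelihood(obs_up, [rising, falling])
```

In this toy setup the rising and falling clusters cover the same set of values, so only the order of the frames separates them. Scoring at the sequence level lets the rising observation prefer the rising cluster, which is one way to read the abstract's point about time correlation within an observation sequence.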


Pages: 482-485

Page count: 4