A feature-based hierarchical speech recognition system for Hindi

被引：0

作者：

K Samudravijaya

R Ahuja

N Bondale

T Jose

S Krishnan

P Poddar

xxPVS Rao

R Raveendran

机构：

[1] Tata Institute of Fundamental Research,Computer Systems and Communications Group

来源：

Sadhana | 1998年 / 23卷

关键词：

Speech recognition; hierarchical approach; Hindi; knowledge integration; natural language processing;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

This paper presents a description of a speech recognition system forHindi. The system follows a hierarchic approach to speech recognition and integrates multiple knowledge sources within statistical pattern recognition paradigms at various stages of signal decoding. Rather than make hard decisions at the level of each processing unit, relative confidence scores of individual units are propagated to higher levels. Phoneme recognition is achieved in two stages: broad acoustic classification of a frame is followed by fine acoustic classification. A semi-Markov model processes the frame level outputs of a broad acoustic maximum likelihood classifier to yield a sequence of segments with broad acoustic labels. The phonemic identities of selected classes of segments are decoded by class-dependent neural nets which are trained with class-specific feature vectors as input. Lexical access is achieved by string matching using a dynamic programming technique. A novel language processor disambiguates between multiple choices given by the acoustic recognizer to recognize the spoken sentence.

引用

页码：313 / 340

页数：27

共 14 条

[1]

Davis K(1994)Stop voicing in Hindi J. Phonet. 22 177-193

[2]

Davis S(1980)Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences IEEE Trans- Acoust. Speech Signal Process. 28 357-366

[3]

Mermelstein P(1996)Synthesis of unlimited speech in Indian languages using formant-based rules Sadhana 21 345-362

[4]

Furtado X A(1971)On dimensionality and sample size in statistical pattern classification Pattern Recogn. 3 225-234

[5]

Sen A(1985)Structural methods in automatic speech recognition Proc. IEEE 73 1625-1650

[6]

Kanal L N(1975)An algorithm for determining the end-points of isolated utterances Bell Syst. Tech. J. 54 297-315

[7]

Chandrasekaran B(1993)VOICE: an integrated speech recognition synthesis system for Hindi language Speech Commun. 13 197-205

[8]

Levinson S E(1967)Error bounds for convolutional codes and an asymptotically optimum decoding algorithm IEEE Trans. Inf. Theor. 13 260-269

[9]

Rabiner L R(1991)Signal processing issues in realizing voice input to computers Asia-Pacific Eng. J. 1 197-217

[10]

Sambur M R(undefined)undefined undefined undefined undefined-undefined

← 1 2 →