Network optimizations for large-vocabulary speech recognition

被引:13
作者
Mohri, M [1 ]
Riley, M [1 ]
机构
[1] AT&T Bell Labs, Res, Florham Pk, NJ 07932 USA
关键词
large-vocabulary speech recognition; search; network optimization; weighted finite-state transducers; stochastic automata;
D O I
10.1016/S0167-6393(98)00043-0
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The redundancy and the size of networks in large-vocabulary speech recognition systems can have a critical effect on their overall performance. We describe the use of two new algorithms: weighted determinization and minimization (Mohri, 1997a). These algorithms transform recognition labeled networks into equivalent ones that require much less time and space in large-vocabulary speech recognition. They are both optimal: weighted determinization eliminates the number of alternatives at each state to the minimum, and weighted minimization reduces the size of deterministic networks to the smallest possible number of states and transitions. These algorithms generalize classical automata determinization and minimization to deal properly with the probabilities of alternative hypotheses and with the relationships between units (distributions, phones, words) at different levels in the recognition system. We illustrate their use in several applications, and report the results of our experiments. (C) 1999 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 22 条
[1]  
Aho A.V., 1974, The Design and Analysis of Computer Algorithms
[2]  
Aho Alfred V., 2007, COMPILERS PRINCIPLES
[3]  
[Anonymous], 1994, P HUMAN LANG TECHN W
[4]  
[Anonymous], 1997, COMPUTATIONAL LINGUI
[5]  
ANTONIOL G, 1995, P INT C AC SPEECH SI, P588
[6]  
Berstel J., 1979, TEUBNER STUDIENBUCHE, V38
[7]  
Eilenberg S., 1974, AUTOMATA LANGUAGES M, VA
[8]  
GOPALAKRISHNAN PS, 1995, INT CONF ACOUST SPEE, P572, DOI 10.1109/ICASSP.1995.479662
[9]  
KAPLAN R, 1994, COMPUTATIONAL LINGUI, V20
[10]   CONTEXT-DEPENDENT PHONETIC HIDDEN MARKOV-MODELS FOR SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNITION [J].
LEE, KF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1990, 38 (04) :599-609