A CACHE-BASED NATURAL-LANGUAGE MODEL FOR SPEECH RECOGNITION

被引:215
作者
KUHN, R [1 ]
DEMORI, R [1 ]
机构
[1] CTR RECH INFORMAT MONTREAL INC,MONTREAL,QUEBEC,CANADA
关键词
3g-gram language model; Cache-based language model; language models for speech recognition; Markov language models; natural language; perplexity; probabilistic language model;
D O I
10.1109/34.56193
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech recognition systems must often decide between competing ways of breaking up the acoustic input into strings of words. Since the possible strings may be acoustically similar, a language model is required; given a word string, the model returns its linguistic probability. This paper discusses several Markov language models. Subsequently, we present a new kind of language model which reflects short-term patterns of word use by means of a “cache component,” analogous to “cache memory” in hardware terminology. The model also contains a “3g-gram component” of the traditional type. The combined model and a pure 3g-gram model were tested on samples drawn from the LOB (Lancaster-Oslo/Bergen) corpus of English text. We discuss the relative performance of the two models, and make suggestions for future improvements. © 1990 IEEE
引用
收藏
页码:570 / 583
页数:14
相关论文
共 13 条
[1]   NATURAL-LANGUAGE MODELING FOR PHONEME-TO-TEXT TRANSCRIPTION [J].
DEROUAULT, AM ;
MERIALDO, B .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1986, 8 (06) :742-749
[2]  
DEROUAULT AM, 1984, 7TH P INT C PATT REC, V2, P1373
[3]   THE DEVELOPMENT OF AN EXPERIMENTAL DISCRETE DICTATION RECOGNIZER [J].
JELINEK, F .
PROCEEDINGS OF THE IEEE, 1985, 73 (11) :1616-1624
[4]  
JELINEK F, 1983, IEEE T PATTERNS ANAL, V5, P179
[5]  
JELINEK F, 1981, PATTERN RECOGN, P381
[6]   WORD-FREQUENCY AND TEXT TYPE - SOME OBSERVATIONS BASED ON THE LOB CORPUS OF BRITISH ENGLISH-TEXTS [J].
JOHANSSON, S .
COMPUTERS AND THE HUMANITIES, 1985, 19 (01) :23-36
[7]  
Johansson S., 1986, TAGGED LOB CORPUS US
[8]  
Johansson S., 1985, ITL REV APPL LINGUIS, V67-68, P117
[9]  
KATZ S, IN PRESS RECURSIVE M
[10]  
MUCKSTEIN EM, 1981, IBM RC751638450 RES