Testing the correlation of word error rate and perplexity

被引:142
作者
Klakow, D [1 ]
Peters, J [1 ]
机构
[1] Philips GmbH Forschungslab, D-52066 Aachen, Germany
关键词
language model training; perplexity; correlation with word error rate;
D O I
10.1016/S0167-6393(01)00041-3
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Many groups have investigated the relationship of word error rate and perplexity of language models. This issue is of central interest because perplexity optimization can be done independent of a recognizer and in most cases it is possible to find simple perplexity optimization procedures. Moreover, many tasks in language model training such as the optimization of word classes may use perplexity as target function resulting in explicit optimization formulas which are not available if error rates are used as target. This paper first presents some theoretical arguments for a close relationship between perplexity and word error rate. Thereafter the notion of uncertainty of a measurement is introduced and is then used to test the hypothesis that word error rate and perplexity are correlated by a power law. There is no evidence to reject this hypothesis. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:19 / 28
页数:10
相关论文
共 17 条
[1]  
[Anonymous], 1993, P EUROSPEECH
[2]  
[Anonymous], P INT C AC SPEECH SI
[3]  
BESLING S, 1995, EUROSPEECH, P1755
[4]  
BEYERLEIN P, 1998, DAPRA BROADCAST NEWS
[5]  
Bronshtein IN., 2013, Handbook of Mathematics
[6]  
CHEN S, 1998, DARPA BROADCAST NEWS
[7]  
CLARKSON P, 1999, P EUR, P2707
[8]  
DUGAST C, 1995, ARPA SPOKEN LANGUAGE
[9]  
ITO A, 1999, P EUR, P1591
[10]  
IYER R, 1997, P ASRU, P254