Speech Transcript Evaluation for Information Retrieval

被引:0
作者
van der Werff, Laurens [1 ]
Kraaij, Wessel [2 ]
de Jong, Franciska [1 ]
机构
[1] Univ Twente, POB 217, NL-7500 AE Enschede, Netherlands
[2] Radboud Univ Nijmegen, Inst Comp & Informat Sci, Nijmegen, Netherlands
来源
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011年
关键词
evaluation; speech recognition; information retrieval; speech retrieval; rank correlation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech recognition transcripts are being used in various fields of research and practical applications, putting various demands on their accuracy. Traditionally ASR research has used intrinsic evaluation measures such as word error rate to determine transcript quality. In non-dictation-type applications such as speech retrieval, it is better to use extrinsic (or task specific) measures. Indexation and the associated processing may eliminate certain errors, whereas the search query may reveal others. In this work, we argue that the standard extrinsic speech retrieval measure average precision is unpractical for ASR evaluation. As an alternative we propose the use of ranked correlation measures on the output of the speech retrieval task, with the goal of predicting relative mean average precision. The measures we used showed a reasonably high correlation with average precision, but require much less human effort to calculate and can be more easily deployed in a variety of real-life settings.
引用
收藏
页码:1536 / +
页数:2
相关论文
共 16 条
  • [1] [Anonymous], 2008, INT ACM SIGIR C RES, DOI DOI 10.1145/1390334.1390435
  • [2] Rank correlation - An alternative measure
    Blest, DC
    [J]. AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2000, 42 (01) : 101 - 111
  • [3] Cieri C., 1999, P DARPA BROADCAST NE, P57
  • [4] Doddington G., 1998, P DARPA BROADC NEWS
  • [5] Fagin R, 2003, SIAM PROC S, P28
  • [6] Garofolo J. S., 2000, P RIAO CONT BAS MULT
  • [7] Garofolo John S., 1998, P 7 TEXT RETR C TREC
  • [8] Jones Sparck K., 2000, INFORM PROCESSING MA, V36, P779
  • [9] Kamvar M., 2006, Conference on Human Factors in Computing Systems. CHI2006, P701
  • [10] Kendall M., 1990, Correlation methods