On the practice of error analysis for machine translation evaluation

Cited by: 0
Authors
Stymne, Sara [1]
Ahrenberg, Lars [1]
Affiliation
[1] Linköping Univ, Linköping, Sweden
Source
LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2012
Keywords
Error analysis; Inter-annotator agreement; Machine translation evaluation;
DOI
Not available
Chinese Library Classification number
H0 [Linguistics];
Subject classification codes
030303 ; 0501 ; 050102 ;
Abstract
Error analysis is a means to assess machine translation output in qualitative terms, which can be used as a basis for the generation of error profiles for different systems. As with other subjective approaches to evaluation, it runs the risk of low inter-annotator agreement, but papers applying error analysis to MT very often do not even discuss this aspect. In this paper, we report results from a comparative evaluation of two systems where agreement was initially low, and discuss the different measures we took to improve it. We compared the effects of using more or less fine-grained taxonomies, and the possibility of restricting analysis to short sentences only. We report results on inter-annotator agreement before and after these measures were taken, on the error categories that are most likely to be confused, and on the possibility of establishing error profiles even in the absence of high inter-annotator agreement.
Pages: 1785-1790
Page count: 6