Using Contextual Information for Machine Translation Evaluation

被引：0

作者：

Fomicheva, Marina ^{[1
]}

Bel, Nuria ^{[1
]}

机构：

[1] Univ Pompeu Fabra, IULA, Barcelona, Spain

来源：

LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2016年

关键词：

Machine Translation; Evaluation; Local Context; Alignment;

D O I：

暂无

中图分类号：

H [语言、文字];

学科分类号：

05 ;

摘要：

Automatic evaluation of Machine Translation (MT) is typically approached by measuring similarity between the candidate MT and a human reference translation. An important limitation of existing evaluation systems is that they are unable to distinguish candidate-reference differences that arise due to acceptable linguistic variation from the differences induced by MT errors. In this paper we present a new metric, UPF-Cobalt, that addresses this issue by taking into consideration the syntactic contexts of candidate and reference words. The metric applies a penalty when the words are similar but the contexts in which they occur are not equivalent. In this way, Machine Translations (MTs) that are different from the human translation but still essentially correct are distinguished from those that share high number of words with the reference but alter the meaning of the sentence due to translation errors. The results show that the method proposed is indeed beneficial for automatic MT evaluation. We report experiments based on two different evaluation tasks with various types of manual quality assessment. The metric significantly outperforms state-of-the-art evaluation systems in varying evaluation settings.

引用

页码：2755 / 2761

页数：7

共 35 条

[1]

Ahrenberg L., 2005, P 10 ANN C EUR ASS M

[2] Regression for machine translation evaluation at the sentence level [J].