Machine translation evaluation with neural networks

Cited by: 16
Authors
Guzman, Francisco [1]
Joty, Shafiq [1]
Marquez, Lluis [1]
Nakov, Preslav [1]
Affiliations
[1] Hamad Bin Khalifa Univ, Qatar Comp Res Inst, Qatar Fdn, Doha, Qatar
Keywords
Machine translation; Reference-based MT evaluation; Deep neural networks; Distributed representation of texts; Textual similarity
DOI
10.1016/j.csl.2016.12.005
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We present a framework for machine translation evaluation using neural networks in a pairwise setting, where the goal is to select the better translation from a pair of hypotheses, given the reference translation. In this framework, lexical, syntactic and semantic information from the reference and the two hypotheses is embedded into compact distributed vector representations, and fed into a multi-layer neural network that models nonlinear interactions between each of the hypotheses and the reference, as well as between the two hypotheses. We experiment with the benchmark datasets from the WMT Metrics shared task, on which we obtain the best results published so far, with the basic network configuration. We also perform a series of experiments to analyze and understand the contribution of the different components of the network. We evaluate variants and extensions, including fine-tuning of the semantic embeddings, and sentence-based representations modeled with convolutional and recurrent neural networks. In summary, the proposed framework is flexible and generalizable, allows for efficient learning and scoring, and provides an MT evaluation metric that correlates with human judgments, and is on par with the state of the art. (C) 2017 Elsevier Ltd. All rights reserved.
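The pairwise setup described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, layer sizes, tanh and sigmoid activations, and random initial weights are all assumptions made for illustration, and the input vectors stand in for whatever lexical, syntactic and semantic embeddings are actually used. The sketch only shows the shape of the idea: one hidden block per pair (hypothesis 1 with reference, hypothesis 2 with reference, hypothesis 1 with hypothesis 2), followed by an output unit that estimates the probability that the first hypothesis is the better translation.

```python
import math
import random


def make_layer(n_out, n_in, rng):
    """Return (weights, biases) for a dense layer with small random weights."""
    weights = [[rng.gauss(0.0, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def dense_tanh(layer, x):
    """Apply a dense layer followed by tanh to the input vector x."""
    weights, biases = layer
    return [
        math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(weights, biases)
    ]


def init_model(dim, hidden, seed=0):
    """Build an untrained model (hypothetical layout, assumed for illustration)."""
    rng = random.Random(seed)
    return {
        "h1_ref": make_layer(hidden, 2 * dim, rng),  # hypothesis 1 vs. reference
        "h2_ref": make_layer(hidden, 2 * dim, rng),  # hypothesis 2 vs. reference
        "h1_h2": make_layer(hidden, 2 * dim, rng),   # hypothesis 1 vs. hypothesis 2
        "out": [rng.gauss(0.0, 0.1) for _ in range(3 * hidden)],
        "out_bias": 0.0,
    }


def prob_first_is_better(model, h1, h2, ref):
    """Estimate P(hypothesis 1 is better than hypothesis 2 | reference)."""
    # Each pairwise block sees the concatenation of two embedding vectors.
    a = dense_tanh(model["h1_ref"], h1 + ref)
    b = dense_tanh(model["h2_ref"], h2 + ref)
    c = dense_tanh(model["h1_h2"], h1 + h2)
    features = a + b + c
    logit = sum(w * f for w, f in zip(model["out"], features)) + model["out_bias"]
    return 1.0 / (1.0 + math.exp(-logit))


if __name__ == "__main__":
    model = init_model(dim=4, hidden=3)
    p = prob_first_is_better(
        model,
        [0.1, 0.2, 0.3, 0.4],  # embedding of hypothesis 1 (made-up values)
        [0.4, 0.3, 0.2, 0.1],  # embedding of hypothesis 2 (made-up values)
        [0.1, 0.2, 0.3, 0.5],  # embedding of the reference (made-up values)
    )
    print(p)
```

With untrained weights the output is uninformative; in a trained model of this kind, the weights would be fitted on human pairwise preference judgments so that the output tracks which hypothesis annotators preferred.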
Pages: 180-200
Page count: 21