Assembling translations from multi-engine machine translation outputs

被引:21
作者
Banik, Debajyoty [1 ,2 ]
Ekbal, Asif [1 ,2 ]
Bhattacharyya, Pushpak [1 ,2 ]
Bhattacharyya, Siddhartha [3 ,4 ]
机构
[1] Dept Comp Sci & Engn, Patna, Bihar, India
[2] Indian Inst Technol Patna, Patna, Bihar, India
[3] Dept Informat Technol, Patna, Bihar, India
[4] RCC Inst Informat Technol, Kolkata, W Bengal, India
关键词
Statistical approach; Neural network; Machine translation; Neural Machine Translation (NMT); Statistical Machine Translation (SMT);
D O I
10.1016/j.asoc.2019.02.031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a hybrid architecture for developing a system combination model that works in three layers to achieve better translated outputs. In the first layer, we have various machine translation models (i.e. Neural Machine Translation (NMT), Statistical Machine Translation (SMT), etc.). In the second layer, the outputs of these models are combined to leverage the advantages of both the systems (i.e SMT and NMT systems) by using the statistical approach and neural-based approach. But each approach has some advantages and limitations. So, instead of selecting an individual combined system's output as the final one, we apply these outputs in the final layer to produce the target output by assigning appropriate preferences to SMT based and neural-based combinations. Though there are some techniques for system combination but no such approach exists which uses preferences from various system combination models (statistical and neural) for the purpose of better assembling. Empirical results show improved performance in the terms of translation accuracy. Our experiments on two benchmark datasets of English-Hindi and Hindi-English pairs show that the proposed model performs significantly better than the participating models. Apparently, the efficacy of proposed model is significantly better than the state-of-the art machine translation combination systems (6.10 and 4.69 BLEU points for English-to-Hindi, and Hindi-to-English, respectively). (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:230 / 239
页数:10
相关论文
共 46 条
[1]  
[Anonymous], NIST OP MACH TRANSL
[2]  
[Anonymous], NNNN
[3]  
[Anonymous], 2016, P 26 INT C COMP LING
[4]  
[Anonymous], 2017, ARXIV170406393
[5]  
[Anonymous], 2014, P SSST EMNLP 2014 8
[6]  
[Anonymous], 2009, P 4 WORKSHOP STAT MA
[7]  
[Anonymous], ANUSAARAKA OVERCOMIN
[8]  
[Anonymous], 2009, PRAGUE B MATH LINGUI
[9]  
[Anonymous], INT C YOUNG COMP SCI
[10]  
[Anonymous], 2016, ARXIV161005243