Multilingual sequence to sequence convolutional machine translation

被引:4
作者
Bansal, Mani [1 ]
Lobiyal, D. K. [1 ]
机构
[1] Jawaharlal Nehru Univ, Sch Comp & Syst Sci, New Delhi, India
关键词
Convolution Neural Network; English-Punjabi; Punjabi-English; Hindi-Punjabi; Punjabi-Hindi; Neural Machine Translation; ENGLISH; SYSTEM;
D O I
10.1007/s11042-021-11345-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, to improve the translation quality of a sentence, Convolutional sequence to sequence architecture has been applied to English-Punjabi, Punjabi-English, Hindi-Punjabi, Punjabi-Hindi language pairs. The Convolution architecture consists of Gated Linear Unit (GLU) and a Multi-Hop attention mechanism. The GLU mechanism controls the information flow through hidden units to produce good translations. The Multi-Hop attention is an enhanced version of the attention mechanism that allows the model to make repeated gaze on the sentence than looking at the sentence only once. As compared to the Recurrent Neural Network (RNN), the Convolution based technique provides better accuracy, captures long-range dependencies between the words, capture local context, and allows parallelization over every element in a sequence without depending on the computations of the previous time steps. In this work, we have applied the architecture on four Indian language pairs and compared the performance of our model with conventional models using BLEU, METEOR and WER as evaluation metrics. Our model outperforms recurrent neural networks by 2 BLEU points on English-Punjabi, 2.5 BLEU points on Punjabi-English, 3.5 BLEU points on Hindi-Punjabi translations, and 1.5 BLEU points on Punjabi-Hindi.
引用
收藏
页码:33701 / 33726
页数:26
相关论文
共 34 条
[1]  
Agrawal R, 2017, THESIS INT I INFORM
[2]  
Chen SF, 1998, DARPA BROADC NEWS TR
[3]  
Dauphin YN, 2017, PR MACH LEARN RES, V70
[4]   Apertium: a free/open-source platform for rule-based machine translation [J].
Forcada, Mikel L. ;
Ginesti-Rosell, Mireia ;
Nordfalk, Jacob ;
O'Regan, Jim ;
Ortiz-Rojas, Sergio ;
Antonio Perez-Ortiz, Juan ;
Sanchez-Martinez, Felipe ;
Ramirez-Sanchez, Gema ;
Tyers, Francis M. .
MACHINE TRANSLATION, 2011, 25 (02) :127-144
[5]  
Gehring J, 2017, PR MACH LEARN RES, V70
[6]  
Goyal V, 2011, P 49 ANN M ASS COMP
[7]  
Goyal V., 2019, GEORGE RONCHI FDN J, V64, P2009
[8]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[9]   English to Punjabi statistical machine translation using moses (Corpus Based) [J].
Jindal, Shishpal ;
Goyal, Vishal ;
Bhullar, Jaskarn Singh .
JOURNAL OF STATISTICS & MANAGEMENT SYSTEMS, 2018, 21 (04) :553-560
[10]  
Josan G.S., 2008, P 22 INT C ON COMP L, P157