Unsupervised dialectal neural machine translation

Cited by: 25
Authors
Farhan, Wael [1]
Talafha, Bashar [1]
Abuammar, Analle [1]
Jaikat, Ruba [1]
Al-Ayyoub, Mahmoud [2]
Tarakji, Ahmad Bisher [1]
Toma, Anas [1]
Affiliations
[1] Samsung R&D Inst Jordan, Amman, Jordan
[2] Jordan Univ Sci & Technol, Irbid, Jordan
Keywords
Neural machine translation; Unsupervised dialectal translation; Regression-based decoding; Shared embedding
DOI
10.1016/j.ipm.2019.102181
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we present the first work on unsupervised dialectal Neural Machine Translation (NMT), where the source dialect is not represented in the parallel training corpus. Two systems are proposed for this problem. The first one is the Dialectal to Standard Language Translation (D2SLT) system, which is based on the standard attentional sequence-to-sequence model while introducing two novel ideas leveraging similarities among dialects: using common words as anchor points when learning word embeddings and a decoder scoring mechanism that depends on cosine similarity and language models. The second system is based on the celebrated Google NMT (GNMT) system. We first evaluate these systems in a supervised setting (where the training and testing are done using our parallel corpus of Jordanian dialect and Modern Standard Arabic (MSA)) before going into the unsupervised setting (where we train each system once on a Saudi-MSA parallel corpus and once on an Egyptian-MSA parallel corpus and test them on the Jordanian-MSA parallel corpus). The highest BLEU score obtained in the unsupervised setting is 32.14 (by D2SLT trained on Saudi-MSA data), which is remarkably high compared with the highest BLEU score obtained in the supervised setting, which is 48.25.
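The abstract describes a decoder scoring mechanism that depends on cosine similarity and language models but does not give its formula. The following is only a minimal sketch of how such a score might be combined, not the authors' implementation: the linear interpolation, the `lm_weight` parameter, the function names, and the toy vocabulary (`w0`, `w1`, `w2`) are all assumptions made for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # Treat an all-zero vector as dissimilar to everything.
    return dot / (norm_a * norm_b)


def score_candidates(predicted, embeddings, lm_log_probs, lm_weight=0.5):
    """Score each candidate word for one decoding step.

    predicted    : vector emitted by the decoder at this step
    embeddings   : dict mapping word -> embedding vector
    lm_log_probs : dict mapping word -> language-model log-probability
    lm_weight    : hypothetical interpolation weight (an assumption,
                   not a value taken from the paper)
    """
    scores = {}
    for word, vector in embeddings.items():
        similarity = cosine_similarity(predicted, vector)
        scores[word] = (1.0 - lm_weight) * similarity + lm_weight * lm_log_probs[word]
    return scores


# Toy three-word vocabulary with 2-dimensional embeddings (illustrative only).
embeddings = {"w0": [1.0, 0.0], "w1": [0.0, 1.0], "w2": [1.0, 1.0]}
lm_log_probs = {"w0": math.log(0.5), "w1": math.log(0.3), "w2": math.log(0.2)}
predicted = [0.9, 0.1]

scores = score_candidates(predicted, embeddings, lm_log_probs)
best_word = max(scores, key=scores.get)
```

In this toy setup the predicted vector lies closest to the embedding of `w0`, which the language model also favors, so `w0` receives the highest combined score. A real system would tune the weighting and would likely apply the score inside beam search rather than greedily.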
Pages: 15