English-Basque Statistical and Neural Machine Translation

Cited by: 0
Authors
Unanue, Inigo Jauregi [1, 2]
Garmendia Arratibel, Lierni
Borzeshi, Ehsan Zare [2]
Piccardi, Massimo [1]
Affiliations
[1] Univ Technol Sydney UTS, Sydney, NSW, Australia
[2] Capital Markets Cooperat Res Ctr CMCRC, Sydney, NSW, Australia
Source
PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018) | 2018
Keywords
Neural Machine Translation; Statistical Machine Translation; English-Basque; Basque
DOI
Not available
Chinese Library Classification
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
Neural Machine Translation (NMT) has attracted increasing attention in recent years. However, it tends to require very large training corpora, which could prove problematic for low-resource languages. For this reason, Statistical Machine Translation (SMT) continues to be a popular approach for low-resource language pairs. In this work, we address English-Basque translation and compare the performance of three contemporary statistical and neural machine translation systems: OpenNMT, Moses SMT and Google Translate. For evaluation, we employ an open-domain corpus and an IT-domain corpus from the WMT16 resources for machine translation. In addition, we release a small dataset (Berriak) of 500 highly accurate English-Basque translations of complex sentences, useful for thorough testing of translation systems.
Pages: 880-885
Number of pages: 6