English-Basque Statistical and Neural Machine Translation

Times cited: 0
Authors
Unanue, Inigo Jauregi [1, 2]
Garmendia Arratibel, Lierni
Borzeshi, Ehsan Zare [2 ]
Piccardi, Massimo [1 ]
Affiliations
[1] Univ Technol Sydney UTS, Sydney, NSW, Australia
[2] Capital Markets Cooperat Res Ctr CMCRC, Sydney, NSW, Australia
Source
PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018) | 2018
Keywords
Neural Machine Translation; Statistical Machine Translation; English-Basque; Basque
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
Neural Machine Translation (NMT) has attracted increasing attention in recent years. However, it tends to require very large training corpora, which can prove problematic for low-resource languages. For this reason, Statistical Machine Translation (SMT) continues to be a popular approach for low-resource language pairs. In this work, we address English-Basque translation and compare the performance of three contemporary statistical and neural machine translation systems: OpenNMT, Moses SMT and Google Translate. For evaluation, we employ an open-domain corpus and an IT-domain corpus from the WMT16 resources for machine translation. In addition, we release a small dataset (Berriak) of 500 highly accurate English-Basque translations of complex sentences, useful for thorough testing of translation systems.
Pages: 880-885
Page count: 6