The Flores-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation

被引:142
作者
Goyal, Naman [1 ]
Gao, Cynthia [1 ]
Chaudhary, Vishrav [1 ]
Chen, Peng-Jen [1 ]
Wenzek, Guillaume [2 ]
Ju, Da [1 ]
Krishnan, Sanjana [1 ]
Ranzato, Marc'Aurelio [1 ]
Guzman, Francisco [1 ]
Fan, Angela [2 ,3 ]
机构
[1] Facebook AI Res, Menlo Pk, CA 94025 USA
[2] Facebook AI Res, Paris, France
[3] LORIA, Vandoeuvre Les Nancy, France
关键词
51;
D O I
10.1162/tacl_a_00474
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the biggest challenges hindering progress in low-resource and multilingual machine translation is the lack of good evaluation benchmarks. Current evaluation benchmarks either lack good coverage of low-resource languages, consider only restricted domains, or are low quality because they are constructed using semi-automatic procedures. In this work, we introduce the Flores-101 evaluation benchmark, consisting of 3001 sentences extracted from English Wikipedia and covering a variety of different topics and domains. These sentences have been translated in 101 languages by professional translators through a carefully controlled process. The resulting dataset enables better assessment of model quality on the long tail of low-resource languages, including the evaluation of many-to-many multilingual translation systems, as all translations are fully aligned. By publicly releasing such a high-quality and high-coverage dataset, we hope to foster progress in the machine translation community and beyond.
引用
收藏
页码:522 / 538
页数:17
相关论文
共 50 条
[1]  
Abbott Jade Z., 2019, P 2019 WORKSH WID NL, P98
[2]  
Adelani DI, 2021, Arxiv, DOI arXiv:2103.08647
[3]  
Agic E, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P3204
[4]  
Aharoni R, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P3874
[5]  
Ali FDMA, 2021, Arxiv, DOI arXiv:2104.05753
[6]  
Anastasopoulos A., 2020, Proceedings of the 1st Workshop on NLP for COVID-19 (Part 2) at EMNLP 2020, DOI [10.18653/v1/2020.nlpcovid19-2.5, DOI 10.18653/V1/2020.NLPCOVID19-2.5]
[7]  
Arivazhagan N, 2019, Arxiv, DOI arXiv:1907.05019
[8]  
Aulamo Mikko., 2021, NODALIDA 2021, P351
[9]  
Barrault L, 2019, FOURTH CONFERENCE ON MACHINE TRANSLATION (WMT 2019), P1
[10]   Human platelet lysates for human cell propagation [J].
Barro, Lassina ;
Burnouf, Pierre-Alain ;
Chou, Ming-Li ;
Nebie, Ouada ;
Wu, Yu-Wen ;
Chen, Ming-Sheng ;
Radosevic, Miryana ;
Knutson, Folke ;
Burnouf, Thierry .
PLATELETS, 2021, 32 (02) :153-162