The Flores-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation

Cited by: 142

Authors:

Goyal, Naman ^{[1]}

Gao, Cynthia ^{[1]}

Chaudhary, Vishrav ^{[1]}

Chen, Peng-Jen ^{[1]}

Wenzek, Guillaume ^{[2]}

Ju, Da ^{[1]}

Krishnan, Sanjana ^{[1]}

Ranzato, Marc'Aurelio ^{[1]}

Guzman, Francisco ^{[1]}

Fan, Angela ^{[2,3]}

Affiliations:

[1] Facebook AI Research, Menlo Park, CA 94025, USA

[2] Facebook AI Research, Paris, France

[3] LORIA, Vandoeuvre-les-Nancy, France

Source:

TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS | 2022 / Vol. 10

DOI:

10.1162/tacl_a_00474

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

One of the biggest challenges hindering progress in low-resource and multilingual machine translation is the lack of good evaluation benchmarks. Current evaluation benchmarks either lack good coverage of low-resource languages, consider only restricted domains, or are low quality because they are constructed using semi-automatic procedures. In this work, we introduce the Flores-101 evaluation benchmark, consisting of 3001 sentences extracted from English Wikipedia and covering a variety of different topics and domains. These sentences have been translated into 101 languages by professional translators through a carefully controlled process. The resulting dataset enables better assessment of model quality on the long tail of low-resource languages, including the evaluation of many-to-many multilingual translation systems, as all translations are fully aligned. By publicly releasing such a high-quality and high-coverage dataset, we hope to foster progress in the machine translation community and beyond.

Pages: 522-538

Page count: 17
