Towards Cross-Lingual Generalization of Translation Gender Bias

被引:12
作者
Cho, Won Ik [1 ,2 ]
Kim, Jiwon
Yang, Jaeyeong [3 ]
Kim, Nam Soo [1 ,2 ]
机构
[1] Seoul Natl Univ, Dept ECE, Seoul, South Korea
[2] Seoul Natl Univ, INMC, Seoul, South Korea
[3] Seoul Natl Univ, Dept Linguist, Seoul, South Korea
来源
PROCEEDINGS OF THE 2021 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2021 | 2021年
关键词
machine translation; gender bias; evaluation; cross-linguality;
D O I
10.1145/3442188.3445907
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Cross-lingual generalization issues for less explored languages have been broadly tackled in recent NLP studies. In this study, we apply the philosophy on the problem of translation gender bias, which necessarily involves multilingualism and socio-cultural diversity. Beyond the conventional evaluation criteria for the social bias, we aim to put together various aspects of linguistic viewpoints into the measuring process, to create a template that makes evaluation less tilted to specific types of language pairs. With a manually constructed set of content words and template, we check both the accuracy of gender inference and the fluency of translation, for German, Korean, Portuguese, and Tagalog. Inference accuracy and disparate impact, namely the biasedness factors associated with each other, show that the failure of bias mitigation threatens the delicacy of translation. Furthermore, our analyses on each system and language indicate that the translation fluency and inference accuracy are not necessarily correlated. The results implicitly suggest that the amount of available language resources that boost up the performance might amplify the bias cross-linguistically.
引用
收藏
页码:449 / 457
页数:9
相关论文
共 32 条
[1]  
[Anonymous], 2018, Transactions of the Association for Computational Linguistics
[2]  
Bender Emily, 2019, GRADIENT, V14, P34
[3]  
Binns R., 2018, P MACHINE LEARNING R, P149, DOI DOI 10.1145/3178876.3186150
[4]  
Bjorkman BM, 2017, GLOSSA-UK, V2, DOI 10.5334/gjgl.374
[5]  
Bolukbasi T, 2016, ADV NEUR IN, V29
[6]  
Chen B., 2014, P 9 WORKSHOP STAT MA, P362
[7]  
Cho WI, 2019, GENDER BIAS IN NATURAL LANGUAGE PROCESSING (GEBNLP 2019), P173
[8]  
Clinchant S., 2019, P 3 WORKSHOP NEURAL, P108, DOI [10.18653/v1/D19-5611, 10.18653/ v1/D19-5611]
[9]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[10]  
Font JE, 2019, GENDER BIAS IN NATURAL LANGUAGE PROCESSING (GEBNLP 2019), P147