Examining Gender Bias in Languages with Grammatical Gender

被引:0
|
作者
Zhou, Pei [1 ,2 ]
Shi, Weijia [1 ]
Zhao, Jieyu [1 ]
Huang, Kuan-Hao [1 ]
Chen, Muhao [1 ,3 ]
Cotterell, Ryan [4 ]
Chang, Kai-Wei [1 ]
机构
[1] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90024 USA
[2] Univ Southern Calif, Dept Comp Sci, Los Angeles, CA 90007 USA
[3] Univ Penn, Dept Comp & Informat Sci, Philadelphia, PA 19104 USA
[4] Johns Hopkins Univ, Dept Comp Sci, Baltimore, MD 21218 USA
来源
2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE | 2019年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent studies have shown that word embeddings exhibit gender bias inherited from the training corpora. However, most studies to date have focused on quantifying and mitigating such bias only in English. These analyses cannot be directly extended to languages that exhibit morphological agreement on gender, such as Spanish and French. In this paper, we propose new metrics for evaluating gender bias in word embeddings of these languages and further demonstrate evidence of gender bias in bilingual embeddings which align these languages with English. Finally, we extend an existing approach to mitigate gender bias in word embedding under both monolingual and bilingual settings. Experiments on modified Word Embedding Association Test, word similarity, word translation, and word pair translation tasks show that the proposed approaches effectively reduce the gender bias while preserving the utility of the embeddings.
引用
收藏
页码:5276 / 5284
页数:9
相关论文
共 50 条