Akal Badi ya Bias: An Exploratory Study of Gender Bias in Hindi Language Technology

被引：0

作者：

Hada, Rishav ^{[1
]}

Husain, Safiya ^{[2
]}

Gumma, Varun ^{[1
]}

Diddee, Harshita ^{[3
]}

Yadavalli, Aditya ^{[2
]}

Seth, Agrima ^{[4
]}

Kulkarni, Nidhi ^{[2
]}

Gadiraju, Ujwal ^{[5
]}

Vashistha, Aditya ^{[6
]}

Seshadri, Vivek ^{[7
]}

Bali, Kalika ^{[1
]}

机构：

[1] Microsoft Res, Bengaluru, India

[2] Karya, Bengaluru, India

[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

[4] Univ Michigan, Ann Arbor, MI 48109 USA

[5] Delft Univ Technol, Delft, Netherlands

[6] Cornell Univ, Ithaca, NY USA

[7] Microsoft Res, Karya, Bengaluru, India

来源：

PROCEEDINGS OF THE 2024 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, ACM FACCT 2024 | 2024年

关键词：

Gender bias; Indic languages; Global South; India; Hindi; Community centric;

D O I：

10.1145/3630106.3659017

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Existing research in measuring and mitigating gender bias predominantly centers on English, overlooking the intricate challenges posed by non-English languages and the Global South. This paper presents the first comprehensive study delving into the nuanced landscape of gender bias in Hindi, the third most spoken language globally. Our study employs diverse mining techniques, computational models, field studies and sheds light on the limitations of current methodologies. Given the challenges faced with mining gender biased statements in Hindi using existing methods, we conducted field studies to bootstrap the collection of such sentences. Through field studies involving rural and low-income community women, we uncover diverse perceptions of gender bias, underscoring the necessity for context-specific approaches. This paper advocates for a community-centric research design, amplifying voices often marginalized in previous studies. Our findings not only contribute to the understanding of gender bias in Hindi but also establish a foundation for further exploration of Indic languages. By exploring the intricacies of this understudied context, we call for thoughtful engagement with gender bias, promoting inclusivity and equity in linguistic and cultural contexts beyond the Global North.

引用

页码：1926 / 1939

页数：14

共 86 条

[1] Abraham B, 2020, PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), P2819
[2] Ahuja K, 2023, 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, P4232
[3] Albalak Alon, 2023, 17 C EUROPEAN CHAPTE, P1
[4] Artetxe M, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P4623
[5] Ready Player One! Eliciting Diverse Knowledge Using A Configurable Game
Balayn, Agathe
He, Gaole
Hu, Andrea
Yang, Jie
Gadiraju, Ujwal
[J]. PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 1709 - 1719
[6] Barikeri S, 2021, 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), P1941
[7] Bender E. M., 2018, Transactions of the Association for Computational Linguistics, V6, P587, DOI DOI 10.1162/TACL_A_00041
[8] On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?
Bender, Emily M.
Gebru, Timnit
McMillan-Major, Angelina
Shmitchell, Shmargaret
[J]. PROCEEDINGS OF THE 2021 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2021, 2021, : 610 - 623
[9] Biester L., 2022, P 1 WORKSHOP PERSPEC, P10
[10] Power to the People? Opportunities and Challenges for Participatory AI
Birhane, Abeba
Isaac, William
Prabhakaran, Vinodkumar
Diaz, Mark
Elish, Madeleine Clare
Gabriel, Iason
Mohamed, Shakir
[J]. ACM CONFERENCE ON EQUITY AND ACCESS IN ALGORITHMS, MECHANISMS, AND OPTIMIZATION, EAAMO 2022, 2022,

← 1 2 3 4 5 6 7 8 9 →