Akal Badi ya Bias: An Exploratory Study of Gender Bias in Hindi Language Technology

被引:0
作者
Hada, Rishav [1 ]
Husain, Safiya [2 ]
Gumma, Varun [1 ]
Diddee, Harshita [3 ]
Yadavalli, Aditya [2 ]
Seth, Agrima [4 ]
Kulkarni, Nidhi [2 ]
Gadiraju, Ujwal [5 ]
Vashistha, Aditya [6 ]
Seshadri, Vivek [7 ]
Bali, Kalika [1 ]
机构
[1] Microsoft Res, Bengaluru, India
[2] Karya, Bengaluru, India
[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[4] Univ Michigan, Ann Arbor, MI 48109 USA
[5] Delft Univ Technol, Delft, Netherlands
[6] Cornell Univ, Ithaca, NY USA
[7] Microsoft Res, Karya, Bengaluru, India
来源
PROCEEDINGS OF THE 2024 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, ACM FACCT 2024 | 2024年
关键词
Gender bias; Indic languages; Global South; India; Hindi; Community centric;
D O I
10.1145/3630106.3659017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing research in measuring and mitigating gender bias predominantly centers on English, overlooking the intricate challenges posed by non-English languages and the Global South. This paper presents the first comprehensive study delving into the nuanced landscape of gender bias in Hindi, the third most spoken language globally. Our study employs diverse mining techniques, computational models, field studies and sheds light on the limitations of current methodologies. Given the challenges faced with mining gender biased statements in Hindi using existing methods, we conducted field studies to bootstrap the collection of such sentences. Through field studies involving rural and low-income community women, we uncover diverse perceptions of gender bias, underscoring the necessity for context-specific approaches. This paper advocates for a community-centric research design, amplifying voices often marginalized in previous studies. Our findings not only contribute to the understanding of gender bias in Hindi but also establish a foundation for further exploration of Indic languages. By exploring the intricacies of this understudied context, we call for thoughtful engagement with gender bias, promoting inclusivity and equity in linguistic and cultural contexts beyond the Global North.
引用
收藏
页码:1926 / 1939
页数:14
相关论文
共 86 条
  • [1] Abraham B, 2020, PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), P2819
  • [2] Ahuja K, 2023, 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, P4232
  • [3] Albalak Alon, 2023, 17 C EUROPEAN CHAPTE, P1
  • [4] Artetxe M, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P4623
  • [5] Ready Player One! Eliciting Diverse Knowledge Using A Configurable Game
    Balayn, Agathe
    He, Gaole
    Hu, Andrea
    Yang, Jie
    Gadiraju, Ujwal
    [J]. PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 1709 - 1719
  • [6] Barikeri S, 2021, 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), P1941
  • [7] Bender E. M., 2018, Transactions of the Association for Computational Linguistics, V6, P587, DOI DOI 10.1162/TACL_A_00041
  • [8] On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?
    Bender, Emily M.
    Gebru, Timnit
    McMillan-Major, Angelina
    Shmitchell, Shmargaret
    [J]. PROCEEDINGS OF THE 2021 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2021, 2021, : 610 - 623
  • [9] Biester L., 2022, P 1 WORKSHOP PERSPEC, P10
  • [10] Power to the People? Opportunities and Challenges for Participatory AI
    Birhane, Abeba
    Isaac, William
    Prabhakaran, Vinodkumar
    Diaz, Mark
    Elish, Madeleine Clare
    Gabriel, Iason
    Mohamed, Shakir
    [J]. ACM CONFERENCE ON EQUITY AND ACCESS IN ALGORITHMS, MECHANISMS, AND OPTIMIZATION, EAAMO 2022, 2022,