Gender Representation Among Contributors to Open-Source Infrastructure

被引:4
|
作者
Qiu, Huilian Sophie [1 ,3 ]
Zhao, Zihe H. [2 ]
Yu, Tielin Katy [1 ]
Wang, Justin [1 ]
Ma, Alexander [1 ]
Fang, Hongbo [1 ]
Dabbish, Laura [1 ]
Vasilescu, Bogdan [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Rice Univ, Houston, TX USA
[3] Northwestern Univ, Evanston, IL USA
关键词
open-source software; gender diversity; SOFTWARE-DEVELOPMENT;
D O I
10.1109/ICSE-SEIS58686.2023.00025
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
While the severe underrepresentation of women and non-binary people in open source is widely recognized, there is little empirical data on how the situation has changed over time and which subcommunities have been more effectively reducing the gender imbalance. To obtain a clearer image of gender representation in open source, we compiled and synthesized existing empirical data from the literature, and computed historical trends in the representation of women across 20 open source ecosystems. While inherently limited by the ability of automatic name-based gender inference to capture true gender identities at an individual level, our census still provides valuable population-level insights. Across all and in most ecosystems, we observed a promising upward trend in the percentage of women among code contributors over time, but also high variation in the percentage of women contributors across ecosystems. We also found that, in most ecosystems, women withdraw earlier from open-source participation than men. General Abstract-The representation of women and non-binary people has been extremely low in the open-source software community. Most of the statistics reported by prior studies are below 10%. However, the majority of the prior works were based on subsamples instead of the entire population. Our work started with a review of the gender distributions reported in the literature. Then we provided an overview of the gender distribution in 20 of the largest open-source ecosystem, i.e., grouped by package managers such as npm and PyPI, and investigated its change over time. Moreover, we analyzed the turnover rate between men and women contributors. Across all and in most ecosystems, we observed a promising upward trend in the percentage of women among code contributors over time, but also high variation in the percentage of women contributors across ecosystems. We also found that, in most ecosystems, women withdraw earlier from open-source participation than men.
引用
收藏
页码:180 / 187
页数:8
相关论文
共 50 条
  • [1] The dynamics of open-source contributors
    Lerner, Josh
    Pathak, Parag A.
    Tirole, Jean
    AMERICAN ECONOMIC REVIEW, 2006, 96 (02): : 114 - 118
  • [2] iEDA: An Open-source infrastructure of EDA
    Li, Xingquan
    Huang, Zengrong
    Tao, Simin
    Huang, Zhipeng
    Zhuang, Chunan
    Wang, Hao
    Li, Yifan
    Qiu, Yihang
    Luo, Guojie
    Li, Huawei
    Shen, Haihua
    Chen, Mingyu
    Bu, Dongbo
    Zhu, Wenxing
    Cai, Ye
    Xiong, Xiaoming
    Jiang, Ying
    Heng, Yi
    Zhang, Peng
    Yu, Bei
    Xie, Biwei
    Bao, Yungang
    29TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2024, 2024, : 77 - 82
  • [3] Skill Recommendation for New Contributors in Open-Source Software
    Santos, Fabio
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: COMPANION PROCEEDINGS, ICSE-COMPANION, 2023, : 311 - 313
  • [4] Eucalyptus: an open-source cloud computing infrastructure
    Nurmi, Daniel
    Wolski, Rich
    Grzegorczyk, Chris
    Obertelli, Graziano
    Soman, Sunil
    Youseff, Lamia
    Zagorodnov, Dmitrii
    SCIDAC 2009: SCIENTIFIC DISCOVERY THROUGH ADVANCED COMPUTING, 2009, 180
  • [5] An extensible open-source compiler infrastructure for testing
    Quinlan, Dan
    Ur, Shmuel
    Vuduc, Richard
    HARDWARE AND SOFTWARE VERIFICATION AND TESTING, 2006, 3875 : 116 - 133
  • [6] EUCALYPTUS OPEN-SOURCE PRIVATE CLOUD INFRASTRUCTURE
    Bogdanov, A. V.
    Dmitriev, M.
    Naing, Ye Myint
    DISTRIBUTED COMPUTING AND GRID-TECHNOLOGIES IN SCIENCE AND EDUCATION, 2010, : 57 - 62
  • [7] Towards Extracting the Role and Behavior of Contributors in Open-source Projects
    Papamichail, Michail D.
    Diamantopoulos, Themistoklis
    Matsoukas, Vasileios
    Athanasiadis, Christos
    Symeonidis, Andreas L.
    ICSOFT: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES, 2019, : 536 - 543
  • [8] Design and implementation of an open-source infrastructure and an intelligent thermostat
    Loumpas, Anastasios
    Panaras, Georgios
    Dasygenis, Minas
    2018 7TH INTERNATIONAL CONFERENCE ON MODERN CIRCUITS AND SYSTEMS TECHNOLOGIES (MOCAST), 2018,
  • [9] An open-source representation for 2-DE-centric proteomics and support infrastructure for data storage and analysis
    Romesh Stanislaus
    John M Arthur
    Balaji Rajagopalan
    Rick Moerschell
    Brian McGlothlen
    Jonas S Almeida
    BMC Bioinformatics, 9
  • [10] An open-source representation for 2-DE-centric proteomics and support infrastructure for data storage and analysis
    Stanislaus, Romesh
    Arthur, John M.
    Rajagopalan, Balaji
    Moerschell, Rick
    McGlothlen, Brian
    Almeida, Jonas S.
    BMC BIOINFORMATICS, 2008, 9 (1)