A Data Set for Social Diversity Studies of GitHub Teams

被引:53
作者
Vasilescu, Bogdan [1 ]
Serebrenik, Alexander [2 ]
Filkov, Vladimir [1 ]
机构
[1] Univ Calif Davis, Davis, CA 95616 USA
[2] Eindhoven Univ Technol, NL-5600 MB Eindhoven, Netherlands
来源
12TH WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2015) | 2015年
关键词
SOFTWARE; IMPACT;
D O I
10.1109/MSR.2015.77
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Like any other team oriented activity, the software development process is effected by social diversity in the programmer teams. The effect of team diversity can be significant, but also complex, especially in decentralized teams. Discerning the precise contribution of diversity on teams' effectiveness requires quantitative studies of large data sets. Here we present for the first time a large data set of social diversity attributes of programmers in GITHUB teams. Using alias resolution, location data, and gender inference techniques, we collected a team social diversity data set of 23,493 GITHUB projects. We illustrate how the data set can be used in practice with a series of case studies, and we hope its availability will foster more interest in studying diversity issues in software teams.
引用
收藏
页码:514 / 517
页数:4
相关论文
共 27 条
[1]   Coordination and Productivity Issues in Free Software: the Role of Brooks' Law [J].
Adams, Paul J. ;
Capiluppi, Andrea ;
Boldyreff, Cornelia .
2009 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE, CONFERENCE PROCEEDINGS, 2009, :319-328
[2]   MEASURES OF INEQUALITY [J].
ALLISON, PD .
AMERICAN SOCIOLOGICAL REVIEW, 1978, 43 (06) :865-880
[3]  
[Anonymous], 1977, INEQUALITY HETEROGEN
[4]  
[Anonymous], CHI IN PRESS
[5]  
[Anonymous], CHASE IN PRESS
[6]  
[Anonymous], EUROPEAN REV SOCIAL
[7]  
Bettenburg Nicolas, 2010, Proceedings of the 18th IEEE International Conference on Program Comprehension (ICPC 2010), P124, DOI 10.1109/ICPC.2010.46
[8]   Probabilistic Topic Models [J].
Blei, David M. .
COMMUNICATIONS OF THE ACM, 2012, 55 (04) :77-84
[9]  
Chen JL, 2010, CHI2010: PROCEEDINGS OF THE 28TH ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, VOLS 1-4, P821
[10]   The Effects of Diversity in Global, Distributed Collectives: A Study of Open Source Project Success [J].
Daniel, Sherae ;
Agarwal, Ritu ;
Stewart, Katherine J. .
INFORMATION SYSTEMS RESEARCH, 2013, 24 (02) :312-333