Tell Me Who Are You Talking to and I Will Tell You What Issues Need Your Skills

被引:5
作者
Santos, Fabio [1 ]
Penney, Jacob [1 ]
Pimentel, Joao Felipe [1 ]
Wiese, Igor [2 ]
Steinmacher, Igor [1 ]
Gerosa, Marco A. [1 ]
机构
[1] No Arizona Univ, Flagstaff, AZ 86011 USA
[2] Univ Tecnol Fed Parana, Campo Mour ao, PR, Brazil
来源
2023 IEEE/ACM 20TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES, MSR | 2023年
基金
美国国家科学基金会;
关键词
Labels; Tags; Skills; Human Factors; Mining Software Repositories; Social Network Analysis; Open Source Software; Machine Learning; SOFTWARE; CENTRALITY;
D O I
10.1109/MSR59073.2023.00087
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Selecting an appropriate task is challenging for newcomers to Open Source Software (OSS) projects. To facilitate task selection, researchers and OSS projects have leveraged machine learning techniques, historical information, and textual analysis to label tasks (a.k.a. issues) with information such as the issue type and domain. These approaches are still far from mainstream adoption, possibly because of a lack of good predictors. Inspired by previous research, we advocate that label prediction might benefit from leveraging metrics derived from communication data and social network analysis (SNA) for issues in which social interaction occurs. Thus, we study how these "social metrics" can improve the automatic labeling of open issues with API domains-categories of APIs used in the source code that solves the issue-which the literature shows that newcomers to the project consider relevant for task selection. We mined data from OSS projects' repositories and organized it in periods to reflect the seasonality of the contributors' project participation. We replicated metrics from previous work and added social metrics to the corpus to predict API-domain labels. Social metrics improved the performance of the classifiers compared to using only the issue description text in terms of precision, recall, and F-measure. Precision (0.922) increased by 15.82% and F-measure (0.942) by 15.89% for a project with high social activity. These results indicate that social metrics can help capture the patterns of social interactions in a software project and improve the labeling of issues in an issue tracker.
引用
收藏
页码:611 / 623
页数:13
相关论文
共 69 条
[1]   An Initial Exploration of the "Good First Issue" Label for Newcomer Developers [J].
Alderliesten, Jan Willem David ;
Zaidman, Andy .
2021 IEEE/ACM 13TH INTERNATIONAL WORKSHOP ON COOPERATIVE AND HUMAN ASPECTS OF SOFTWARE ENGINEERING (CHASE 2021), 2021, :117-118
[2]  
Antoniol G., 2008, P 2008 C CTR ADV STU, P304
[3]   Managing Episodic Volunteers in Free/Libre/Open Source Software Communities [J].
Barcomb, Ann ;
Stol, Klaas-Jan ;
Fitzgerald, Brian ;
Riehle, Dirk .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2022, 48 (01) :260-277
[4]  
Behl D, 2014, PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON RELIABILTY, OPTIMIZATION, & INFORMATION TECHNOLOGY (ICROIT 2014), P294, DOI 10.1109/ICROIT.2014.6798341
[5]   Studying the impact of social interactions on software quality [J].
Bettenburg, Nicolas ;
Hassan, Ahmed E. .
EMPIRICAL SOFTWARE ENGINEERING, 2013, 18 (02) :375-431
[6]  
Bicer S., 2011, Proceedings of International Conference on Software and Systems Process, P63
[7]  
Bilgin M, 2017, 2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), P661, DOI 10.1109/UBMK.2017.8093492
[8]  
Bird Christian, 2009, 2009 20th International Symposium on Software Reliability Engineering (ISSRE 2009), P109, DOI 10.1109/ISSRE.2009.17
[9]  
BURT RS, 1992, NETWORKS AND ORGANIZATIONS : STRUCTURE, FORM, AND ACTION, P57
[10]  
Cahyani D., 2021, Bull. Electr. Eng. Inform., V10