Pull request latency explained: an empirical overview

被引:10
作者
Zhang, Xunhui [1 ]
Yu, Yue [1 ]
Wang, Tao [1 ]
Rastogi, Ayushi [2 ]
Wang, Huaimin [1 ]
机构
[1] Natl Univ Def Technol, Changsha, Peoples R China
[2] Univ Groningen, Groningen, Netherlands
关键词
Pull-based development; Pull request latency; Distributed software development; GitHub;
D O I
10.1007/s10664-022-10143-4
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Pull request latency evaluation is an essential application of effort evaluation in the pull-based development scenario. It can help the reviewers sort the pull request queue, remind developers about the review processing time, speed up the review process and accelerate software development. There is a lack of work that systematically organizes the factors that affect pull request latency. Also, there is no related work discussing the differences and variations in characteristics in different scenarios and contexts. In this paper, we collected relevant factors through a literature review approach. Then we assessed their relative importance in five scenarios and six different contexts using the mixed-effects linear regression model. The most important factors differ in different scenarios. The length of the description is most important when pull requests are submitted. The existence of comments is most important when closing pull requests, using CI tools, and when the contributor and the integrator are different. When there exist comments, the latency of the first comment is the most important. Meanwhile, the influence of factors may change in different contexts. For example, the number of commits in a pull request has a more significant impact on pull request latency when closing than submitting due to changes in contributions brought about by the review process. Both human and bot comments are positively correlated with pull request latency. In contrast, the bot's first comments are more strongly correlated with latency, but the number of comments is less correlated. Future research and tool implementation needs to consider the impact of different contexts. Researchers can conduct related studies based on our publicly available datasets and replication scripts.
引用
收藏
页数:38
相关论文
共 71 条
[1]  
Altaleb A, 2020, 2020 9TH INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY AND MANAGEMENT (ICITM 2020), P291, DOI [10.1109/ICITM48982.2020.9080362, 10.1109/icitm48982.2020.9080362]
[2]  
[Anonymous], 2007, GUIDELINES PERFORMIN
[3]  
[Anonymous], 2011, P 7 INT C PRED MOD S
[4]  
[Anonymous], 2011, NATL ACAD PR
[5]  
Atkins M, 2012, GERRIT CODE REV GITH
[6]  
Bacchelli A, 2013, PROCEEDINGS OF THE 35TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2013), P712, DOI 10.1109/ICSE.2013.6606617
[7]  
Baysal O., 2012, 2012 19th Working Conference on Reverse Engineering (WCRE), P447, DOI 10.1109/WCRE.2012.54
[8]   Investigating technical and non-technical factors influencing modern code review [J].
Baysal, Olga ;
Kononenko, Oleksii ;
Holmes, Reid ;
Godfrey, Michael W. .
EMPIRICAL SOFTWARE ENGINEERING, 2016, 21 (03) :932-959
[9]   Studying the Impact of Adopting Continuous Integration on the Delivery Time of Pull Requests [J].
Bernardo, Joao Helis ;
da Costa, Daniel Alencar ;
Kulesza, Uira .
2018 IEEE/ACM 15TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR), 2018, :131-141
[10]  
Bernhart Mario, 2010, Proceedings of the 2010 Agile Conference (AGILE 2010), P44, DOI 10.1109/AGILE.2010.18