Correlations between bugginess and time-based commit characteristics

被引:29
作者
Eyolfson, Jon [1 ]
Tan, Lin [1 ]
Lam, Patrick [1 ]
机构
[1] Univ Waterloo, Waterloo, ON N2L 3G1, Canada
关键词
Bug detection; Empirical study; Source control system; SOFTWARE; FAULTS;
D O I
10.1007/s10664-013-9245-0
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Modern software is often developed over many years with hundreds of thousands of commits. Commit metadata is a rich source of time-based characteristics, including the commit's time of day and the commit frequency and seniority of its author. The "bugginess" of a commit is also a critical property of that commit. In this paper, we investigate the correlation between a commit's time-based characteristics and its "bugginess"; such results can be useful for software developers and software engineering researchers. For instance, developers or code reviewers might be well-advised to thoroughly verify commits that are more likely to be buggy. In this paper, we study the correlation between a commit's bugginess and the time of day of the commit, the day of week of the commit, the commit frequency and seniority of the commit authors, and whether or not the developers have marked a commit as a "stable" commit. We survey three widely-used open source projects: the Linux kernel, PostgreSQL, and the Xorg server. Our main findings include: (1) commits between midnight and 4 AM (referred to as late-night commits) are significantly buggier and commits between 8 AM and noon are less buggy, implying that developers may want to double-check their own late-night commits; (2) daily-committing developers produce less-buggy commits, indicating that we may want to promote the practice of daily-committing developers reviewing other developers' commits; (3) the bugginess of commits versus day-of-week varies for different software projects; and (4) stable commits are significantly less buggy than commits in general.
引用
收藏
页码:1009 / 1039
页数:31
相关论文
共 33 条
  • [1] [Anonymous], 2008, ICSE
  • [2] [Anonymous], 2008, Proceedings of the 16th ACM SIGSOFT International Symposium on Foundations of software engineering, DOI [10.1145/1453101.1453106, DOI 10.1145/1453101.1453106]
  • [3] [Anonymous], ICSE
  • [4] [Anonymous], ASE
  • [5] [Anonymous], 2007, PROMISE
  • [6] Benjamini Y, 2001, ANN STAT, V29, P1165
  • [7] Bird C, 2012, P INT WORK C MIN SOF
  • [8] Bird C., 2011, 19 ACM SIGSOFT S 13, P4, DOI DOI 10.1145/2025113.2025119
  • [9] Fair and Balanced? Bias in Bug-Fix Datasets
    Bird, Christian
    Bachmann, Adrian
    Aune, Eirik
    Duffy, John
    Bernstein, Abraham
    Filkov, Vladimir
    Devanbu, Premkumar
    [J]. 7TH JOINT MEETING OF THE EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND THE ACM SIGSOFT SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, 2009, : 121 - 130
  • [10] Does Distributed Development Affect Software Quality? An Empirical Case Study of Windows Vista
    Bird, Christian
    Nagappan, Nachiappan
    Devanbu, Premkumar
    Gall, Harald
    Murphy, Brendan
    [J]. 2009 31ST INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, PROCEEDINGS, 2009, : 518 - +