SZZ Unleashed: An Open Implementation of the SZZ Algorithm - Featuring Example Usage in a Study of Just-in-Time Bug Prediction for the Jenkins Project

被引:57
作者
Borg, Markus [1 ]
Svensson, Oscar [2 ]
Berg, Kristian [2 ]
Hansson, Daniel [3 ]
机构
[1] RISE Res Inst Sweden AB, ICT SICS, Lund, Sweden
[2] Lund Univ, Dept Comp Sci, Lund, Sweden
[3] Verifyter AB, Lund, Sweden
来源
PROCEEDINGS OF THE 3RD ACM SIGSOFT INTERNATIONAL WORKSHOP ON MACHINE LEARNING TECHNIQUES FOR SOFTWARE QUALITY EVALUATION (MALTESQUE '19) | 2019年
关键词
SZZ; defect prediction; mining software repositories; issue tracking;
D O I
10.1145/3340482.3342742
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Machine learning applications in software engineering often rely on detailed information about bugs. While issue trackers often contain information about when bugs were fixed, details about when they were introduced to the system are often absent. As a remedy, researchers often rely on the SZZ algorithm as a heuristic approach to identify bug-introducing software changes. Unfortunately, as reported in a recent systematic literature review, few researchers have made their SZZ implementations publicly available. Consequently, there is a risk that research effort is wasted as new projects based on SZZ output need to initially reimplement the approach. Furthermore, there is a risk that newly developed (closed source) SZZ implementations have not been properly tested, thus conducting research based on their output might introduce threats to validity. We present SZZ Unleashed, an open implementation of the SZZ algorithm for git repositories. This paper describes our implementation along with a usage example for the Jenkins project, and conclude with an illustrative study on just-in-time bug prediction. We hope to continue evolving SZZ Unleashed on GitHub, and warmly invite the community to contribute.
引用
收藏
页码:7 / 12
页数:6
相关论文
共 27 条
  • [1] Berg K., 2018, THESIS
  • [2] Canfora Gerardo, 2007, 4 INT WORKSHOP MININ, P14
  • [3] Challenges and opportunities for software change request repositories: a systematic mapping study
    Cavalcanti, Yguarata Cerqueira
    da Mota Silveira Neto, Paulo Anselmo
    Machado, Ivan do Carmo
    Vale, Tassio Ferreira
    de Almeida, Eduardo Santana
    de Lemos Meira, Silvio Romero
    [J]. JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2014, 26 (07) : 620 - 653
  • [4] Correia J., 2017, OLD SZZ
  • [5] Czerwonka Jacek, 2011, Proceedings 2011 IEEE Fourth International Conference on Software Testing, Verification and Validation (ICST 2011), P357, DOI 10.1109/ICST.2011.24
  • [6] On the Relationship Between Change Coupling and Software Defects
    D'Ambros, Marco
    Lanza, Michele
    Robbes, Romain
    [J]. 16TH WORKING CONFERENCE ON REVERSE ENGINEERING (WCRE 2009), 2009, : 135 - 144
  • [7] de Mario Andre, 2016, P 31 ANN ACM S APPL, P1472, DOI DOI 10.1145/2851613.2851786
  • [8] A systematic review on regression test selection techniques
    Engstrom, Emelie
    Runeson, Per
    Skoglund, Mats
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2010, 52 (01) : 14 - 30
  • [9] A critique of software defect prediction models
    Fenton, NE
    Neil, M
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1999, 25 (05) : 675 - 689
  • [10] Ferrus T, 2008, 2008 IEEE SILICON NANOELECTRONICS WORKSHOP, P32