A Systematic Survey of Just-in-Time Software Defect Prediction

被引:40
作者
Zhao, Yunhua [1 ]
Damevski, Kostadin [2 ]
Chen, Hui [1 ,3 ]
机构
[1] CUNY, Grad Ctr, Dept Comp Sci, 365 5th Ave, New York, NY 10016 USA
[2] Virginia Commonwealth Univ, Dept Comp Sci, 401 West Main St, Richmond, VA 23284 USA
[3] CUNY, Brooklyn Coll, Dept Comp & Informat Sci, 2900 Bedford Ave, Brooklyn, NY 11210 USA
关键词
Software defect prediction; release software defect prediction; just-in-time software defect prediction; change-level software defect prediction; machine learning; searching-based algorithms; software change metrics; change defect density; REVIEWS; MODELS; IMPACT;
D O I
10.1145/3567550
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Recent years have experienced sustained focus in research on software defect prediction that aims to predict the likelihood of software defects. Moreover, with the increased interest in continuous deployment, a variant of software defect prediction called Just-in-Time Software Defect Prediction ( JIT-SDP) focuses on predicting whether each incremental software change is defective. JIT-SDP is unique in that it consists of two interconnected data streams, one consisting of the arrivals of software changes stemming from design and implementation, and the other the (defective or clean) labels of software changes resulting from quality assurance processes. We present a systematic survey of 67 JIT-SDP studies with the objective to help researchers advance the state of the art in JIT-SDP and to help practitioners become familiar with recent progress. We summarize best practices in each phase of the JIT-SDP workflow, carry out a meta-analysis of prior studies, and suggest future research directions. Our meta-analysis of JIT-SDP studies indicates, among other findings, that the predictive performance correlates with change defect ratio, suggesting that JIT-SDP is most performant in projects that experience relatively high defect ratios. Future research directions for JIT-SDP include situating each technique into its application domain, reliability-aware JIT-SDP, and user-centered JIT-SDP.
引用
收藏
页数:35
相关论文
共 95 条
[1]  
[Anonymous], 2005, ACM SIGSOFT SOFTW EN, DOI 10.1145/1083142.1083147
[2]   Just-in-time software defect prediction using deep temporal convolutional networks [J].
Ardimento, Pasquale ;
Aversano, Lerina ;
Bernardi, Mario Luca ;
Cimitile, Marta ;
Iammarino, Martina .
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (05) :3981-4001
[3]  
Aversano L., 2007, Proceedings of the Foundations of Software Engineering, P19, DOI 10.1145/1294948.1294954
[4]  
Barnett JG, 2016, 13TH WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2016), P496, DOI [10.1109/MSR.2016.063, 10.1145/2901739.2903496]
[5]   Revisiting the Impact of Concept Drift on Just-in-Time Quality Assurance [J].
Bennin, Kwabena E. ;
Ali, Nauman bin ;
Borstler, Jurgen ;
Yu, Xiao .
2020 IEEE 20TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY (QRS 2020), 2020, :53-59
[6]  
Box G.E.P., 1979, Robustness in statistics, P201, DOI [10.1016/B978-0-12-438150-6.50018-2, DOI 10.1016/B978-0-12-438150-6.50018-2]
[7]   Statistical modeling: The two cultures [J].
Breiman, L .
STATISTICAL SCIENCE, 2001, 16 (03) :199-215
[8]   Class Imbalance Evolution and Verification Latency in Just-in-Time Software Defect Prediction [J].
Cabral, George G. ;
Minku, Leandro L. ;
Shihab, Emad ;
Mujahid, Suhaib .
2019 IEEE/ACM 41ST INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2019), 2019, :666-676
[9]  
Neto EC, 2018, 2018 25TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING (SANER 2018), P380, DOI 10.1109/SANER.2018.8330225
[10]   Software fault prediction: A literature review and current trends [J].
Catal, Cagatay .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (04) :4626-4636