Machine learning in software defect prediction: A business-driven systematic mapping study

被引:19
作者
Stradowski, Szymon [1 ,2 ]
Madeyski, Lech [2 ]
机构
[1] Nokia, Szybowcowa 2, PL-54206 Wroclaw, Dolnoslaskie, Poland
[2] Wroclaw Univ Sci & Technol, Wyb Wyspianskiego 27, PL-50370 Wroclaw, Dolnoslaskie, Poland
关键词
Software defect prediction; Machine learning; Systematic mapping study; Business applicability; Effort and cost minimisation;
D O I
10.1016/j.infsof.2022.107128
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Context: Machine learning is a valuable tool in software engineering allowing fair defect prediction capabilities at a relatively small expense. However, although the practical usage of machine learning in defect prediction has been studied over many years, there is not sufficient systematic effort to analyse its potential for business application.Objective: The following systematic mapping study aims to analyse the current state-of-the-art in terms of machine learning software defect prediction modelling and to identify and classify the emerging new trends. Notably, the analysis is done from a business perspective, evaluating the opportunities to adopt the latest techniques and methods in commercial settings to improve software quality and lower the cost of development life cycle.Method: We created a broad search universe to answer our research questions, performing an automated query through the Scopus database to identify relevant primary studies. Next, we evaluated all found studies using a classification scheme to map the extent of business adoption of machine learning software defect prediction based on the keywords used in the publications. Additionally, we use PRISMA 2020 guideline to validate reporting.Results: After the application of the selection criteria, the remaining 742 primary studies included in Scopus until February 23, 2022 were mapped to classify and structure the research area. The results confirm that the usage of commercial datasets is significantly smaller than the established datasets from NASA and open-source projects. However, we have also found meaningful emerging trends considering business needs in analysed studies.Conclusions: There is still a considerable amount of work to fully internalise business applicability in the field. Performed analysis has shown that purely academic considerations dominate in published research; however, there are also traces of in vivo results becoming more available. Notably, the created maps offer insight into future machine learning software defect prediction research opportunities.
引用
收藏
页数:15
相关论文
共 71 条
[61]   Software survey: VOSviewer, a computer program for bibliometric mapping [J].
van Eck, Nees Jan ;
Waltman, Ludo .
SCIENTOMETRICS, 2010, 84 (02) :523-538
[62]  
Wallwork Adrian, 2016, English for writing research papers
[63]   Machine/Deep Learning for Software Engineering: A Systematic Literature Review [J].
Wang, Simin ;
Huang, Liguo ;
Gao, Amiao ;
Ge, Jidong ;
Zhang, Tengfei ;
Feng, Haitao ;
Satyarth, Ishna ;
Li, Ming ;
Zhang, He ;
Ng, Vincent .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (03) :1188-1231
[64]  
Wohlin C., 2014, P 18 INT C EV ASS SO, V38, DOI [DOI 10.1145/2601248.2601268, 10.1145/2601248.2601268.10]
[65]  
Wolpert D. H., 1997, IEEE Transactions on Evolutionary Computation, V1, P67, DOI 10.1109/4235.585893
[66]   The lack of A priori distinctions between learning algorithms [J].
Wolpert, DH .
NEURAL COMPUTATION, 1996, 8 (07) :1341-1390
[67]   ConPredictor: Concurrency Defect Prediction in Real-World Applications [J].
Yu, Tingting ;
Wen, Wei ;
Han, Xue ;
Hayes, Jane Huffman .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2019, 45 (06) :558-575
[68]   Machine Learning Testing: Survey, Landscapes and Horizons [J].
Zhang, Jie M. ;
Harman, Mark ;
Ma, Lei ;
Liu, Yang .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2022, 48 (01) :1-36
[69]  
Zhou X, 2016, ASIA PAC SOFWR ENG, P153, DOI [10.1109/APSEC.2016.62, 10.1109/APSEC.2016.031]
[70]  
Zhu ZY, 2020, PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P3565