Machine learning in software defect prediction: A business-driven systematic mapping study

被引:18
作者
Stradowski, Szymon [1 ,2 ]
Madeyski, Lech [2 ]
机构
[1] Nokia, Szybowcowa 2, PL-54206 Wroclaw, Dolnoslaskie, Poland
[2] Wroclaw Univ Sci & Technol, Wyb Wyspianskiego 27, PL-50370 Wroclaw, Dolnoslaskie, Poland
关键词
Software defect prediction; Machine learning; Systematic mapping study; Business applicability; Effort and cost minimisation;
D O I
10.1016/j.infsof.2022.107128
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Context: Machine learning is a valuable tool in software engineering allowing fair defect prediction capabilities at a relatively small expense. However, although the practical usage of machine learning in defect prediction has been studied over many years, there is not sufficient systematic effort to analyse its potential for business application.Objective: The following systematic mapping study aims to analyse the current state-of-the-art in terms of machine learning software defect prediction modelling and to identify and classify the emerging new trends. Notably, the analysis is done from a business perspective, evaluating the opportunities to adopt the latest techniques and methods in commercial settings to improve software quality and lower the cost of development life cycle.Method: We created a broad search universe to answer our research questions, performing an automated query through the Scopus database to identify relevant primary studies. Next, we evaluated all found studies using a classification scheme to map the extent of business adoption of machine learning software defect prediction based on the keywords used in the publications. Additionally, we use PRISMA 2020 guideline to validate reporting.Results: After the application of the selection criteria, the remaining 742 primary studies included in Scopus until February 23, 2022 were mapped to classify and structure the research area. The results confirm that the usage of commercial datasets is significantly smaller than the established datasets from NASA and open-source projects. However, we have also found meaningful emerging trends considering business needs in analysed studies.Conclusions: There is still a considerable amount of work to fully internalise business applicability in the field. Performed analysis has shown that purely academic considerations dominate in published research; however, there are also traces of in vivo results becoming more available. Notably, the created maps offer insight into future machine learning software defect prediction research opportunities.
引用
收藏
页数:15
相关论文
共 71 条
  • [61] An Investigation of Cross-Project Learning in Online Just-In-Time Software Defect Prediction
    Tabassum, Sadia
    Minku, Leandro L.
    Feng, Danyi
    Cabral, George G.
    Song, Liyan
    [J]. 2020 ACM/IEEE 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2020), 2020, : 554 - 565
  • [62] Software survey: VOSviewer, a computer program for bibliometric mapping
    van Eck, Nees Jan
    Waltman, Ludo
    [J]. SCIENTOMETRICS, 2010, 84 (02) : 523 - 538
  • [63] Wallwork A, 2016, English for writing research papers, P265
  • [64] Machine/Deep Learning for Software Engineering: A Systematic Literature Review
    Wang, Simin
    Huang, Liguo
    Gao, Amiao
    Ge, Jidong
    Zhang, Tengfei
    Feng, Haitao
    Satyarth, Ishna
    Li, Ming
    Zhang, He
    Ng, Vincent
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (03) : 1188 - 1231
  • [65] Wolpert D. H., 1997, IEEE Transactions on Evolutionary Computation, V1, P67, DOI 10.1109/4235.585893
  • [66] The lack of A priori distinctions between learning algorithms
    Wolpert, DH
    [J]. NEURAL COMPUTATION, 1996, 8 (07) : 1341 - 1390
  • [67] ConPredictor: Concurrency Defect Prediction in Real-World Applications
    Yu, Tingting
    Wen, Wei
    Han, Xue
    Hayes, Jane Huffman
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2019, 45 (06) : 558 - 575
  • [68] Machine Learning Testing: Survey, Landscapes and Horizons
    Zhang, Jie M.
    Harman, Mark
    Ma, Lei
    Liu, Yang
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2022, 48 (01) : 1 - 36
  • [69] Zhou X, 2016, ASIA PAC SOFWR ENG, P153, DOI [10.1109/APSEC.2016.62, 10.1109/APSEC.2016.031]
  • [70] Zhu ZY, 2020, PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P3565