Machine learning in software defect prediction: A business-driven systematic mapping study

被引:18
作者
Stradowski, Szymon [1 ,2 ]
Madeyski, Lech [2 ]
机构
[1] Nokia, Szybowcowa 2, PL-54206 Wroclaw, Dolnoslaskie, Poland
[2] Wroclaw Univ Sci & Technol, Wyb Wyspianskiego 27, PL-50370 Wroclaw, Dolnoslaskie, Poland
关键词
Software defect prediction; Machine learning; Systematic mapping study; Business applicability; Effort and cost minimisation;
D O I
10.1016/j.infsof.2022.107128
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Context: Machine learning is a valuable tool in software engineering allowing fair defect prediction capabilities at a relatively small expense. However, although the practical usage of machine learning in defect prediction has been studied over many years, there is not sufficient systematic effort to analyse its potential for business application.Objective: The following systematic mapping study aims to analyse the current state-of-the-art in terms of machine learning software defect prediction modelling and to identify and classify the emerging new trends. Notably, the analysis is done from a business perspective, evaluating the opportunities to adopt the latest techniques and methods in commercial settings to improve software quality and lower the cost of development life cycle.Method: We created a broad search universe to answer our research questions, performing an automated query through the Scopus database to identify relevant primary studies. Next, we evaluated all found studies using a classification scheme to map the extent of business adoption of machine learning software defect prediction based on the keywords used in the publications. Additionally, we use PRISMA 2020 guideline to validate reporting.Results: After the application of the selection criteria, the remaining 742 primary studies included in Scopus until February 23, 2022 were mapped to classify and structure the research area. The results confirm that the usage of commercial datasets is significantly smaller than the established datasets from NASA and open-source projects. However, we have also found meaningful emerging trends considering business needs in analysed studies.Conclusions: There is still a considerable amount of work to fully internalise business applicability in the field. Performed analysis has shown that purely academic considerations dominate in published research; however, there are also traces of in vivo results becoming more available. Notably, the created maps offer insight into future machine learning software defect prediction research opportunities.
引用
收藏
页数:15
相关论文
共 71 条
  • [1] Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
    Adadi, Amina
    Berrada, Mohammed
    [J]. IEEE ACCESS, 2018, 6 : 52138 - 52160
  • [2] Using Faults-Slip-Through Metric As A Predictor of Fault-Proneness
    Afzal, Wasif
    [J]. 17TH ASIA PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2010), 2010, : 414 - 422
  • [3] Demo: Automatically Retrainable Self Improving Model for the Automated Classification of Software Incidents into Multiple Classes
    Agrawal, Badal
    Mishra, Mohit
    [J]. 2021 IEEE 41ST INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2021), 2021, : 1110 - 1113
  • [4] Convolutional Neural Networks over Control Flow Graphs for Software Defect Prediction
    Anh Viet Phan
    Minh Le Nguyen
    Lam Thu Bui
    [J]. 2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 45 - 52
  • [5] Phan AV, 2017, 2017 21ST ASIA PACIFIC SYMPOSIUM ON INTELLIGENT AND EVOLUTIONARY SYSTEMS (IES), P37, DOI 10.1109/IESYS.2017.8233558
  • [6] [Anonymous], 2014, P 18 INT C EV ASS SO, DOI 10.1145/2601248.2601268.10
  • [7] A systematic and comprehensive investigation of methods to build and evaluate fault prediction models
    Arisholm, Erik
    Briand, Lionel C.
    Johannessen, Eivind B.
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2010, 83 (01) : 2 - 17
  • [8] B.V. Elsevier, 2022, SCOPUS
  • [9] High-level software requirements and iteration changes: a predictive model
    Blincoe, Kelly
    Dehghan, Ali
    Salaou, Abdoul-Djawadou
    Neal, Adam
    Linaker, Johan
    Damian, Daniela
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2019, 24 (03) : 1610 - 1648
  • [10] Getting Defect Prediction into Industrial Practice: The ELFF Tool
    Bowes, David
    Counsell, Steve
    Hall, Tracy
    Petric, Jean
    Shippey, Thomas
    [J]. 2017 IEEE 28TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS (ISSREW 2017), 2017, : 44 - 47