Interpretable Software Defect Prediction from Project Effort and Static Code Metrics

被引:2
|
作者
Haldar, Susmita [1 ,2 ]
Capretz, Luiz Fernando [2 ]
机构
[1] Fanshawe Coll, Sch Informat Technol, London, ON N5Y 5R6, Canada
[2] Western Univ, Dept Elect & Comp Engn, London, ON N6A 3K7, Canada
关键词
defect prediction; explainable machine learning; software quality; interpretability; cross-project defect prediction; NEAREST-NEIGHBOR;
D O I
10.3390/computers13020052
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Software defect prediction models enable test managers to predict defect-prone modules and assist with delivering quality products. A test manager would be willing to identify the attributes that can influence defect prediction and should be able to trust the model outcomes. The objective of this research is to create software defect prediction models with a focus on interpretability. Additionally, it aims to investigate the impact of size, complexity, and other source code metrics on the prediction of software defects. This research also assesses the reliability of cross-project defect prediction. Well-known machine learning techniques, such as support vector machines, k-nearest neighbors, random forest classifiers, and artificial neural networks, were applied to publicly available PROMISE datasets. The interpretability of this approach was demonstrated by SHapley Additive exPlanations (SHAP) and local interpretable model-agnostic explanations (LIME) techniques. The developed interpretable software defect prediction models showed reliability on independent and cross-project data. Finally, the results demonstrate that static code metrics can contribute to the defect prediction models, and the inclusion of explainability assists in establishing trust in the developed models.
引用
收藏
页数:23
相关论文
共 50 条
  • [21] Software defect association mining and defect correction effort prediction
    Song, QB
    Shepperd, M
    Cartwright, M
    Mair, C
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2006, 32 (02) : 69 - 82
  • [22] A Study on the Significance of Software Metrics in Defect Prediction
    Xia, Ye
    Yan, Guoying
    Si, Qianran
    2013 SIXTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2013, : 343 - 346
  • [23] Periodic Developer Metrics in Software Defect Prediction
    Kini, Seldag Ozcan
    Tosun, Ayse
    2018 IEEE 18TH INTERNATIONAL WORKING CONFERENCE ON SOURCE CODE ANALYSIS AND MANIPULATION (SCAM), 2018, : 72 - 81
  • [24] Software maintainability prediction by data mining of software code metrics
    Kaur, Arvinder
    Kaur, Kamaldeep
    Pathak, Kaushal
    2014 INTERNATIONAL CONFERENCE ON DATA MINING AND INTELLIGENT COMPUTING (ICDMIC), 2014,
  • [26] The Stability of Threshold Values for Software Metrics in Software Defect Prediction
    Mausa, Goran
    Grbac, Tihana Galinac
    MODEL AND DATA ENGINEERING (MEDI 2017), 2017, 10563 : 81 - 95
  • [27] Static source code metrics and static analysis warnings for fine-grained just-in-time defect prediction
    Trautsch, Alexander
    Herbold, Steffen
    Grabowski, Jens
    2020 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2020), 2020, : 127 - 138
  • [28] Within-Project Defect Prediction of Infrastructure-as-Code Using Product and Process Metrics
    Dalla Palma, Stefano
    Di Nucci, Dario
    Palomba, Fabio
    Tamburri, Damian A.
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2022, 48 (06) : 2086 - 2104
  • [29] Empirical Investigation of Code and Process Metrics for Defect Prediction
    Han, Wenjing
    Lung, Chung-Horng
    Ajila, Samuel A.
    2016 IEEE SECOND INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2016, : 436 - 439
  • [30] Effective Estimation of Modules' Metrics in Software Defect Prediction
    Fakhrahmad, S. M.
    Sami, A.
    WORLD CONGRESS ON ENGINEERING 2009, VOLS I AND II, 2009, : 206 - 211