Interpretable Software Defect Prediction from Project Effort and Static Code Metrics

Cited by: 2
Authors
Haldar, Susmita [1, 2]
Capretz, Luiz Fernando [2 ]
Affiliations
[1] Fanshawe Coll, Sch Informat Technol, London, ON N5Y 5R6, Canada
[2] Western Univ, Dept Elect & Comp Engn, London, ON N6A 3K7, Canada
Keywords
defect prediction; explainable machine learning; software quality; interpretability; cross-project defect prediction; nearest-neighbor
DOI
10.3390/computers13020052
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Software defect prediction models enable test managers to predict defect-prone modules and assist with delivering quality products. A test manager would be willing to identify the attributes that can influence defect prediction and should be able to trust the model outcomes. The objective of this research is to create software defect prediction models with a focus on interpretability. Additionally, it aims to investigate the impact of size, complexity, and other source code metrics on the prediction of software defects. This research also assesses the reliability of cross-project defect prediction. Well-known machine learning techniques, such as support vector machines, k-nearest neighbors, random forest classifiers, and artificial neural networks, were applied to publicly available PROMISE datasets. The interpretability of this approach was demonstrated by SHapley Additive exPlanations (SHAP) and local interpretable model-agnostic explanations (LIME) techniques. The developed interpretable software defect prediction models showed reliability on independent and cross-project data. Finally, the results demonstrate that static code metrics can contribute to the defect prediction models, and the inclusion of explainability assists in establishing trust in the developed models.
Pages: 23
Related papers
50 in total
  • [41] A comparison between software design and code metrics for the prediction of software fault content
    Zhao, M
    Wohlin, C
    Ohlsson, N
    Xie, M
    INFORMATION AND SOFTWARE TECHNOLOGY, 1998, 40 (14) : 801 - 809
  • [42] A Software Vulnerability Prediction Model Using Traceable Code Patterns and Software Metrics
    Sultana K.Z.
    Boyd C.B.
    Williams B.J.
    SN Computer Science, 4 (5)
  • [43] Comparison between software design and code metrics for the prediction of software fault content
    Univ Coll of Gavle-Sandviken, Gavle-Sandviken, Sweden
    Inf Software Technol, 14 (801-809):
  • [44] Analysis of Approach for Predicting Software Defect Density using Static Metrics
    Mandhan, Neeraj
    Verma, Dinesh Kumar
    Kumar, Shishir
    2015 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION & AUTOMATION (ICCCA), 2015, : 880 - 886
  • [45] Weighted software metrics aggregation and its application to defect prediction
    Ulan, Maria
    Lowe, Welf
    Ericsson, Morgan
    Wingkvist, Anna
    EMPIRICAL SOFTWARE ENGINEERING, 2021, 26 (05)
  • [46] New Conceptual Cohesion Metrics: Assessment for Software Defect Prediction
    Miholca, Diana-Lucia
    2021 23RD INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2021), 2021, : 163 - 170
  • [47] Weighted software metrics aggregation and its application to defect prediction
    Maria Ulan
    Welf Löwe
    Morgan Ericsson
    Anna Wingkvist
    Empirical Software Engineering, 2021, 26
  • [48] Revisiting the Impact of Dependency Network Metrics on Software Defect Prediction
    Gong, Lina
    Rajbahadur, Gopi Krishnan
    Hassan, Ahmed E.
    Jiang, Shujuan
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2022, 48 (12) : 5030 - 5049
  • [49] Defect prediction from static code features: current results, limitations, new approaches
    Tim Menzies
    Zach Milton
    Burak Turhan
    Bojan Cukic
    Yue Jiang
    Ayşe Bener
    Automated Software Engineering, 2010, 17 : 375 - 407
  • [50] Cross project defect prediction for open source software
    Agrawal A.
    Malhotra R.
    International Journal of Information Technology, 2022, 14 (1) : 587 - 601