Interpretable Software Defect Prediction from Project Effort and Static Code Metrics

被引:3
作者
Haldar, Susmita [1 ,2 ]
Capretz, Luiz Fernando [2 ]
机构
[1] Fanshawe Coll, Sch Informat Technol, London, ON N5Y 5R6, Canada
[2] Western Univ, Dept Elect & Comp Engn, London, ON N6A 3K7, Canada
关键词
defect prediction; explainable machine learning; software quality; interpretability; cross-project defect prediction; NEAREST-NEIGHBOR;
D O I
10.3390/computers13020052
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Software defect prediction models enable test managers to predict defect-prone modules and assist with delivering quality products. A test manager would be willing to identify the attributes that can influence defect prediction and should be able to trust the model outcomes. The objective of this research is to create software defect prediction models with a focus on interpretability. Additionally, it aims to investigate the impact of size, complexity, and other source code metrics on the prediction of software defects. This research also assesses the reliability of cross-project defect prediction. Well-known machine learning techniques, such as support vector machines, k-nearest neighbors, random forest classifiers, and artificial neural networks, were applied to publicly available PROMISE datasets. The interpretability of this approach was demonstrated by SHapley Additive exPlanations (SHAP) and local interpretable model-agnostic explanations (LIME) techniques. The developed interpretable software defect prediction models showed reliability on independent and cross-project data. Finally, the results demonstrate that static code metrics can contribute to the defect prediction models, and the inclusion of explainability assists in establishing trust in the developed models.
引用
收藏
页数:23
相关论文
共 56 条
[1]   A novel feature selection method for data mining tasks using hybrid Sine Cosine Algorithm and Genetic Algorithm [J].
Abualigah, Laith ;
Dulaimi, Akram Jamal .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2021, 24 (03) :2161-2176
[2]   Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI) [J].
Adadi, Amina ;
Berrada, Mohammed .
IEEE ACCESS, 2018, 6 :52138-52160
[3]  
Aggarwal Charu C, 2017, An Introduction to Outlier Analysis
[4]  
Aleem S., 2015, Int. J. Softw. Eng. Appl, V6, P11, DOI [DOI 10.5121/IJSEA.2015.6302, 10.5121/ijsea.2015.6302]
[5]  
Altland H.W., 1999, TECHNOMETRICS, V41, P367, DOI DOI 10.1080/00401706.1999.10485936
[6]  
Aydin Z.B.G., 2020, P 2020 5 INT C COMP, P1
[7]   A three-stage transfer learning framework for multi-source cross-project software defect prediction [J].
Bai, Jiaojiao ;
Jia, Jingdong ;
Capretz, Luiz Fernando .
INFORMATION AND SOFTWARE TECHNOLOGY, 2022, 150
[8]   Software defect prediction via optimal trained convolutional neural network [J].
Balasubramaniam, S. ;
Gollagi, Shantappa G. .
ADVANCES IN ENGINEERING SOFTWARE, 2022, 169
[9]   Empirical Analysis of Rank Aggregation-Based Multi-Filter Feature Selection Methods in Software Defect Prediction [J].
Balogun, Abdullateef O. ;
Basri, Shuib ;
Mahamad, Saipunidzam ;
Abdulkadir, Said Jadid ;
Capretz, Luiz Fernando ;
Imam, Abdullahi A. ;
Almomani, Malek A. ;
Adeyemo, Victor E. ;
Kumar, Ganesh .
ELECTRONICS, 2021, 10 (02) :1-16
[10]  
Balogun AO, 2019, J ENG SCI TECHNOL, V14, P3294