Feature Importance in the Context of Traditional and Just-In-Time Software Defect Prediction Models

被引:0
作者
Haldar, Susmita [1 ]
Capretz, Luiz Fernando [2 ]
机构
[1] Fanshawe Coll, Sch Informat Technol, London, ON, Canada
[2] Western Univ, Dept Elect & Comp Engn, London, ON, Canada
来源
2024 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CCECE 2024 | 2024年
关键词
Software Defect Prediction; Source Code Metrics; Process Metrics; Just-In-Time Defect Prediction; Feature Importance;
D O I
10.1109/CCECE59415.2024.10667167
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Software defect prediction models can assist software testing initiatives by prioritizing testing error-prone modules. In recent years, in addition to the traditional defect prediction model approach of predicting defects from class, modules, etc., Just-In-Time defect prediction research, which focuses on the change history of software products is becoming prominent. For building these defect prediction models, it is important to understand which features are primary contributors to these classifiers. This study considered developing defect prediction models incorporating the traditional and the Just-In-Time approaches from the publicly available datasets including the Apache Camel project. A multi-layer Deep Learning algorithm was applied to these datasets in comparison with machine learning algorithms. The prediction models developed using the Deep Learning algorithm achieved an accuracy of 80% and 86%, with the area under receiving operator curve (AUC) scores of 66% and 78% for traditional and Just-In-Time defect prediction, respectively. Finally, the feature importance of these models was identified using a model-specific integrated gradient method and a model-agnostic Shapley Additive Explanation (SHAP) technique.
引用
收藏
页码:818 / 822
页数:5
相关论文
共 22 条
[1]  
Bener A., 2015, The art and science of analyzing software data, P453
[2]   Feature Sets in Just-in-Time Defect Prediction: An Empirical Evaluation [J].
Bludau, Peter ;
Pretschner, Alexander .
PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON PREDICTIVE MODELS AND DATA ANALYTICS IN SOFTWARE ENGINEERING, PROMISE 2022, 2022, :22-31
[3]   An introduction to ROC analysis [J].
Fawcett, Tom .
PATTERN RECOGNITION LETTERS, 2006, 27 (08) :861-874
[4]   Predicting the number of defects in a new software version [J].
Felix, Ebubeogu Amarachukwu ;
Lee, Sai Peck .
PLOS ONE, 2020, 15 (03)
[5]   On the use of deep learning in software defect prediction [J].
Giray, Gorkem ;
Bennin, Kwabena Ebo ;
Koksal, Omer ;
Babur, Onder ;
Tekinerdogan, Bedir .
JOURNAL OF SYSTEMS AND SOFTWARE, 2023, 195
[6]  
Grigoriou M.-S., 2020, CASCON 20, P123
[7]  
Haldar Susmita, 2023, 2023 7th International Conference on Intelligent Computing and Control Systems (ICICCS), P150, DOI 10.1109/ICICCS56967.2023.10142534
[8]   Interpretable Software Defect Prediction from Project Effort and Static Code Metrics [J].
Haldar, Susmita ;
Capretz, Luiz Fernando .
COMPUTERS, 2024, 13 (02)
[9]   Enhanced Integrated Gradients: improving interpretability of deep learning models using splicing codes as a case study [J].
Jha, Anupama ;
Aicher, Joseph K. ;
Gazzara, Matthew R. ;
Singh, Deependra ;
Barash, Yoseph .
GENOME BIOLOGY, 2020, 21 (01)
[10]  
Jureckzo Marian, 2010, Zenodo, DOI 10.5281/zenodo.268442