Cross-project smell-based defect prediction

Citations: 11

Authors:

Sotto-Mayor, Bruno [1]

Kalech, Meir [1]

Affiliations:

[1] Ben Gurion Univ Negev, Beer Sheva, Israel

Source:

SOFT COMPUTING | 2021, Vol. 25, Issue 22

Keywords:

Cross-project defect prediction; Defect prediction; Code smell; Mining software repositories; Software quality; Software engineering; CODE; METRICS;

DOI:

10.1007/s00500-021-06254-7

Chinese Library Classification (CLC) number:

TP18 [Theory of artificial intelligence];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Defect prediction is a technique introduced to optimize the testing phase of the software development pipeline by predicting which components in the software may contain defects. It trains a classifier on a set of features measured on each component of the target software project to predict whether a component may be defective. However, when defect information for the target project is not available in the training set, we need to rely on an alternative approach that trains the classifier on data from external projects. This approach is called cross-project defect prediction. Bad code smells are a category of features that have been previously explored in defect prediction and have been shown to be good predictors of defects. Code smells are patterns of poor development practice in the code and indicate flaws in its design and implementation. Although they have been studied in the context of defect prediction, they have not been studied as features for cross-project defect prediction. In our experiment, we train defect prediction models for 100 projects to evaluate the predictive performance of bad code smells. We implemented four cross-project approaches known in the literature and compared the performance of 37 smells with that of 56 code metrics commonly used for defect prediction. The results show that cross-project defect prediction models trained with code smells significantly improved ROC AUC by 6.50% compared with models trained with code metrics.
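
The cross-project setup described in the abstract can be illustrated with a minimal sketch: train on every project except the target, score the target's components, and measure ROC AUC. Everything here is an assumption made for illustration. The nearest-centroid scorer is a stand-in, not one of the four approaches evaluated in the paper, and the project names, feature values, and function names (`roc_auc`, `train`, `score`, `cross_project_auc`) are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

# One component: (feature vector, label), where label 1 means defective.
Row = Tuple[List[float], int]


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """ROC AUC as the fraction of (defective, clean) pairs in which the
    defective component receives the higher score; ties count as 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes are needed to compute ROC AUC")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def centroid(vectors: List[List[float]]) -> List[float]:
    """Column-wise mean of a non-empty list of feature vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def train(rows: List[Row]) -> Tuple[List[float], List[float]]:
    """Stand-in model (assumption): one centroid per class."""
    defective = [x for x, y in rows if y == 1]
    clean = [x for x, y in rows if y == 0]
    return centroid(defective), centroid(clean)


def score(model: Tuple[List[float], List[float]], x: List[float]) -> float:
    """Higher score means closer to the defective centroid."""
    c_def, c_clean = model
    d_def = sum((a - b) ** 2 for a, b in zip(x, c_def))
    d_clean = sum((a - b) ** 2 for a, b in zip(x, c_clean))
    return d_clean - d_def


def cross_project_auc(projects: Dict[str, List[Row]], target: str) -> float:
    """Train on all projects except `target`, then evaluate on `target`."""
    train_rows = [row for name, rows in projects.items()
                  if name != target for row in rows]
    model = train(train_rows)
    labels = [y for _, y in projects[target]]
    scores = [score(model, x) for x, _ in projects[target]]
    return roc_auc(labels, scores)


# Hypothetical toy data: two features per component (e.g. two smell counts).
projects: Dict[str, List[Row]] = {
    "A": [([5.0, 3.0], 1), ([1.0, 0.0], 0), ([4.0, 2.0], 1), ([0.0, 1.0], 0)],
    "B": [([6.0, 2.0], 1), ([1.0, 1.0], 0)],
    "C": [([3.0, 3.0], 1), ([0.0, 0.0], 0), ([2.0, 0.0], 0)],
}

if __name__ == "__main__":
    print(cross_project_auc(projects, "C"))
```

The key point of the sketch is that no label from the target project reaches `train`; the target's labels are used only to evaluate the ranking. Comparing two feature sets, as the abstract does for smells and code metrics, would mean running the same procedure once per feature set and comparing the resulting AUC values.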


Pages: 14171-14181

Page count: 11

50 items in total

[31] Adversarial Learning for Cross-Project Semi-Supervised Defect Prediction
Sun, Ying
Jing, Xiao-Yuan
Wu, Fei
Li, Juanjuan
Xing, Danlei
Chen, Haowen
Sun, Yanfei
IEEE ACCESS, 2020, 8 : 32674 - 32687
[32] An Empirical Study on the Effectiveness of Feature Selection for Cross-Project Defect Prediction
Yu, Qiao
Qian, Junyan
Jiang, Shujuan
Wu, Zhenhua
Zhang, Gongjie
IEEE ACCESS, 2019, 7 : 35710 - 35718
[33] WIFLF: An approach independent of the target project for cross-project defect prediction
Cui, Can
Liu, Bin
Wang, Shihai
JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2022, 34 (12)
[34] Cross-project software defect prediction based on the reduction and hybridization of software metrics
Abdu, Ahmed
Zhai, Zhengjun
Abdo, Hakim A.
Lee, Sungon
Al-masni, Mohammed A.
Gu, Yeong Hyeon
Algabri, Redhwan
ALEXANDRIA ENGINEERING JOURNAL, 2025, 112 : 161 - 176
[35] Cross-project software defect prediction based on domain adaptation learning and optimization
Jin, Cong
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 171
[36] Correlation Metric Selection based Correlation Alignment for Cross-project Defect Prediction
Niu, Jingwen
Li, Zhiqiang
Qi, Chao
20TH INT CONF ON UBIQUITOUS COMP AND COMMUNICAT (IUCC) / 20TH INT CONF ON COMP AND INFORMATION TECHNOLOGY (CIT) / 4TH INT CONF ON DATA SCIENCE AND COMPUTATIONAL INTELLIGENCE (DSCI) / 11TH INT CONF ON SMART COMPUTING, NETWORKING, AND SERV (SMARTCNS), 2021 : 490 - 495
[37] Graph-Based Feature Learning for Cross-Project Software Defect Prediction
Abdu, Ahmed
Zhai, Zhengjun
Abdo, Hakim A.
Algabri, Redhwan
Lee, Sungon
CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (01) : 161 - 180
[38] A Cluster Based Feature Selection Method for Cross-Project Software Defect Prediction
Chao Ni
Wang-Shu Liu
Xiang Chen
Qing Gu
Dao-Xu Chen
Qi-Guo Huang
Journal of Computer Science and Technology, 2017, 32 : 1090 - 1107
[39] Cross-project Defect Prediction Using a Connectivity-based Unsupervised Classifier
Zhang, Feng
Zheng, Quan
Zou, Ying
Hassan, Ahmed E.
2016 IEEE/ACM 38TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2016 : 309 - 320
[40] On the time-based conclusion stability of cross-project defect prediction models
Abdul Ali Bangash
Hareem Sahar
Abram Hindle
Karim Ali
Empirical Software Engineering, 2020, 25 : 5047 - 5083
