Cross-project smell-based defect prediction

被引:11
|
作者
Sotto-Mayor, Bruno [1 ]
Kalech, Meir [1 ]
机构
[1] Ben Gurion Univ Negev, Beer Sheva, Israel
关键词
Cross-project defect prediction; Defect prediction; Code smell; Mining software repositories; Software quality; Software engineering; CODE; METRICS;
D O I
10.1007/s00500-021-06254-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Defect prediction is a technique introduced to optimize the testing phase of the software development pipeline by predicting which components in the software may contain defects. Its methodology trains a classifier with data regarding a set of features measured on each component from the target software project to predict whether the component may be defective or not. However, suppose the defective information is not available in the training set. In that case, we need to rely on an alternate approach that uses the training set of external projects to train the classifier. This approached is called cross-project defect prediction. Bad code smells are a category of features that have been previously explored in defect prediction and have been shown to be a good predictor of defects. Code smells are patterns of poor development in the code and indicate flaws in its design and implementation. Although they have been previously studied in the context of defect prediction, they have not been studied as features for cross-project defect prediction. In our experiment, we train defect prediction models for 100 projects to evaluate the predictive performance of the bad code smells. We implemented four cross-project approaches known in the literature and compared the performance of 37 smells with 56 code metrics, commonly used for defect prediction. The results show that the cross-project defect prediction models trained with code smells significantly improved 6.50% on the ROC AUC compared against the code metrics.
引用
收藏
页码:14171 / 14181
页数:11
相关论文
共 50 条
  • [31] Adversarial Learning for Cross-Project Semi-Supervised Defect Prediction
    Sun, Ying
    Jing, Xiao-Yuan
    Wu, Fei
    Li, Juanjuan
    Xing, Danlei
    Chen, Haowen
    Sun, Yanfei
    IEEE ACCESS, 2020, 8 : 32674 - 32687
  • [32] An Empirical Study on the Effectiveness of Feature Selection for Cross-Project Defect Prediction
    Yu, Qiao
    Qian, Junyan
    Jiang, Shujuan
    Wu, Zhenhua
    Zhang, Gongjie
    IEEE ACCESS, 2019, 7 : 35710 - 35718
  • [33] WIFLF: An approach independent of the target project for cross-project defect prediction
    Cui, Can
    Liu, Bin
    Wang, Shihai
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2022, 34 (12)
  • [34] Cross-project software defect prediction based on the reduction and hybridization of software metrics
    Abdu, Ahmed
    Zhai, Zhengjun
    Abdo, Hakim A.
    Lee, Sungon
    Al-masni, Mohammed A.
    Gu, Yeong Hyeon
    Algabri, Redhwan
    ALEXANDRIA ENGINEERING JOURNAL, 2025, 112 : 161 - 176
  • [35] Cross-project software defect prediction based on domain adaptation learning and optimization
    Jin, Cong
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 171
  • [36] Correlation Metric Selection based Correlation Alignment for Cross-project Defect Prediction
    Niu, Jingwen
    Li, Zhiqiang
    Qi, Chao
    20TH INT CONF ON UBIQUITOUS COMP AND COMMUNICAT (IUCC) / 20TH INT CONF ON COMP AND INFORMATION TECHNOLOGY (CIT) / 4TH INT CONF ON DATA SCIENCE AND COMPUTATIONAL INTELLIGENCE (DSCI) / 11TH INT CONF ON SMART COMPUTING, NETWORKING, AND SERV (SMARTCNS), 2021, : 490 - 495
  • [37] Graph-Based Feature Learning for Cross-Project Software Defect Prediction
    Abdu, Ahmed
    Zhai, Zhengjun
    Abdo, Hakim A.
    Algabri, Redhwan
    Lee, Sungon
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (01): : 161 - 180
  • [38] A Cluster Based Feature Selection Method for Cross-Project Software Defect Prediction
    Chao Ni
    Wang-Shu Liu
    Xiang Chen
    Qing Gu
    Dao-Xu Chen
    Qi-Guo Huang
    Journal of Computer Science and Technology, 2017, 32 : 1090 - 1107
  • [39] Cross-project Defect Prediction Using a Connectivity-based Unsupervised Classifier
    Zhang, Feng
    Zheng, Quan
    Zou, Ying
    Hassan, Ahmed E.
    2016 IEEE/ACM 38TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2016, : 309 - 320
  • [40] On the time-based conclusion stability of cross-project defect prediction models
    Abdul Ali Bangash
    Hareem Sahar
    Abram Hindle
    Karim Ali
    Empirical Software Engineering, 2020, 25 : 5047 - 5083