An empirical evaluation of defect prediction approaches in within-project and cross-project context

被引:5
|
作者
Bhat, Nayeem Ahmad [1 ]
Farooq, Sheikh Umar [1 ]
机构
[1] Univ Kashmir, Dept Comp Sci, North Campus, Srinagar, J&K, India
关键词
Cross-project defect prediction; Software quality assurance; Source code metrics; Process metrics; Churn of source code; Feature selection; Imbalance learning; STATIC CODE ATTRIBUTES; BORDERLINE-SMOTE; SOFTWARE; METRICS; FAULTS; VALIDATION; MODELS;
D O I
10.1007/s11219-023-09615-7
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The software defect prediction approaches are evaluated, in within-project context only, with only a few other approaches, according to distinct scenarios and performance indicators. So, we conduct various experiments to evaluate well-known defect prediction approaches using different performance indicators. The evaluations are performed in the scenario of ranking the entities - with and without considering the effort to review the entities and classifying entities in within-project as well as cross-project contexts. The effect of imbalanced datasets on the ranking of the approaches is also evaluated. Our results indicate that in within-project as well as cross-project context, process metrics, the churn of source code, and entropy of source code perform significantly better under the context of classification and ranking - with and without effort consideration. The previous defect metrics and other single metric approaches (like lines of code) perform worst. The ranking of the approaches is not changed by imbalanced datasets. We suggest using the process metrics, the churn of source code, and entropy of source code metrics as predictors in future defect prediction studies and taking care while using the single metric approaches as predictors. Moreover, different evaluation scenarios generate different ordering of approaches in within-project and cross-project contexts. Therefore, we conclude that each problem context has distinct characteristics, and conclusions of within-project studies should not be generalized to cross-project context and vice versa.
引用
收藏
页码:917 / 946
页数:30
相关论文
共 50 条
  • [41] A Cluster Based Feature Selection Method for Cross-Project Software Defect Prediction
    Ni, Chao
    Liu, Wang-Shu
    Chen, Xiang
    Gu, Qing
    Chen, Dao-Xu
    Huang, Qi-Guo
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2017, 32 (06) : 1090 - 1107
  • [42] Cross-project smell-based defect prediction
    Bruno Sotto-Mayor
    Meir Kalech
    Soft Computing, 2021, 25 : 14171 - 14181
  • [43] An Improved Method for Training Data Selection for Cross-Project Defect Prediction
    Nayeem Ahmad Bhat
    Sheikh Umar Farooq
    Arabian Journal for Science and Engineering, 2022, 47 : 1939 - 1954
  • [44] Cross-version defect prediction: use historical data, cross-project data, or both?
    Amasaki, Sousuke
    EMPIRICAL SOFTWARE ENGINEERING, 2020, 25 (02) : 1573 - 1595
  • [45] Impact of hyper parameter optimization for cross-project software defect prediction
    Qu Y.
    Chen X.
    Zhao Y.
    Ju X.
    International Journal of Performability Engineering, 2018, 14 (06): : 1291 - 1299
  • [46] Cross-Project Defect Prediction Method Based on Manifold Feature Transformation
    Zhao, Yu
    Zhu, Yi
    Yu, Qiao
    Chen, Xiaoying
    FUTURE INTERNET, 2021, 13 (08)
  • [47] Cross-Project Defect Prediction Based on Domain Adaptation and LSTM Optimization
    Javed, Khadija
    Ren, Shengbing
    Asim, Muhammad
    Wani, Mudasir Ahmad
    ALGORITHMS, 2024, 17 (05)
  • [48] An Empirical Study of Training Data Selection Methods for Ranking-Oriented Cross-Project Defect Prediction
    Luo, Haoyu
    Dai, Heng
    Peng, Weiqiang
    Hu, Wenhua
    Li, Fuyang
    SENSORS, 2021, 21 (22)
  • [49] Adversarial Learning for Cross-Project Semi-Supervised Defect Prediction
    Sun, Ying
    Jing, Xiao-Yuan
    Wu, Fei
    Li, Juanjuan
    Xing, Danlei
    Chen, Haowen
    Sun, Yanfei
    IEEE ACCESS, 2020, 8 : 32674 - 32687
  • [50] HDA: Cross-Project Defect Prediction via Heterogeneous Domain Adaptation With Dictionary Learning
    Xu, Zhou
    Yuan, Peipei
    Zhang, Tao
    Tang, Yutian
    Li, Shuai
    Xia, Zhen
    IEEE ACCESS, 2018, 6 : 57597 - 57613