An empirical evaluation of defect prediction approaches in within-project and cross-project context

被引:5
|
作者
Bhat, Nayeem Ahmad [1 ]
Farooq, Sheikh Umar [1 ]
机构
[1] Univ Kashmir, Dept Comp Sci, North Campus, Srinagar, J&K, India
关键词
Cross-project defect prediction; Software quality assurance; Source code metrics; Process metrics; Churn of source code; Feature selection; Imbalance learning; STATIC CODE ATTRIBUTES; BORDERLINE-SMOTE; SOFTWARE; METRICS; FAULTS; VALIDATION; MODELS;
D O I
10.1007/s11219-023-09615-7
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The software defect prediction approaches are evaluated, in within-project context only, with only a few other approaches, according to distinct scenarios and performance indicators. So, we conduct various experiments to evaluate well-known defect prediction approaches using different performance indicators. The evaluations are performed in the scenario of ranking the entities - with and without considering the effort to review the entities and classifying entities in within-project as well as cross-project contexts. The effect of imbalanced datasets on the ranking of the approaches is also evaluated. Our results indicate that in within-project as well as cross-project context, process metrics, the churn of source code, and entropy of source code perform significantly better under the context of classification and ranking - with and without effort consideration. The previous defect metrics and other single metric approaches (like lines of code) perform worst. The ranking of the approaches is not changed by imbalanced datasets. We suggest using the process metrics, the churn of source code, and entropy of source code metrics as predictors in future defect prediction studies and taking care while using the single metric approaches as predictors. Moreover, different evaluation scenarios generate different ordering of approaches in within-project and cross-project contexts. Therefore, we conclude that each problem context has distinct characteristics, and conclusions of within-project studies should not be generalized to cross-project context and vice versa.
引用
收藏
页码:917 / 946
页数:30
相关论文
共 50 条
  • [31] Comparing Hyperparameter Optimization in Cross- and Within-Project Defect Prediction: A Case Study
    Ozturk, Muhammed Maruf
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2019, 44 (04) : 3515 - 3530
  • [32] DeepCPDP: Deep Learning Based Cross-Project Defect Prediction
    Chen, Deyu
    Chen, Xiang
    Li, Hao
    Xie, Junfeng
    Mu, Yanzhou
    IEEE ACCESS, 2019, 7 : 184832 - 184848
  • [33] WIFLF: An approach independent of the target project for cross-project defect prediction
    Cui, Can
    Liu, Bin
    Wang, Shihai
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2022, 34 (12)
  • [34] Addressing Noise and Class Imbalance Problems in Heterogeneous Cross-Project Defect Prediction: An Empirical Study
    Vashisht, Rohit
    Rizvi, Syed Afzal Murtaza
    INTERNATIONAL JOURNAL OF E-COLLABORATION, 2023, 19 (01)
  • [35] A Survey on Cross-Project Software Defect Prediction Methods
    Chen X.
    Wang L.-P.
    Gu Q.
    Wang Z.
    Ni C.
    Liu W.-S.
    Wang Q.-P.
    2018, Science Press (41): : 254 - 274
  • [36] Correlation Metric Selection based Correlation Alignment for Cross-project Defect Prediction
    Niu, Jingwen
    Li, Zhiqiang
    Qi, Chao
    20TH INT CONF ON UBIQUITOUS COMP AND COMMUNICAT (IUCC) / 20TH INT CONF ON COMP AND INFORMATION TECHNOLOGY (CIT) / 4TH INT CONF ON DATA SCIENCE AND COMPUTATIONAL INTELLIGENCE (DSCI) / 11TH INT CONF ON SMART COMPUTING, NETWORKING, AND SERV (SMARTCNS), 2021, : 490 - 495
  • [37] Cross-project Defect Prediction Using a Connectivity-based Unsupervised Classifier
    Zhang, Feng
    Zheng, Quan
    Zou, Ying
    Hassan, Ahmed E.
    2016 IEEE/ACM 38TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2016, : 309 - 320
  • [38] DSSDPP: Data Selection and Sampling Based Domain Programming Predictor for Cross-Project Defect Prediction
    Li, Zhiqiang
    Zhang, Hongyu
    Jing, Xiao-Yuan
    Xie, Juanying
    Guo, Min
    Ren, Jie
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (04) : 1941 - 1963
  • [39] Adversarial domain adaptation for cross-project defect prediction
    Hengjie Song
    Guobin Wu
    Le Ma
    Yufei Pan
    Qingan Huang
    Siyu Jiang
    Empirical Software Engineering, 2023, 28
  • [40] Source selection and transfer defect learning based cross-project defect prediction
    Wen, Wanzhi
    Zhu, Ningbo
    Ye, Bingqing
    Li, Xikai
    Wang, Chuyue
    Chu, Jiawei
    Li, Yuehua
    INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS, 2022, 16 (03) : 195 - 207