An empirical evaluation of defect prediction approaches in within-project and cross-project context

被引:5
|
作者
Bhat, Nayeem Ahmad [1 ]
Farooq, Sheikh Umar [1 ]
机构
[1] Univ Kashmir, Dept Comp Sci, North Campus, Srinagar, J&K, India
关键词
Cross-project defect prediction; Software quality assurance; Source code metrics; Process metrics; Churn of source code; Feature selection; Imbalance learning; STATIC CODE ATTRIBUTES; BORDERLINE-SMOTE; SOFTWARE; METRICS; FAULTS; VALIDATION; MODELS;
D O I
10.1007/s11219-023-09615-7
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The software defect prediction approaches are evaluated, in within-project context only, with only a few other approaches, according to distinct scenarios and performance indicators. So, we conduct various experiments to evaluate well-known defect prediction approaches using different performance indicators. The evaluations are performed in the scenario of ranking the entities - with and without considering the effort to review the entities and classifying entities in within-project as well as cross-project contexts. The effect of imbalanced datasets on the ranking of the approaches is also evaluated. Our results indicate that in within-project as well as cross-project context, process metrics, the churn of source code, and entropy of source code perform significantly better under the context of classification and ranking - with and without effort consideration. The previous defect metrics and other single metric approaches (like lines of code) perform worst. The ranking of the approaches is not changed by imbalanced datasets. We suggest using the process metrics, the churn of source code, and entropy of source code metrics as predictors in future defect prediction studies and taking care while using the single metric approaches as predictors. Moreover, different evaluation scenarios generate different ordering of approaches in within-project and cross-project contexts. Therefore, we conclude that each problem context has distinct characteristics, and conclusions of within-project studies should not be generalized to cross-project context and vice versa.
引用
收藏
页码:917 / 946
页数:30
相关论文
共 50 条
  • [21] Understanding the Automated Parameter Optimization on Transfer Learning for Cross-Project Defect Prediction: An Empirical Study
    Li, Ke
    Xiang, Zilin
    Chen, Tao
    Wang, Shuo
    Tan, Kay Chen
    2020 ACM/IEEE 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2020), 2020, : 566 - 577
  • [22] Adversarial domain adaptation for cross-project defect prediction
    Song, Hengjie
    Wu, Guobin
    Ma, Le
    Pan, Yufei
    Huang, Qingan
    Jiang, Siyu
    EMPIRICAL SOFTWARE ENGINEERING, 2023, 28 (05)
  • [23] Towards Better Effort Estimation with Cross-Project Defect Prediction Approaches
    Amasaki, Sousuke
    Yokogawa, Tomoyuki
    Aman, Hirohisa
    PROCEEDINGS OF EASE 2019 - EVALUATION AND ASSESSMENT IN SOFTWARE ENGINEERING, 2019, : 357 - 360
  • [24] Cross-project smell-based defect prediction
    Sotto-Mayor, Bruno
    Kalech, Meir
    SOFT COMPUTING, 2021, 25 (22) : 14171 - 14181
  • [25] An Empirical Study on Combining Source Selection and Transfer Learning for Cross-Project Defect Prediction
    Wen, Wanzhi
    Zhang, Bin
    Gu, Xiang
    Ju, Xiaolin
    2019 IEEE 1ST INTERNATIONAL WORKSHOP ON INTELLIGENT BUG FIXING (IBF '19), 2019, : 29 - 38
  • [26] Which is More Important for Cross-Project Defect Prediction: Instance or Feature?
    Yu, Qiao
    Jiang, Shujuan
    Qian, Junyan
    2016 INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, TESTING AND EVOLUTION (SATE 2016), 2016, : 90 - 95
  • [27] Evaluating Data Filter on Cross-Project Defect Prediction: Comparison and Improvements
    Li, Yong
    Huang, Zhiqiu
    Wang, Yong
    Fang, Bingwu
    IEEE ACCESS, 2017, 5 : 25646 - 25656
  • [28] Comparing Hyperparameter Optimization in Cross- and Within-Project Defect Prediction: A Case Study
    Muhammed Maruf Öztürk
    Arabian Journal for Science and Engineering, 2019, 44 : 3515 - 3530
  • [29] An Improved CNN Model for Within-Project Software Defect Prediction
    Pan, Cong
    Lu, Minyan
    Xu, Biao
    Gao, Houleng
    APPLIED SCIENCES-BASEL, 2019, 9 (10):
  • [30] An Empirical Study on Multi-Source Cross-Project Defect Prediction Models
    Liu, Xuanying
    Li, Zonghao
    Zou, Jiaqi
    Tong, Haonan
    2022 29TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE, APSEC, 2022, : 318 - 327