Cross-Project and Within-Project Semisupervised Software Defect Prediction: A Unified Approach

被引:109
|
作者
Wu, Fei [1 ]
Jing, Xiao-Yuan [1 ,2 ]
Sun, Ying [1 ]
Sun, Jing [1 ]
Huang, Lin [1 ]
Cui, Fangyi [1 ]
Sun, Yanfei [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210003, Jiangsu, Peoples R China
[2] Wuhan Univ, Sch Comp, State Key Lab Software Engn, Wuhan 430072, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Cost-sensitive kernelized semisupervised dictionary learning (CKSDL); cross-project semisupervised defect prediction (CSDP); within-project semisupervised defect prediction (WSDP); NETWORKS; MACHINE;
D O I
10.1109/TR.2018.2804922
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
When there exist not enough historical defect data for building an accurate prediction model, semisupervised defect prediction (SSDP) and cross-project defect prediction (CPDP) are two feasible solutions. Existing CPDP methods assume that the available source data are well labeled. However, due to expensive human efforts for labeling a large amount of defect data, usually, we can only utilize the suitable unlabeled source data. We call CPDP in this scenario as cross-project semisupervised defect prediction (CSDP). Although some within-project semisupervised defect prediction (WSDP) methods have been developed in recent years, there still exists much room for improvement on prediction performance. In this paper, we aim to provide a unified and effective solution for both CSDP and WSDP problems. We introduce the semisupervised dictionary learning technique and propose a cost-sensitive kernelized semisupervised dictionary learning (CKSDL) approach. CKSDL can make full use of the limited labeled defect data and a large amount of unlabeled data in the kernel space. In addition, CKSDL considers the misclassification costs in the dictionary learning process. Extensive experiments on 16 projects indicate that CKSDL outperforms state-of-the-art WSDP methods, using unlabeled cross-project defect data can help improve the WSDP performance, and CKSDL generally obtains significantly better prediction performance than related SSDP methods in the CSDP scenario.
引用
收藏
页码:581 / 597
页数:17
相关论文
共 22 条
  • [1] Tsbagging: A Novel Cross-Project Software Defect Prediction Algorithm Based on Semisupervised Clustering
    Tang, Shiqi
    Huang, Song
    Liu, ErHu
    Yao, YongMing
    Wu, KaiShun
    Ji, Haijin
    SCIENTIFIC PROGRAMMING, 2022, 2022
  • [2] An Improved SDA Based Defect Prediction Framework for Both Within-Project and Cross-Project Class-Imbalance Problems
    Jing, Xiao-Yuan
    Wu, Fei
    Dong, Xiwei
    Xu, Baowen
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2017, 43 (04) : 321 - 339
  • [3] A Hybrid Multiple Models Transfer Approach for Cross-Project Software Defect Prediction
    Zhang, Shenggang
    Jiang, Shujuan
    Yan, Yue
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2023, 33 (03) : 343 - 374
  • [4] Cross-Project Online Just-In-Time Software Defect Prediction
    Tabassum, Sadia
    Minku, Leandro L.
    Feng, Danyi
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (01) : 268 - 287
  • [5] Adversarial domain adaptation for cross-project defect prediction
    Song, Hengjie
    Wu, Guobin
    Ma, Le
    Pan, Yufei
    Huang, Qingan
    Jiang, Siyu
    EMPIRICAL SOFTWARE ENGINEERING, 2023, 28 (05)
  • [6] A Survey on Transfer Learning for Cross-Project Defect Prediction
    Sotto-Mayor, Bruno
    Kalech, Meir
    IEEE ACCESS, 2024, 12 : 93398 - 93425
  • [7] An Investigation of Cross-Project Learning in Online Just-In-Time Software Defect Prediction
    Tabassum, Sadia
    Minku, Leandro L.
    Feng, Danyi
    Cabral, George G.
    Song, Liyan
    2020 ACM/IEEE 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2020), 2020, : 554 - 565
  • [8] Heterogeneous Cross-Project Defect Prediction via Optimal Transport
    Zong, Xing
    Li, Guiyu
    Zheng, Shang
    Zou, Haitao
    Yu, Hualong
    Gao, Shang
    IEEE ACCESS, 2023, 11 : 12015 - 12030
  • [9] Transfer Convolutional Neural Network for Cross-Project Defect Prediction
    Qiu, Shaojian
    Xu, Hao
    Deng, Jiehan
    Jiang, Siyu
    Lu, Lu
    APPLIED SCIENCES-BASEL, 2019, 9 (13):
  • [10] Cross-project defect prediction via semantic and syntactic encoding
    Jiang, Siyu
    Chen, Yuwen
    He, Zhenhang
    Shang, Yunpeng
    Ma, Le
    EMPIRICAL SOFTWARE ENGINEERING, 2024, 29 (04)