Adversarial Learning for Cross-Project Semi-Supervised Defect Prediction

被引:12
|
作者
Sun, Ying [1 ]
Jing, Xiao-Yuan [1 ,2 ,3 ]
Wu, Fei [2 ]
Li, Juanjuan [2 ]
Xing, Danlei [1 ]
Chen, Haowen [3 ]
Sun, Yanfei [2 ]
机构
[1] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing 210023, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210023, Peoples R China
[3] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Data models; Predictive models; Machine learning; Correlation; Sun; Gallium nitride; Prediction algorithms; Cross-project defect prediction; adversarial learning; semi-supervised learning; NEURAL-NETWORKS; FRAMEWORK; MODELS; CODE;
D O I
10.1109/ACCESS.2020.2974527
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cross-project defect prediction (CPDP) aims to build a prediction model on existing source projects and predict the labels of target project. The data distribution difference between different projects makes CPDP very challenging. Besides, most existing CPDP methods usually require sufficient and labeled data. However, acquiring lots of labeled data for a new project is difficult while obtaining the unlabeled data is relatively easy. A desirable approach is building a prediction model on unlabeled data and labeled data. CPDP in this scenario is called cross-project semi-supervised defect prediction (CSDP). Recently, generative adversarial networks have achieved impressive results with these strong ability of learning data distribution and discriminative representation. For effectively learning the discriminative features of data from different projects, we propose a Discriminative Adversarial Feature Learning (DAFL) approach for CSDP. DAFL consists of feature transformer and project discriminator, which compete with each other. A feature transformer tries to generate feature representation, which learns the discriminant information and preserves intrinsic structure inferred from both labeled and unlabeled data. A project discriminator tries to discriminate source and target instances on the generated representation. Experiments on 16 projects show that DAFL performs significantly better than baselines.
引用
收藏
页码:32674 / 32687
页数:14
相关论文
共 50 条
  • [41] Label propagation based semi-supervised learning for software defect prediction
    Zhang, Zhi-Wu
    Jing, Xiao-Yuan
    Wang, Tie-Jian
    AUTOMATED SOFTWARE ENGINEERING, 2017, 24 (01) : 47 - 69
  • [42] Software Defect Prediction Using Semi-supervised Learning with Dimension Reduction
    Lu, Huihua
    Cukic, Bojan
    Culp, Mark
    2012 PROCEEDINGS OF THE 27TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE), 2012, : 314 - 317
  • [43] Label propagation based semi-supervised learning for software defect prediction
    Zhi-Wu Zhang
    Xiao-Yuan Jing
    Tie-Jian Wang
    Automated Software Engineering, 2017, 24 : 47 - 69
  • [44] Consistency and adversarial semi-supervised learning for medical image segmentation
    Tang, Yongqiang
    Wang, Shilei
    Qu, Yuxun
    Cui, Zhihua
    Zhang, Wensheng
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 161
  • [45] A Comparative Study to Benchmark Cross-Project Defect Prediction Approaches
    Herbold, Steffen
    Trautsch, Alexander
    Grabowski, Jens
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2018, 44 (09) : 811 - 833
  • [46] A Semi-Supervised Approach to Software Defect Prediction
    Lu, Huihua
    Cukic, Bojan
    Culp, Mark
    2014 IEEE 38TH ANNUAL INTERNATIONAL COMPUTERS, SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), 2014, : 416 - 425
  • [47] Cross-version defect prediction: use historical data, cross-project data, or both?
    Amasaki, Sousuke
    EMPIRICAL SOFTWARE ENGINEERING, 2020, 25 (02) : 1573 - 1595
  • [48] ALTRA: Cross-Project Software Defect Prediction via Active Learning and Tradaboost
    Yuan, Zhidan
    Chen, Xiang
    Cui, Zhanqi
    Mu, Yanzhou
    IEEE ACCESS, 2020, 8 : 30037 - 30049
  • [49] Semi-supervised Seizure Prediction with Generative Adversarial Networks
    Nhan Duy Truong
    Zhou, Luping
    Kavehei, Omid
    2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 2369 - 2372
  • [50] Cross-Project Transfer Learning on Lightweight Code Semantic Graphs for Defect Prediction
    Fang, Dingbang
    Liu, Shaoying
    Li, Yang
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2023, 33 (07) : 1095 - 1117