A Cross-Project Defect Prediction Approach Based on Code Semantics and Cross-Version Structural Information

Citations: 0
Authors
Zou, Yifan [1]
Wang, Huiqiang [1]
Lv, Hongwu [1]
Zhao, Shuai [1]
Tian, Haoye [2]
Affiliations
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin 150000, Peoples R China
[2] Univ Melbourne, Grattan St, Parkville, Vic 3052, Australia
Funding
National Natural Science Foundation of China;
Keywords
Cross-project defect prediction; deep learning; abstract syntax tree; class dependency network; FRAMEWORK;
DOI
10.1142/S0218194024500165
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Context: Cross-project defect prediction (CPDP) has gained significant attention from the research community because of its potential for adoption by industry in realistic scenarios. Existing CPDP studies use static statistical features designed by experts, which might not capture the semantic and structural aspects of software, resulting in low defect prediction accuracy. They also tend to overlook the valuable iterative information that version updates bring to mature software projects.
Objective: This paper introduces DETECTOR, a novel CPDP approach based on coDE semanTic and cross-vErsion struCTural infORmation, which leverages cross-version features of the software to improve the performance of CPDP.
Methods: DETECTOR parses source code to obtain Abstract Syntax Trees (ASTs) and a cross-version software network (Cross-SN), which consists of the internal class dependency networks and cross-version class dependency edges. It uses an attention-based Bi-LSTM and simplified graph convolutional neural networks to automatically extract software features from the ASTs and the Cross-SN. The extracted features are fused using gate(·) to generate more effective cross-version features. Finally, a source project is selected to provide the data used to train the classifier that predicts defects.
Results: Empirical studies on seven open-source Java projects show that: (1) DETECTOR outperforms the state-of-the-art models in CPDP; (2) the proposed cross-version dependency edges contribute positively to DETECTOR's performance; (3) gate(·) outperforms existing strategies for feature fusion; (4) more multi-version information enhances DETECTOR's performance.
Conclusion: DETECTOR can predict more defects in CPDP and improve the accuracy and effectiveness of prediction.
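The abstract does not give the details of the Cross-SN or of the graph network, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes the standard simplified graph convolution propagation rule, S = D^{-1/2}(A + I)D^{-1/2} applied K times to the node features, and it assumes that dependency edges are treated as undirected. The toy graph, the feature values, and all function names (`normalized_adjacency`, `sgc_propagate`) are hypothetical and chosen for illustration.

```python
import math


def normalized_adjacency(num_nodes, edges):
    """Return S = D^{-1/2} (A + I) D^{-1/2} as a dense list-of-lists matrix."""
    a = [[0.0] * num_nodes for _ in range(num_nodes)]
    for i in range(num_nodes):
        a[i][i] = 1.0  # self-loop, so each node keeps part of its own feature
    for u, v in edges:
        a[u][v] = 1.0
        a[v][u] = 1.0  # assumption: dependencies are treated as undirected
    deg = [sum(row) for row in a]
    return [
        [a[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(num_nodes)]
        for i in range(num_nodes)
    ]


def matmul(m, x):
    """Multiply matrix m (n x n) by feature matrix x (n x d)."""
    return [
        [sum(m[i][k] * x[k][j] for k in range(len(x))) for j in range(len(x[0]))]
        for i in range(len(m))
    ]


def sgc_propagate(s, x, k):
    """Apply k rounds of feature propagation: returns S^k x."""
    for _ in range(k):
        x = matmul(s, x)
    return x


# Hypothetical toy Cross-SN: nodes 0 and 1 are classes A and B in version 1,
# nodes 2 and 3 are the same two classes in version 2.
intra_edges = [(0, 1), (2, 3)]   # class dependencies inside each version
cross_edges = [(0, 2), (1, 3)]   # edges linking a class to its next version
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]  # made-up values

s = normalized_adjacency(4, intra_edges + cross_edges)
smoothed = sgc_propagate(s, features, k=2)
```

In this sketch the cross-version edges let a class in the newer version receive information from its earlier counterpart during propagation; a real pipeline would feed the propagated features into a trained classifier.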
Pages: 1135-1171
Page count: 37