A Cross-Project Defect Prediction Approach Based on Code Semantics and Cross-Version Structural Information

被引:0
|
作者
Zou, Yifan [1 ]
Wang, Huiqiang [1 ]
Lv, Hongwu [1 ]
Zhao, Shuai [1 ]
Tian, Haoye [2 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin 150000, Peoples R China
[2] Univ Melbourne, Grattan St, Parkville, Vic 3052, Australia
基金
中国国家自然科学基金;
关键词
Cross-project defect prediction; deep learning; abstract syntax tree; class dependency network; FRAMEWORK;
D O I
10.1142/S0218194024500165
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Context: Cross-project defect prediction (CPDP), due to the potential of adaption by industry in realistic scenarios, had gained significant attention from the research community. Currently, existing CPDP studies use static statistical features designed by experts, which might not capture the semantic and structural aspects of software, resulting in low accuracy in defect prediction. Meanwhile, they tend to overlook the valuable iterative information brought about by version updates in mature software projects.Objective: This paper introduces DETECTOR, a novel CPDP approach based on coDE semanTic and cross-vErsion struCTural infORmation to leverage cross-versions features of the software and improve the performance of CPDP.Methods: DETECTOR parses source code to exploit Abstract Syntax Trees (ASTs) and cross-version software network (Cross-SN) that consists of internal class dependency network and cross-version class dependency edges. It utilizes Attention-based Bi-LSTM and simplified graph convolutional neural networks to automatically extract software features from ASTs and Cross-SN. The extracted features are fused using gate(& sdot;) to generate more effective cross-version features. Finally the source project is selected to carry out the data used to train the classifier to predict the defects.Results: Empirical studies on seven open-source Java projects, the experiment results show that: (1) DETECTOR outperforms the state-of-the-art models in CPDP; (2) our proposed cross-version dependent edges positively contribute to DETECTOR performance; (3) gate(& sdot;) outperforms existing strategies in fusion features; (4) more multi-versions information enhance DETECTOR's performance.Conclusion: DETECTOR can predict more defects in CPDP and improve the accuracy and effectiveness of prediction.
引用
收藏
页码:1135 / 1171
页数:37
相关论文
共 50 条
  • [1] Cross-Version Defect Prediction using Cross-Project Defect Prediction Approaches: Does it work?
    Amasaki, Sousuke
    PROMISE'18: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON PREDICTIVE MODELS AND DATA ANALYTICS IN SOFTWARE ENGINEERING, 2018, : 32 - 41
  • [2] Cross-version defect prediction: use historical data, cross-project data, or both?
    Sousuke Amasaki
    Empirical Software Engineering, 2020, 25 : 1573 - 1595
  • [3] Cross-version defect prediction: use historical data, cross-project data, or both?
    Amasaki, Sousuke
    EMPIRICAL SOFTWARE ENGINEERING, 2020, 25 (02) : 1573 - 1595
  • [4] Local modeling approach for cross-project defect prediction
    Bhat, Nayeem Ahmad
    Farooq, Sheikh Umar
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2021, 15 (04): : 623 - 637
  • [5] Cross-Project Transfer Learning on Lightweight Code Semantic Graphs for Defect Prediction
    Fang, Dingbang
    Liu, Shaoying
    Li, Yang
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2023, 33 (07) : 1095 - 1117
  • [6] DeepCPDP: Deep Learning Based Cross-Project Defect Prediction
    Chen, Deyu
    Chen, Xiang
    Li, Hao
    Xie, Junfeng
    Mu, Yanzhou
    IEEE ACCESS, 2019, 7 : 184832 - 184848
  • [7] Cross-project smell-based defect prediction
    Bruno Sotto-Mayor
    Meir Kalech
    Soft Computing, 2021, 25 : 14171 - 14181
  • [8] Cross-project smell-based defect prediction
    Sotto-Mayor, Bruno
    Kalech, Meir
    SOFT COMPUTING, 2021, 25 (22) : 14171 - 14181
  • [9] Cross-Project Defect Prediction: A Literature Review
    Pal, Sourabh
    Sillitti, Alberto
    IEEE ACCESS, 2022, 10 : 118697 - 118717
  • [10] Graph-Based Feature Learning for Cross-Project Software Defect Prediction
    Abdu, Ahmed
    Zhai, Zhengjun
    Abdo, Hakim A.
    Algabri, Redhwan
    Lee, Sungon
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (01): : 161 - 180