Cross-project defect prediction based on G-LSTM model

Cited by: 10
Authors
Xing, Ying [1]
Qian, Xiaomeng [2]
Guan, Yu [3]
Yang, Bin [3]
Zhang, Yuwei [4]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, 10 Xitucheng Rd, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Modern Post, 10 Xitucheng Rd, Beijing 100876, Peoples R China
[3] Du Xiaoman Sci Technol Co Ltd, 10 Xitucheng Rd, Beijing 100085, Peoples R China
[4] Peking Univ, Sch Comp Sci, 5 Summer Palace Rd, Beijing 100871, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Computational language processing; Cross-project defect prediction; Long-term and short-term memory neural network; Continuous bag-of-word model; Generative adversarial network;
DOI
10.1016/j.patrec.2022.04.039
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Cross-project defect prediction (CPDP) is an active research direction in software reliability. Traditional CPDP methods rely on handcrafted features, which cannot capture the semantic and contextual information of programs, and this limits prediction performance. In this paper, we apply techniques from natural language processing (NLP) to address this limitation. We first extract token vectors from the abstract syntax trees (ASTs) of source and target code files, then convert them into numerical vectors with the continuous bag-of-words (CBOW) word embedding algorithm. These vectors are the input to the proposed deep learning model, Generative Adversarial Long-Short Term Memory Neural Networks (G-LSTM). The model integrates a generative adversarial network (GAN) and bidirectional long short-term memory networks (BiLSTM) with an attention mechanism to automatically learn semantic and contextual features of programs. Specifically, the GAN is used to eliminate the differences in data distribution between source and target projects, and the BiLSTM serves as the feature extraction encoder. We combine five projects from the PROMISE dataset into 20 source-target project pairs and conduct comparison experiments on them. The experimental results demonstrate that our method outperforms some traditional and state-of-the-art CPDP methods in terms of AUC and accuracy (Acc). © 2022 Published by Elsevier B.V.
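The experimental setup described in the abstract can be illustrated with a small sketch. It is not the authors' code: the project names are placeholders (the abstract does not list the five PROMISE projects), the labels and scores are made-up toy values, and the 0.5 decision threshold is an assumption. The sketch only shows why five projects give 20 ordered source-target pairs (5 x 4) and how AUC and accuracy are commonly computed for one pair.

```python
from itertools import permutations

# Placeholder names: the abstract only says "five projects of the PROMISE dataset".
PROJECTS = ["P1", "P2", "P3", "P4", "P5"]


def source_target_pairs(projects):
    """Return every ordered (source, target) pair with source != target."""
    return list(permutations(projects, 2))


def accuracy(labels, scores, threshold=0.5):
    """Fraction of files whose thresholded score matches the true label.

    The 0.5 threshold is an assumption made for this illustration.
    """
    predictions = [1 if s >= threshold else 0 for s in scores]
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


def auc(labels, scores):
    """AUC as the probability that a randomly chosen defective file
    receives a higher score than a randomly chosen clean file.
    Tied scores count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one defective and one clean file")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


pairs = source_target_pairs(PROJECTS)

# Toy ground truth (1 = defective) and predicted scores for one target project.
labels = [1, 0, 1, 0, 0]
scores = [0.9, 0.4, 0.35, 0.2, 0.6]

if __name__ == "__main__":
    print(len(pairs), "source-target pairs")
    print("AUC:", round(auc(labels, scores), 3))
    print("Acc:", accuracy(labels, scores))
```

The pairs are ordered because training on one project and testing on another is a different experiment from the reverse direction, which is why the count is 20 rather than 10.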
Pages: 50-57
Page count: 8