Cross-project defect prediction based on G-LSTM model

被引:10
|
作者
Xing, Ying [1 ]
Qian, Xiaomeng [2 ]
Guan, Yu [3 ]
Yang, Bin [3 ]
Zhang, Yuwei [4 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, 10 Xitucheng Rd, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Modern Post, 10 Xitucheng Rd, Beijing 100876, Peoples R China
[3] Du Xiaoman Sci Technol Co Ltd, 10 Xitucheng Rd, Beijing 100085, Peoples R China
[4] Peking Univ, Sch Comp Sci, 5 Summer Palace Rd, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
Computational language processing; Cross-project defect prediction; Long-term and short-term memory neural network; Continuous bag-of-word model; Generative adversarial network;
D O I
10.1016/j.patrec.2022.04.039
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-project defect prediction (CPDP) is currently a hot research direction in the field of software reliability. Traditional CPDP methods cannot capture the semantic and contextual information of programs by handcrafted features, which affects the prediction performance. In this paper, we apply technology in the NLP domain to solve it. We first extract token vectors from the abstract syntax tree (AST) of source and target code files, and then convert them into numerical vectors by the word embedding algorithm of continuous bag-of-word model (CBOW) as the input of the proposed deep learning model named Generative Adversarial Long-Short Term Memory Neural Networks (G-LSTM). The model integrates generative adversarial network (GAN) and bidirectional long-short term memory networks (BiLSTM) with attention mechanism to automatically learn semantic and contextual features of programs. Specifically, GAN is used to eliminate the differences in data distribution between source and target projects, and BiLSTM is the feature extraction encoder. We compose five projects of the PROMISE dataset into 20 source-target project pairs and conduct comparison experiments on them. The experimental results demonstrate that our method outperforms some traditional and state-of-the-art CPDP methods in terms of the evaluation metrics of AUC and Acc. (C) 2022 Published by Elsevier B.V.
引用
收藏
页码:50 / 57
页数:8
相关论文
共 50 条
  • [41] Cross-project Defect Prediction Using a Credibility Theory based Naive Bayes Classifier
    Poon, Wai Nam
    Bennin, Kwabena Ebo
    Huang, Jianglin
    Phannachitta, Passakorn
    Keung, Jacky Wai
    2017 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS), 2017, : 434 - 441
  • [42] Cross-Project Software Defect Prediction Based on SMOTE and Deep Canonical Correlation Analysis
    Fan, Xin
    Zhang, Shuqing
    Wu, Kaisheng
    Zheng, Wei
    Ge, Yu
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 78 (02): : 1687 - 1711
  • [43] An Improved Method for Training Data Selection for Cross-Project Defect Prediction
    Nayeem Ahmad Bhat
    Sheikh Umar Farooq
    Arabian Journal for Science and Engineering, 2022, 47 : 1939 - 1954
  • [44] Impact of hyper parameter optimization for cross-project software defect prediction
    Qu Y.
    Chen X.
    Zhao Y.
    Ju X.
    International Journal of Performability Engineering, 2018, 14 (06): : 1291 - 1299
  • [45] Assessing the Effect of Imbalanced Learning on Cross-project Software Defect Prediction
    Sohan, Md Fahimuzzman
    Jabiullah, Md Ismail
    Rahman, Sheikh Shah Mohammad Motiur
    Mahmud, S. M. Hasan
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [46] Which is More Important for Cross-Project Defect Prediction: Instance or Feature?
    Yu, Qiao
    Jiang, Shujuan
    Qian, Junyan
    2016 INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, TESTING AND EVOLUTION (SATE 2016), 2016, : 90 - 95
  • [47] Towards Better Effort Estimation with Cross-Project Defect Prediction Approaches
    Amasaki, Sousuke
    Yokogawa, Tomoyuki
    Aman, Hirohisa
    PROCEEDINGS OF EASE 2019 - EVALUATION AND ASSESSMENT IN SOFTWARE ENGINEERING, 2019, : 357 - 360
  • [48] An Abstract Syntax Tree Encoding Method for Cross-Project Defect Prediction
    Cai, Ziyi
    Lu, Lu
    Qiu, Shaojian
    IEEE ACCESS, 2019, 7 : 170844 - 170853
  • [49] Evaluating Data Filter on Cross-Project Defect Prediction: Comparison and Improvements
    Li, Yong
    Huang, Zhiqiu
    Wang, Yong
    Fang, Bingwu
    IEEE ACCESS, 2017, 5 : 25646 - 25656
  • [50] An Improved Method for Training Data Selection for Cross-Project Defect Prediction
    Bhat, Nayeem Ahmad
    Farooq, Sheikh Umar
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (02) : 1939 - 1954