Studying the effectiveness of deep active learning in software defect prediction

Cited by: 2
Authors
Feyzi F. [1 ]
Daneshdoost A. [1 ]
Affiliation
[1] Faculty of Engineering, University of Guilan, Rasht
Keywords
active learning; bug prediction; code metrics; deep learning
DOI
10.1080/1206212X.2023.2252117
Abstract
Accurate prediction of defective software modules is important for prioritizing quality assurance efforts, allocating testing resources sensibly, reducing costs, and improving software quality. Several studies have used machine learning to predict software defects. However, complex structures and imbalanced class distributions in software defect data make it challenging to learn an effective defect prediction model. In this article, two deep learning-based defect prediction models using static code metrics are proposed. To enhance the learning process and improve the performance of the proposed models, pool-based active learning is employed. In this regard, the article investigates whether active learning can reduce the amount of labeled data needed to build deep learning models. To deal with the imbalanced distribution of software modules between the defective and non-defective classes, Near-Miss under-sampling and KNN, with different numbers of neighbors, are used. They were chosen for their good performance in binary classification problems. Experiments are performed on two well-known, publicly available datasets: the GitHub Bug Dataset and the public Unified Bug Dataset for Java projects. The evaluation results show that the proposed models are effective in comparison with traditional machine learning algorithms. On the Unified Bug Dataset, the F-measure and AUC improved by 13 and 11 percent, respectively, at the file level, and by 14 and 11 percent, respectively, at the class level. © 2023 Informa UK Limited, trading as Taylor & Francis Group.
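The pool-based active learning idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it swaps their deep models for a tiny logistic regression, assumes uncertainty sampling as the query strategy (the abstract does not name one), and uses synthetic two-feature data in place of real code metrics. All function names, parameters, and data here are hypothetical.

```python
import math
import random

def predict_proba(w, b, x):
    """Predicted probability that a module is defective (sigmoid of a linear score)."""
    z = sum(wj * xj for wj, xj in zip(w, x)) + b
    z = max(-30.0, min(30.0, z))  # clamp to keep exp() well-behaved
    return 1.0 / (1.0 + math.exp(-z))

def train_logistic(X, y, epochs=200, lr=0.5):
    """Fit a small logistic regression with batch gradient descent.
    Stand-in for the deep models in the article (assumption)."""
    w, b, n = [0.0] * len(X[0]), 0.0, len(X)
    for _ in range(epochs):
        errs = [predict_proba(w, b, xi) - yi for xi, yi in zip(X, y)]
        w = [wj - lr * sum(e * xi[j] for e, xi in zip(errs, X)) / n
             for j, wj in enumerate(w)]
        b -= lr * sum(errs) / n
    return w, b

def active_learning(pool, oracle, seed_idx, rounds=5, batch=5):
    """Pool-based loop: train, query the most uncertain unlabeled items, repeat."""
    labeled = list(seed_idx)
    for _ in range(rounds):
        w, b = train_logistic([pool[i] for i in labeled],
                              [oracle(i) for i in labeled])
        known = set(labeled)
        unlabeled = [i for i in range(len(pool)) if i not in known]
        # Uncertainty sampling (assumed strategy): probability closest to 0.5 first.
        unlabeled.sort(key=lambda i: abs(predict_proba(w, b, pool[i]) - 0.5))
        labeled.extend(unlabeled[:batch])
    # Final model trained on everything labeled so far.
    w, b = train_logistic([pool[i] for i in labeled],
                          [oracle(i) for i in labeled])
    return labeled, w, b

# Synthetic pool: two normalized "metrics" per module (hypothetical data).
rng = random.Random(0)
pool = [[rng.random(), rng.random()] for _ in range(200)]
hidden = [1 if x[0] + x[1] > 1.0 else 0 for x in pool]  # labels the oracle reveals
labeled, w, b = active_learning(pool, hidden.__getitem__, seed_idx=range(10))
```

Only 35 of the 200 pool items are ever labeled in this run (10 seed items plus 5 queries in each of 5 rounds), which is the sense in which active learning aims to reduce labeling effort.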
Pages: 534-552
Number of pages: 18