Application of Deep Learning in Software Defect Prediction: Systematic Literature Review and Meta-analysis

被引:20
|
作者
Zain, Zuhaira Muhammad [1 ]
Sakri, Sapiah [1 ]
Ismail, Nurul Halimatul Asmak [2 ]
机构
[1] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Informat Syst Dept, Riyadh, Saudi Arabia
[2] Princess Nourah Bint Abdulrahman Univ, Appl Coll, Dept Comp Sci & Informat Technol, Riyadh, Saudi Arabia
关键词
Deep Learning; Software Defect Prediction; Systematic Literature Review; Meta-Analysis; MEAN SQUARED ERROR; QUALITY;
D O I
10.1016/j.infsof.2023.107175
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Context: Despite recent attention given to Software Defect Prediction (SDP), the lack of any systematic effort to assess existing empirical evidence on the application of Deep Learning (DL) in SDP indicates that it is still relatively under-researched.Objective: To synthesize literature on SDP using DL, pertaining to measurements, models, techniques, datasets, and achievements; to obtain a full understanding of current SDP-related methodologies using DL; and to compare the DL models' performances with those of Machine Learning (ML) models in classifying software defects.Method: We completed a thorough review of the literature in this domain. To answer the research issues, results from primary investigations were synthesized. The preliminary findings for DL vs. ML in SDP were verified by using meta-analysis (MA).Result: We discovered 63 primary studies that passed the systematic literature review quality evaluation. However, only 19 primary studies passed the MA quality evaluation. The five most popular performance mea-surements employed in SDP were f-measure, recall, accuracy, precision, and Area Under the Curve (AUC). The top five DL techniques used in building SDP models were Convolutional Neural Network (CNN), Deep Neural Network (DNN), Long Short-Term Memory (LSTM), Deep Belief Network (DBN), and Stacked Denoising Autoencoder (SDAE). PROMISE and NASA datasets were found to be used more frequently to train and test DL models in SDP. The MA results show that DL was favored over ML in terms of study and dataset across accuracy, f-measure, and AUC.Conclusion: The application of DL in SDP remains a challenge, but it has the potential to achieve better predictive performance when the performance-influencing parameters are optimized. We provide a reference point for future research which could be used to improve research quality in this domain.
引用
收藏
页数:28
相关论文
共 50 条
  • [41] Software Defect Prediction using Deep Learning
    Nevendra, Meetesh
    Singh, Pradeep
    ACTA POLYTECHNICA HUNGARICA, 2021, 18 (10) : 173 - 189
  • [42] Class Balancing Approaches in Dataset for Software Defect Prediction: A Systematic Literature Review
    Olvera-Villeda, Dan Javier
    Sanchez-Garcia, Angel J.
    Limon, Xavier
    Dominguez Isidro, Saul
    2023 11TH INTERNATIONAL CONFERENCE IN SOFTWARE ENGINEERING RESEARCH AND INNOVATION, CONISOFT 2023, 2023, : 236 - 245
  • [43] Diagnostic Accuracy of Deep Learning and Radiomics in Lung Cancer Staging: A Systematic Review and Meta-Analysis
    Zheng, Xiushan
    He, Bo
    Hu, Yunhai
    Ren, Min
    Chen, Zhiyuan
    Zhang, Zhiguang
    Ma, Jun
    Ouyang, Lanwei
    Chu, Hongmei
    Gao, Huan
    He, Wenjing
    Liu, Tianhu
    Li, Gang
    FRONTIERS IN PUBLIC HEALTH, 2022, 10 : 938113
  • [44] Deep Learning for Quantified Gait Analysis: A Systematic Literature Review
    Khan, Adil
    Galarraga, Omar
    Garcia-Salicetti, Sonia
    Vigneron, Vincent
    IEEE ACCESS, 2024, 12 : 138932 - 138957
  • [45] Diagnostic accuracy of deep learning using speech samples in depression: a systematic review and meta-analysis
    Liu, Lidan
    Liu, Lu
    Wafa, Hatem A.
    Tydeman, Florence
    Xie, Wanqing
    Wang, Yanzhong
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (10) : 2394 - 2404
  • [46] Diagnostic accuracy of deep learning-based algorithms in laryngoscopy: a systematic review and meta-analysis
    Du, Shengyi
    Guo, Jin
    Huang, Donghai
    Liu, Yong
    Zhang, Xin
    Lu, Shanhong
    EUROPEAN ARCHIVES OF OTO-RHINO-LARYNGOLOGY, 2025, 282 (01) : 351 - 360
  • [47] Deep Learning for Software Defect Prediction in time
    Yadav, Monika
    Singh, Vijendra
    Rastogi, Priyanka
    2018 FIFTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (IEEE PDGC), 2018, : 7 - 12
  • [48] Machine learning-based models for prediction of survival in medulloblastoma: a systematic review and meta-analysis
    Hajikarimloo, Bardia
    Habibi, Mohammad Amin
    Alvani, Mohammadamin Sabbagh
    Meinagh, Sima Osouli
    Kooshki, Alireza
    Afkhami-Ardakani, Omid
    Rasouli, Fatemeh
    Tos, Salem M.
    Tavanaei, Roozbeh
    Akhlaghpasand, Mohammadhosein
    Hashemi, Rana
    Hasanzade, Arman
    NEUROLOGICAL SCIENCES, 2025, 46 (02) : 689 - 696
  • [49] Facial expression deep learning algorithms in the detection of neurological disorders: a systematic review and meta-analysis
    Shania Yoonesi
    Ramila Abedi Azar
    Melika Arab Bafrani
    Shayan Yaghmayee
    Haniye Shahavand
    Majid Mirmazloumi
    Narges Moazeni Limoudehi
    Mohammadreza Rahmani
    Saina Hasany
    Fatemeh Zahra Idjadi
    Mohammad Amin Aalipour
    Hossein Gharedaghi
    Sadaf Salehi
    Mahsa Asadi Anar
    Mohammad Saeed Soleimani
    BioMedical Engineering OnLine, 24 (1)
  • [50] Exploring the effectiveness of artificial intelligence, machine learning and deep learning in trauma triage: A systematic review and meta-analysis
    Adebayo, Oluwasemilore
    Bhuiyan, Zunira Areeba
    Ahmed, Zubair
    DIGITAL HEALTH, 2023, 9