Application of Deep Learning in Software Defect Prediction: Systematic Literature Review and Meta-analysis

被引：20

作者：

Zain, Zuhaira Muhammad ^{[1
]}

Sakri, Sapiah ^{[1
]}

Ismail, Nurul Halimatul Asmak ^{[2
]}

机构：

[1] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Informat Syst Dept, Riyadh, Saudi Arabia

[2] Princess Nourah Bint Abdulrahman Univ, Appl Coll, Dept Comp Sci & Informat Technol, Riyadh, Saudi Arabia

来源：

INFORMATION AND SOFTWARE TECHNOLOGY | 2023年 / 158卷

关键词：

Deep Learning; Software Defect Prediction; Systematic Literature Review; Meta-Analysis; MEAN SQUARED ERROR; QUALITY;

D O I：

10.1016/j.infsof.2023.107175

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Context: Despite recent attention given to Software Defect Prediction (SDP), the lack of any systematic effort to assess existing empirical evidence on the application of Deep Learning (DL) in SDP indicates that it is still relatively under-researched.Objective: To synthesize literature on SDP using DL, pertaining to measurements, models, techniques, datasets, and achievements; to obtain a full understanding of current SDP-related methodologies using DL; and to compare the DL models' performances with those of Machine Learning (ML) models in classifying software defects.Method: We completed a thorough review of the literature in this domain. To answer the research issues, results from primary investigations were synthesized. The preliminary findings for DL vs. ML in SDP were verified by using meta-analysis (MA).Result: We discovered 63 primary studies that passed the systematic literature review quality evaluation. However, only 19 primary studies passed the MA quality evaluation. The five most popular performance mea-surements employed in SDP were f-measure, recall, accuracy, precision, and Area Under the Curve (AUC). The top five DL techniques used in building SDP models were Convolutional Neural Network (CNN), Deep Neural Network (DNN), Long Short-Term Memory (LSTM), Deep Belief Network (DBN), and Stacked Denoising Autoencoder (SDAE). PROMISE and NASA datasets were found to be used more frequently to train and test DL models in SDP. The MA results show that DL was favored over ML in terms of study and dataset across accuracy, f-measure, and AUC.Conclusion: The application of DL in SDP remains a challenge, but it has the potential to achieve better predictive performance when the performance-influencing parameters are optimized. We provide a reference point for future research which could be used to improve research quality in this domain.

引用

页数：28

共 50 条

[41] Software Defect Prediction using Deep Learning
Nevendra, Meetesh
Singh, Pradeep
ACTA POLYTECHNICA HUNGARICA, 2021, 18 (10) : 173 - 189
[42] Class Balancing Approaches in Dataset for Software Defect Prediction: A Systematic Literature Review
Olvera-Villeda, Dan Javier
Sanchez-Garcia, Angel J.
Limon, Xavier
Dominguez Isidro, Saul
2023 11TH INTERNATIONAL CONFERENCE IN SOFTWARE ENGINEERING RESEARCH AND INNOVATION, CONISOFT 2023, 2023, : 236 - 245
[43] Diagnostic Accuracy of Deep Learning and Radiomics in Lung Cancer Staging: A Systematic Review and Meta-Analysis
Zheng, Xiushan
He, Bo
Hu, Yunhai
Ren, Min
Chen, Zhiyuan
Zhang, Zhiguang
Ma, Jun
Ouyang, Lanwei
Chu, Hongmei
Gao, Huan
He, Wenjing
Liu, Tianhu
Li, Gang
FRONTIERS IN PUBLIC HEALTH, 2022, 10 : 938113
[44] Deep Learning for Quantified Gait Analysis: A Systematic Literature Review
Khan, Adil
Galarraga, Omar
Garcia-Salicetti, Sonia
Vigneron, Vincent
IEEE ACCESS, 2024, 12 : 138932 - 138957
[45] Diagnostic accuracy of deep learning using speech samples in depression: a systematic review and meta-analysis
Liu, Lidan
Liu, Lu
Wafa, Hatem A.
Tydeman, Florence
Xie, Wanqing
Wang, Yanzhong
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (10) : 2394 - 2404
[46] Diagnostic accuracy of deep learning-based algorithms in laryngoscopy: a systematic review and meta-analysis
Du, Shengyi
Guo, Jin
Huang, Donghai
Liu, Yong
Zhang, Xin
Lu, Shanhong
EUROPEAN ARCHIVES OF OTO-RHINO-LARYNGOLOGY, 2025, 282 (01) : 351 - 360
[47] Deep Learning for Software Defect Prediction in time
Yadav, Monika
Singh, Vijendra
Rastogi, Priyanka
2018 FIFTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (IEEE PDGC), 2018, : 7 - 12
[48] Machine learning-based models for prediction of survival in medulloblastoma: a systematic review and meta-analysis
Hajikarimloo, Bardia
Habibi, Mohammad Amin
Alvani, Mohammadamin Sabbagh
Meinagh, Sima Osouli
Kooshki, Alireza
Afkhami-Ardakani, Omid
Rasouli, Fatemeh
Tos, Salem M.
Tavanaei, Roozbeh
Akhlaghpasand, Mohammadhosein
Hashemi, Rana
Hasanzade, Arman
NEUROLOGICAL SCIENCES, 2025, 46 (02) : 689 - 696
[49] Facial expression deep learning algorithms in the detection of neurological disorders: a systematic review and meta-analysis
Shania Yoonesi
Ramila Abedi Azar
Melika Arab Bafrani
Shayan Yaghmayee
Haniye Shahavand
Majid Mirmazloumi
Narges Moazeni Limoudehi
Mohammadreza Rahmani
Saina Hasany
Fatemeh Zahra Idjadi
Mohammad Amin Aalipour
Hossein Gharedaghi
Sadaf Salehi
Mahsa Asadi Anar
Mohammad Saeed Soleimani
BioMedical Engineering OnLine, 24 (1)
[50] Exploring the effectiveness of artificial intelligence, machine learning and deep learning in trauma triage: A systematic review and meta-analysis
Adebayo, Oluwasemilore
Bhuiyan, Zunira Areeba
Ahmed, Zubair
DIGITAL HEALTH, 2023, 9

← 1 2 3 4 5 →