Software Defect Prediction Using Ensemble Learning: A Systematic Literature Review

被引:66
|
作者
Matloob, Faseeha [1 ]
Ghazal, Taher M. [2 ,3 ]
Taleb, Nasser [4 ]
Aftab, Shabib [1 ,5 ]
Ahmad, Munir [5 ]
Khan, Muhammad Adnan [6 ]
Abbas, Sagheer [5 ]
Soomro, Tariq Rahim [7 ]
机构
[1] Virtual Univ Pakistan, Dept Comp Sci, Lahore 44000, Pakistan
[2] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Ctr Cyber Secur, Bangi 43600, Selangor, Malaysia
[3] Univ City Sharjah, Skyline Univ Coll, Sch Informat Technol, Sharjah, U Arab Emirates
[4] Canadian Univ Dubai, Fac Management, Dubai, U Arab Emirates
[5] Natl Coll Business Adm & Econ, Sch Comp Sci, Lahore 54660, Pakistan
[6] Gachon Univ, Dept Software, Pattern Recognit & Machine Learning Lab, Seongnam 13557, South Korea
[7] Inst Business Management, CCSIS, Karachi 75190, Sindh, Pakistan
关键词
Software; Systematics; Data mining; Tools; Predictive models; Machine learning algorithms; Bibliographies; Systematic literature review (SLR); ensemble classifier; hybrid classifier; software defect prediction; FOREST;
D O I
10.1109/ACCESS.2021.3095559
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent advances in the domain of software defect prediction (SDP) include the integration of multiple classification techniques to create an ensemble or hybrid approach. This technique was introduced to improve the prediction performance by overcoming the limitations of any single classification technique. This research provides a systematic literature review on the use of the ensemble learning approach for software defect prediction. The review is conducted after critically analyzing research papers published since 2012 in four well-known online libraries: ACM, IEEE, Springer Link, and Science Direct. In this study, five research questions covering the different aspects of research progress on the use of ensemble learning for software defect prediction are addressed. To extract the answers to identified questions, 46 most relevant papers are shortlisted after a thorough systematic research process. This study will provide compact information regarding the latest trends and advances in ensemble learning for software defect prediction and provide a baseline for future innovations and further reviews. Through our study, we discovered that frequently employed ensemble methods by researchers are the random forest, boosting, and bagging. Less frequently employed methods include stacking, voting and Extra Trees. Researchers proposed many promising frameworks, such as EMKCA, SMOTE-Ensemble, MKEL, SDAEsTSE, TLEL, and LRCR, using ensemble learning methods. The AUC, accuracy, F-measure, Recall, Precision, and MCC were mostly utilized to measure the prediction performance of models. WEKA was widely adopted as a platform for machine learning. Many researchers showed through empirical analysis that features selection, and data sampling was necessary pre-processing steps that improve the performance of ensemble classifiers.
引用
收藏
页码:98754 / 98771
页数:18
相关论文
共 50 条
  • [31] Software Defect Prediction Using an Intelligent Ensemble-Based Model
    Ali, Misbah
    Mazhar, Tehseen
    Arif, Yasir
    Al-Otaibi, Shaha
    Ghadi, Yazeed Yasin
    Shahzad, Tariq
    Khan, Muhammad Amir
    Hamam, Habib
    IEEE ACCESS, 2024, 12 : 20376 - 20395
  • [32] Current Trends in Class Imbalance Learning for Software Defect Prediction
    Goyal, Somya R.
    IEEE ACCESS, 2025, 13 : 16896 - 16917
  • [33] Code Multiview Hypergraph Representation Learning for Software Defect Prediction
    Qiu, Shaojian
    Huang, Mengyang
    Liang, Yun
    Peng, Chaoda
    Yuan, Yuan
    IEEE TRANSACTIONS ON RELIABILITY, 2024, : 1863 - 1876
  • [34] Challenges of Software Requirements Quality Assurance and Validation: A Systematic Literature Review
    Atoum, Issa
    Baklizi, Mahmoud Khalid
    Alsmadi, Izzat
    Otoom, Ahmed Ali
    Alhersh, Taha
    Ababneh, Jafar
    Almalki, Jameel
    Alshahrani, Saeed Masoud
    IEEE ACCESS, 2021, 9 : 137613 - 137634
  • [35] Validating Unsupervised Machine Learning Techniques for Software Defect Prediction With Generic Metamorphic Testing
    Chan, Pak Yuen Patrick
    Keung, Jacky
    IEEE ACCESS, 2024, 12 : 165155 - 165172
  • [36] Software Defect Prediction and Localization with Attention-Based Models and Ensemble Learning
    Zhang, Tianhang
    Du, Qingfeng
    Xu, Jincheng
    Li, Jiechu
    Li, Xiaojun
    2020 27TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2020), 2020, : 81 - 90
  • [37] Software Defect Density Prediction Using Deep Learning
    Alghanim, Firas
    Azzeh, Mohammad
    El-Hassan, Ammar
    Qattous, Hazem
    IEEE ACCESS, 2022, 10 : 114629 - 114641
  • [38] Using Class Imbalance Learning for Software Defect Prediction
    Wang, Shuo
    Yao, Xin
    IEEE TRANSACTIONS ON RELIABILITY, 2013, 62 (02) : 434 - 443
  • [39] Model-based Software Defect Prediction from Software Quality Characterized Code Features by using Stacking Ensemble Learning
    Kumar P.S.
    Nayak J.
    Behera H.S.
    Journal of Engineering Science and Technology Review, 2022, 15 (02) : 137 - 155
  • [40] The need for more informative defect prediction: A systematic literature review
    Grattan, Natalie
    da Costa, Daniel Alencar
    Stanger, Nigel
    INFORMATION AND SOFTWARE TECHNOLOGY, 2024, 171