Software Defect Prediction Using Ensemble Learning: A Systematic Literature Review

被引:66
|
作者
Matloob, Faseeha [1 ]
Ghazal, Taher M. [2 ,3 ]
Taleb, Nasser [4 ]
Aftab, Shabib [1 ,5 ]
Ahmad, Munir [5 ]
Khan, Muhammad Adnan [6 ]
Abbas, Sagheer [5 ]
Soomro, Tariq Rahim [7 ]
机构
[1] Virtual Univ Pakistan, Dept Comp Sci, Lahore 44000, Pakistan
[2] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Ctr Cyber Secur, Bangi 43600, Selangor, Malaysia
[3] Univ City Sharjah, Skyline Univ Coll, Sch Informat Technol, Sharjah, U Arab Emirates
[4] Canadian Univ Dubai, Fac Management, Dubai, U Arab Emirates
[5] Natl Coll Business Adm & Econ, Sch Comp Sci, Lahore 54660, Pakistan
[6] Gachon Univ, Dept Software, Pattern Recognit & Machine Learning Lab, Seongnam 13557, South Korea
[7] Inst Business Management, CCSIS, Karachi 75190, Sindh, Pakistan
关键词
Software; Systematics; Data mining; Tools; Predictive models; Machine learning algorithms; Bibliographies; Systematic literature review (SLR); ensemble classifier; hybrid classifier; software defect prediction; FOREST;
D O I
10.1109/ACCESS.2021.3095559
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent advances in the domain of software defect prediction (SDP) include the integration of multiple classification techniques to create an ensemble or hybrid approach. This technique was introduced to improve the prediction performance by overcoming the limitations of any single classification technique. This research provides a systematic literature review on the use of the ensemble learning approach for software defect prediction. The review is conducted after critically analyzing research papers published since 2012 in four well-known online libraries: ACM, IEEE, Springer Link, and Science Direct. In this study, five research questions covering the different aspects of research progress on the use of ensemble learning for software defect prediction are addressed. To extract the answers to identified questions, 46 most relevant papers are shortlisted after a thorough systematic research process. This study will provide compact information regarding the latest trends and advances in ensemble learning for software defect prediction and provide a baseline for future innovations and further reviews. Through our study, we discovered that frequently employed ensemble methods by researchers are the random forest, boosting, and bagging. Less frequently employed methods include stacking, voting and Extra Trees. Researchers proposed many promising frameworks, such as EMKCA, SMOTE-Ensemble, MKEL, SDAEsTSE, TLEL, and LRCR, using ensemble learning methods. The AUC, accuracy, F-measure, Recall, Precision, and MCC were mostly utilized to measure the prediction performance of models. WEKA was widely adopted as a platform for machine learning. Many researchers showed through empirical analysis that features selection, and data sampling was necessary pre-processing steps that improve the performance of ensemble classifiers.
引用
收藏
页码:98754 / 98771
页数:18
相关论文
共 50 条
  • [21] Software Defect Prediction Method Based on Clustering Ensemble Learning
    Tao, Hongwei
    Cao, Qiaoling
    Chen, Haoran
    Li, Yanting
    Niu, Xiaoxu
    Wang, Tao
    Geng, Zhenhao
    Shang, Songtao
    IET SOFTWARE, 2024, 2024
  • [22] Software Defect Prediction: A Machine Learning Approach with Voting Ensemble
    Mosquera, Marcela
    Hurtado, Remigio
    PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2024, VOL 3, 2024, 1013 : 585 - 595
  • [23] An Ensemble Learning Approach for Software Defect Prediction in Developing Quality Software Product
    Saheed, Yakub Kayode
    Longe, Olumide
    Baba, Usman Ahmad
    Rakshit, Sandip
    Vajjhala, Narasimha Rao
    ADVANCES IN COMPUTING AND DATA SCIENCES, PT I, 2021, 1440 : 317 - 326
  • [24] A Hierarchical Feature Ensemble Deep Learning Approach for Software Defect Prediction
    Zhang, Shenggang
    Jiang, Shujuan
    Yan, Yue
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2023, 33 (04) : 543 - 573
  • [25] Success and Failure in Software Engineering: A Followup Systematic Literature Review
    Tamburri, Damian A.
    Palomba, Fabio
    Kazman, Rick
    IEEE TRANSACTIONS ON ENGINEERING MANAGEMENT, 2021, 68 (02) : 599 - 611
  • [26] A systematic literature review on software defect prediction using artificial intelligence: Datasets, Data Validation Methods, Approaches, and Tools
    Pachouly, Jalaj
    Ahirrao, Swati
    Kotecha, Ketan
    Selvachandran, Ganeshsree
    Abraham, Ajith
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 111
  • [27] Machine/Deep Learning for Software Engineering: A Systematic Literature Review
    Wang, Simin
    Huang, Liguo
    Gao, Amiao
    Ge, Jidong
    Zhang, Tengfei
    Feng, Haitao
    Satyarth, Ishna
    Li, Ming
    Zhang, He
    Ng, Vincent
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (03) : 1188 - 1231
  • [28] Just-in-time Software Defect Prediction: Literature Review
    Cai L.
    Fan Y.-R.
    Yan M.
    Xia X.
    Ruan Jian Xue Bao/Journal of Software, 2019, 30 (05): : 1288 - 1307
  • [29] Hybrid SMOTE-Ensemble Approach for Software Defect Prediction
    Alsawalqah, Hamad
    Faris, Hossam
    Aljarah, Ibrahim
    Alnemer, Loai
    Alhindawi, Nouh
    SOFTWARE ENGINEERING TRENDS AND TECHNIQUES IN INTELLIGENT SYSTEMS, CSOC2017, VOL 3, 2017, 575 : 355 - 366
  • [30] An Update on Effort Estimation in Agile Software Development: A Systematic Literature Review
    Fernandez-Diego, Marta
    Mendez, Erwin R.
    Gonzalez-Ladron-De-Guevara, Fernando
    Abrahao, Silvia
    Insfran, Emilio
    IEEE ACCESS, 2020, 8 : 166768 - 166800