Data and Ensemble Machine Learning Fusion Based Intelligent Software Defect Prediction System

被引:3
作者
Abbas, Sagheer [1 ]
Aftab, Shabib [1 ,2 ]
Khan, Muhammad Adnan [3 ,4 ]
Ghazal, Taher M. [5 ,6 ]
Al Hamadi, Hussam [7 ]
Yeun, Chan Yeob [8 ]
机构
[1] Natl Coll Business Adm & Econ, Sch Comp Sci, Lahore 54000, Pakistan
[2] Virtual Univ Pakistan, Dept Comp Sci, Lahore 54000, Pakistan
[3] Gachon Univ, Fac Artificial Intelligence & Software, Dept Software, Seongnam 13120, South Korea
[4] Riphah Int Univ, Fac Comp, Riphah Sch Comp & Innovat, Lahore Campus, Lahore 54000, Pakistan
[5] Skyline Univ Coll, Sch Informat Technol, Sharjah, U Arab Emirates
[6] UKM, Fac Informat Sci & Technol, Ctr Cyber Secur, Bangi 43600, Selangor, Malaysia
[7] Univ Dubai, Coll Engn & IT, Al Ain 14143, U Arab Emirates
[8] Khalifa Univ, Ctr Cyber Phys Syst, EECS Dept, Abu Dhabi 127788, U Arab Emirates
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2023年 / 75卷 / 03期
关键词
Ensemble machine learning fusion; software defect prediction; fuzzy logic; QUALITY; OVERLAP; MODEL;
D O I
10.32604/cmc.2023.037933
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The software engineering field has long focused on creating high-quality software despite limited resources. Detecting defects before the testing stage of software development can enable quality assurance engineers to con-centrate on problematic modules rather than all the modules. This approach can enhance the quality of the final product while lowering development costs. Identifying defective modules early on can allow for early corrections and ensure the timely delivery of a high-quality product that satisfies customers and instills greater confidence in the development team. This process is known as software defect prediction, and it can improve end-product quality while reducing the cost of testing and maintenance. This study proposes a software defect prediction system that utilizes data fusion, feature selection, and ensemble machine learning fusion techniques. A novel filter-based metric selection technique is proposed in the framework to select the optimum features. A three-step nested approach is presented for predicting defective modules to achieve high accuracy. In the first step, three supervised machine learning techniques, including Decision Tree, Support Vector Machines, and Naive Bayes, are used to detect faulty modules. The second step involves integrating the predictive accuracy of these classification techniques through three ensemble machine-learning methods: Bagging, Voting, and Stacking. Finally, in the third step, a fuzzy logic technique is employed to integrate the predictive accuracy of the ensemble machine learning techniques. The experiments are performed on a fused software defect dataset to ensure that the developed fused ensemble model can perform effectively on diverse datasets. Five NASA datasets are integrated to create the fused dataset: MW1, PC1, PC3, PC4, and CM1. According to the results, the proposed system exhibited superior performance to other advanced techniques for predicting software defects, achieving a remarkable accuracy rate of 92.08%.
引用
收藏
页码:6083 / 6100
页数:18
相关论文
共 30 条
  • [1] Abdou AS., 2018, INT J COMPUTER APPL, V179, P29, DOI [10.5120/ijca2018917185, DOI 10.5120/IJCA2018917185]
  • [2] Ali U., 2020, Modern Education and Computer Science, V12, P29, DOI 10.5815/ijmecs.2020.05.03
  • [3] Balogun A. O., 2022, 22 INT C COMP SCI IT, P615
  • [4] Software metrics thresholds calculation techniques to predict fault-proneness: An empirical comparison
    Boucher, Alexandre
    Badri, Mourad
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2018, 96 : 38 - 67
  • [5] Tackling class overlap and imbalance problems in software defect prediction
    Chen, Lin
    Fang, Bin
    Shang, Zhaowei
    Tang, Yuanyan
    [J]. SOFTWARE QUALITY JOURNAL, 2018, 26 (01) : 97 - 125
  • [6] Ensemble of convolutional neural networks based on an evolutionary algorithm applied to an industrial welding process
    Cruz, Yarens J.
    Rivas, Marcelino
    Quiza, Ramon
    Villalonga, Alberto
    Haber, Rodolfo E.
    Beruvides, Gerardo
    [J]. COMPUTERS IN INDUSTRY, 2021, 133
  • [7] Machine Learning Empowered Software Defect Prediction System
    Daoud, Mohammad Sh.
    Aftab, Shabib
    Ahmad, Munir
    Khan, Muhammad Adnan
    Iqbal, Ahmed
    Abbas, Sagheer
    Iqbal, Muhammad
    Ihnaini, Baha
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 31 (02) : 1287 - 1300
  • [8] Goyal S., 2020, COMPUTATIONAL INTELL, P49
  • [9] Heterogeneous stacked ensemble classifier for software defect prediction
    Goyal, Somya
    Bhatia, Pradeep Kumar
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (26) : 37033 - 37055
  • [10] Comparison of Machine Learning Techniques for Software Quality Prediction
    Goyal, Somya
    [J]. INTERNATIONAL JOURNAL OF KNOWLEDGE AND SYSTEMS SCIENCE, 2020, 11 (02) : 20 - 40