A Feature Selection-Based K-NN Model for Fast Software Defect Prediction

Cited by: 4
Authors
Awotunde, Joseph Bamidele [1]
Misra, Sanjay [2]
Adeniyi, Abidemi Emmanuel [2]
Abiodun, Moses Kazeem [1,3]
Kaushik, Manju [4]
Lawrence, Morolake Oladayo [5]
Affiliations
[1] Univ Ilorin, Dept Comp Sci, Ilorin, Nigeria
[2] Ostfold Univ Coll, Dept Comp Sci & Commun, Halden, Norway
[3] Landmark Univ, Dept Comp Sci, Omu Aran, Nigeria
[4] Amity Univ, Amity Inst Informat Technol, Jaipur, Rajasthan, India
[5] Baze Univ, Dept Comp Sci, Abuja, Nigeria
Source
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2022 WORKSHOPS, PART IV | 2022 / Vol. 13380
Keywords
Software defect prediction; Machine learning; Extreme gradient boost; Feature selection; Prediction; Software development life cycle;
DOI
10.1007/978-3-031-10542-5_4
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Software Defect Prediction (SDP) is a technique for predicting software defects during the software development life cycle. Many SDP methods have been proposed, but their performance varies across datasets, which makes them unreliable for predicting defects in an unseen software project. A hybrid technique that combines feature selection with machine learning can be more effective, because it draws on several methods to achieve better prediction accuracy on a given dataset than an individual classifier. The major issues with individual ML-based models for SDP are long detection time, vulnerability of the software project, and high dimensionality of the feature parameters. Therefore, this study proposes a hybrid model that uses a feature selection-enabled Extreme Gradient Boost (XGB) classifier to address these challenges. The model was implemented on the cleaned NASA MDP datasets and evaluated with performance metrics including F-score, accuracy, and MCC. Compared with state-of-the-art methods without feature selection, the proposed model performs better on the metrics used, and the results indicate that it outperformed all the other prediction techniques it was compared with.
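The three evaluation metrics named in the abstract (accuracy, F-score, and MCC) can be illustrated with a minimal sketch. It assumes binary labels (1 = defective, 0 = clean) and uses the standard confusion-matrix definitions, with F-score taken as F1. The function names and the toy labels are hypothetical and are not taken from the paper.

```python
import math


def confusion_counts(y_true, y_pred):
    """Count (TP, FP, TN, FN) for binary labels: 1 = defective, 0 = clean."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == 1 and pred == 1:
            tp += 1
        elif truth == 0 and pred == 1:
            fp += 1
        elif truth == 0 and pred == 0:
            tn += 1
        else:
            fn += 1
    return tp, fp, tn, fn


def accuracy(tp, fp, tn, fn):
    """Fraction of modules classified correctly."""
    total = tp + fp + tn + fn
    return (tp + tn) / total if total else 0.0


def f_score(tp, fp, tn, fn):
    """F1: harmonic mean of precision and recall, written as 2TP / (2TP + FP + FN)."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def mcc(tp, fp, tn, fn):
    """Matthews correlation coefficient; defined as 0.0 when the denominator is zero."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Toy labels (hypothetical): 3 defective and 5 clean modules.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 0, 0]
counts = confusion_counts(y_true, y_pred)
print(accuracy(*counts), f_score(*counts), mcc(*counts))
```

MCC uses all four confusion-matrix cells, so it is often reported alongside accuracy on imbalanced defect datasets, where a classifier that labels every module as clean can still score a high accuracy.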
Pages: 49-61
Page count: 13
Related papers
50 in total
  • [1] A many objective based feature selection model for software defect prediction
    Mao, Qi
    Zhang, Jingbo
    Zhao, Tianhao
    Cai, Xingjuan
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (19)
  • [2] Feature Selection by Using DE Algorithm and k-NN Classifier
    Senel, Fatih Ahmet
    Yuksel, Asim Sinan
    Yigit, Tuncay
    ARTIFICIAL INTELLIGENCE AND APPLIED MATHEMATICS IN ENGINEERING PROBLEMS, 2020, 43 : 886 - 893
  • [3] Software Defect Prediction Scheme Based on Feature Selection
    Wang, Pei
    Jin, Cong
    Jin, Shu-Wei
    2012 INTERNATIONAL SYMPOSIUM ON INFORMATION SCIENCE AND ENGINEERING (ISISE), 2012, : 477 - 480
  • [4] Genetic Feature Selection for Software Defect Prediction
    Wahono, Romi Satria
    Herman, Nanna Suryana
    ADVANCED SCIENCE LETTERS, 2014, 20 (01) : 239 - 244
  • [5] Analysis of Feature Selection Methods in Software Defect Prediction Models
    Ali, Misbah
    Mazhar, Tehseen
    Shahzad, Tariq
    Ghadi, Yazeed Yasin
    Mohsin, Syed Muhammad
    Akber, Syed Muhammad Abrar
    Ali, Mohammed
    IEEE ACCESS, 2023, 11 : 145954 - 145974
  • [6] A feature selection approach based on a similarity measure for software defect prediction
    Qiao Yu
    Shu-juan Jiang
    Rong-cun Wang
    Hong-yang Wang
    Frontiers of Information Technology & Electronic Engineering, 2017, 18 : 1744 - 1753
  • [7] A feature selection approach based on a similarity measure for software defect prediction
    Yu, Qiao
    Jiang, Shu-juan
    Wang, Rong-cun
    Wang, Hong-yang
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2017, 18 (11) : 1744 - 1753
  • [8] FECAR: A Feature Selection Framework for Software Defect Prediction
    Liu, Shulong
    Chen, Xiang
    Liu, Wangshu
    Chen, Jiaqiang
    Gu, Qing
    Chen, Daoxu
    2014 IEEE 38TH ANNUAL INTERNATIONAL COMPUTERS, SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), 2014, : 426 - 435
  • [9] USING K-NN WITH WEIGHTS TO DETECT DIABETES MELLITUS BASED ON GENETIC ALGORITHM FEATURE SELECTION
    Shu, Ting
    Zhang, Bob
    Tang, Y. Y.
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION (ICWAPR), 2016, : 12 - 17
  • [10] ELM and KELM based software defect prediction using feature selection techniques
    Arora, Ishani
    Saha, Anju
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2019, 40 (05) : 1025 - 1045