A Feature Selection-Based K-NN Model for Fast Software Defect Prediction

被引:5
作者
Awotunde, Joseph Bamidele [1 ]
Misra, Sanjay [2 ]
Adeniyi, Abidemi Emmanuel [2 ]
Abiodun, Moses Kazeem [1 ,3 ]
Kaushik, Manju [4 ]
Lawrence, Morolake Oladayo [5 ]
机构
[1] Univ Ilorin, Dept Comp Sci, Ilorin, Nigeria
[2] Ostfold Univ Coll, Dept Comp Sci & Commun, Halden, Norway
[3] Landmark Univ, Dept Comp Sci, Omu Aran, Nigeria
[4] Amity Univ, Amity Inst Informat Technol, Jaipur, Rajasthan, India
[5] Baze Univ, Dept Comp Sci, Abuja, Nigeria
来源
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2022 WORKSHOPS, PART IV | 2022年 / 13380卷
关键词
Software defect prediction; Machine learning; Extreme gradient boost; Feature selection; Prediction; Software development life cycle;
D O I
10.1007/978-3-031-10542-5_4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Software Defect Prediction (SDP) is an advanced technological method of predicting software defects in the software development life cycle. Various research works have been previously being done on SDP but the performance of these methods varied from several datasets, hence, making them inconsistent for SDP in the unknown software project. But the hybrid technique using feature selection enabled with machine learning for SDP can be very efficient as it takes the advantage of various methods to come up with better prediction accuracy for a given dataset when compared with an individual classifier. The major issues with individual ML-based models for SDP are the long detection time, vulnerability of the software project, and high dimensionality of the feature parameters. Therefore, this study proposes a hybrid model using a feature selection enabled Extreme Gradient Boost (XGB) classifier to address these mentioned challenges. The cleaned NASA MDP datasets were used for the implementation of the proposed model, and various performance metrics like F-score, accuracy, and MCC were used to reveal the performance of the model. The results of the proposed model when compared with state-of-the-art methods without feature selection perform better in terms of the metrics used. The results reveal that the proposed model outperformed all other prediction techniques.
引用
收藏
页码:49 / 61
页数:13
相关论文
共 50 条
[21]   A k-NN method for lung cancer prognosis with the use of a genetic algorithm for feature selection [J].
Maleki N. ;
Zeinali Y. ;
Niaki S.T.A. .
Expert Systems with Applications, 2021, 164
[22]   An Enhanced Diabetes Mellitus Prediction Using Feature Selection-Based Type-2 Fuzzy Model [J].
Awotunde, Joseph Bamidele ;
Misra, Sanjay ;
Pham, Quoc Trung .
FUTURE DATA AND SECURITY ENGINEERING. BIG DATA, SECURITY AND PRIVACY, SMART CITY AND INDUSTRY 4.0 APPLICATIONS, FDSE 2022, 2022, 1688 :625-639
[23]   RFC: a feature selection algorithm for software defect prediction [J].
Xu Xiaolong ;
Chen Wen ;
Wang Xinheng .
JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2021, 32 (02) :389-398
[24]   Feature Selection in Software Defect Prediction: A Comparative Study [J].
Kakkar, Misha ;
Jain, Sarika .
2016 6TH INTERNATIONAL CONFERENCE - CLOUD SYSTEM AND BIG DATA ENGINEERING (CONFLUENCE), 2016, :658-663
[25]   k-NN Based MCS Selection in Distributed OFDM Wireless Networks [J].
Meng, Qingmin ;
Gu, Xiong ;
Tian, Feng ;
Zheng, Baoyu .
ADVANCED RESEARCH ON AUTOMATION, COMMUNICATION, ARCHITECTONICS AND MATERIALS, PTS 1 AND 2, 2011, 225-226 (1-2) :974-977
[26]   Combining Integreted Sampling Technique with Feature Selection for Software Defect Prediction [J].
Putri, Sukmawati Anggraeni ;
Frieyadie .
2017 5TH INTERNATIONAL CONFERENCE ON CYBER AND IT SERVICE MANAGEMENT (CITSM 2017), 2017, :235-240
[27]   An empirical study on pareto based multi-objective feature selection for software defect prediction [J].
Ni, Chao ;
Chen, Xiang ;
Wu, Fangfang ;
Shen, Yuxiang ;
Gu, Qing .
JOURNAL OF SYSTEMS AND SOFTWARE, 2019, 152 :215-238
[28]   A Feature Selection Method for Software Defect Prediction Based on Improved Beluga Whale Optimization Algorithm [J].
Qiu, Shaoming ;
He, Jingjie ;
Wang, Yan ;
E, Bicong .
CMC-COMPUTERS MATERIALS & CONTINUA, 2025, 83 (03) :4879-4898
[29]   An effective feature selection based cross-project defect prediction model for software quality improvement [J].
Khatri, Yogita ;
Singh, Sandeep Kumar .
INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2023, 14 (SUPPL 1) :S154-S172
[30]   An effective feature selection based cross-project defect prediction model for software quality improvement [J].
Yogita Khatri ;
Sandeep Kumar Singh .
International Journal of System Assurance Engineering and Management, 2023, 14 :154-172