A Feature Selection-Based K-NN Model for Fast Software Defect Prediction

被引:5
作者
Awotunde, Joseph Bamidele [1 ]
Misra, Sanjay [2 ]
Adeniyi, Abidemi Emmanuel [2 ]
Abiodun, Moses Kazeem [1 ,3 ]
Kaushik, Manju [4 ]
Lawrence, Morolake Oladayo [5 ]
机构
[1] Univ Ilorin, Dept Comp Sci, Ilorin, Nigeria
[2] Ostfold Univ Coll, Dept Comp Sci & Commun, Halden, Norway
[3] Landmark Univ, Dept Comp Sci, Omu Aran, Nigeria
[4] Amity Univ, Amity Inst Informat Technol, Jaipur, Rajasthan, India
[5] Baze Univ, Dept Comp Sci, Abuja, Nigeria
来源
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2022 WORKSHOPS, PART IV | 2022年 / 13380卷
关键词
Software defect prediction; Machine learning; Extreme gradient boost; Feature selection; Prediction; Software development life cycle;
D O I
10.1007/978-3-031-10542-5_4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Software Defect Prediction (SDP) is an advanced technological method of predicting software defects in the software development life cycle. Various research works have been previously being done on SDP but the performance of these methods varied from several datasets, hence, making them inconsistent for SDP in the unknown software project. But the hybrid technique using feature selection enabled with machine learning for SDP can be very efficient as it takes the advantage of various methods to come up with better prediction accuracy for a given dataset when compared with an individual classifier. The major issues with individual ML-based models for SDP are the long detection time, vulnerability of the software project, and high dimensionality of the feature parameters. Therefore, this study proposes a hybrid model using a feature selection enabled Extreme Gradient Boost (XGB) classifier to address these mentioned challenges. The cleaned NASA MDP datasets were used for the implementation of the proposed model, and various performance metrics like F-score, accuracy, and MCC were used to reveal the performance of the model. The results of the proposed model when compared with state-of-the-art methods without feature selection perform better in terms of the metrics used. The results reveal that the proposed model outperformed all other prediction techniques.
引用
收藏
页码:49 / 61
页数:13
相关论文
共 50 条
[31]   Software defect prediction based on nested-stacking and heterogeneous feature selection [J].
Li-qiong Chen ;
Can Wang ;
Shi-long Song .
Complex & Intelligent Systems, 2022, 8 :3333-3348
[32]   Software defect prediction based on nested-stacking and heterogeneous feature selection [J].
Chen, Li-qiong ;
Wang, Can ;
Song, Shi-long .
COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (04) :3333-3348
[33]   Feature-Grouping-Based Two Steps Feature Selection Algorithm in Software Defect Prediction [J].
Du, Yuntao ;
Zhang, Lu ;
Shi, Jiahao ;
Tang, Jingjuan ;
Yin, Ying .
ICAIP 2018: 2018 THE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN IMAGE PROCESSING, 2018, :173-178
[34]   Selection of Relevant Features for Text Classification with K-NN [J].
Balicki, Jerzy ;
Krawczyk, Henryk ;
Rymko, Lukasz ;
Szymanski, Julian .
ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, PT II, 2013, 7895 :477-488
[35]   A feature selection model for software defect prediction using binary Rao optimization algorithm [J].
Thirumoorthy, Karpagalingam ;
Britto J, Jerold John .
APPLIED SOFT COMPUTING, 2022, 131
[36]   Optimal Feature Selection-Based Dental Caries Prediction Model Using Machine Learning for Decision Support System [J].
Kang, In-Ae ;
Njimbouom, Soualihou Ngnamsie ;
Kim, Jeong-Dong .
BIOENGINEERING-BASEL, 2023, 10 (02)
[37]   Enhancing Groundwater Recharge Prediction: A Feature Selection-Based Deep Forest Model With Bayesian Optimisation [J].
Liu, Bao ;
Sun, Yaohua ;
Gao, Lei .
HYDROLOGICAL PROCESSES, 2024, 38 (10)
[38]   Feature Selection and Software Defect Prediction by Different Ensemble Classifiers [J].
Shakhovska, Natalya ;
Yakovyna, Vitaliy .
DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2021, PT I, 2021, 12923 :307-313
[39]   Depth linear discrimination-oriented feature selection method based on adaptive sine cosine algorithm for software defect prediction [J].
Nasser, Abdullah B. ;
Ghanem, Waheed Ali H. M. ;
Saad, Abdul-Malik H. Y. ;
Abdul-Qawy, Antar Shaddad Hamed ;
Ghaleb, Sanaa A. A. ;
Alduais, Nayef Abdulwahab Mohammed ;
Din, Fakhrud ;
Ghetas, Mohamed .
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 253
[40]   A software defect prediction method with metric compensation based on feature selection and transfer learning [J].
Chen, Jinfu ;
Wang, Xiaoli ;
Cai, Saihua ;
Xu, Jiaping ;
Chen, Jingyi ;
Chen, Haibo .
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2022, 23 (05) :715-731