A Feature Selection Method for Software Defect Prediction Based on Improved Beluga Whale Optimization Algorithm

被引:0
作者
Qiu, Shaoming [1 ]
He, Jingjie [1 ]
Wang, Yan [1 ]
E, Bicong [1 ]
机构
[1] Dalian Univ, Sch Informat Engn, Dalian 116622, Peoples R China
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2025年 / 83卷 / 03期
关键词
Software defect prediction; feature selection; beluga optimization algorithm; triangular wandering strategy; cauchy mutation; reverse learning;
D O I
10.32604/cmc.2025.061532
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Software defect prediction (SDP) aims to find a reliable method to predict defects in specific software projects and help software engineers allocate limited resources to release high-quality software products. Software defect prediction can be effectively performed using traditional features, but there are some redundant or irrelevant features in them (the presence or absence of this feature has little effect on the prediction results). These problems can be solved using feature selection. However, existing feature selection methods have shortcomings such as insignificant dimensionality reduction effect and low classification accuracy of the selected optimal feature subset. In order to reduce the impact of these shortcomings, this paper proposes a new feature selection method Cubic Traverse Ma Beluga whale optimization algorithm (CTMBWO) based on the improved Beluga whale optimization algorithm (BWO). The goal of this study is to determine how well the CTMBWO can extract the features that are most important for correctly predicting software defects, improve the accuracy of fault prediction, reduce the number of the selected feature and mitigate the risk of overfitting, thereby achieving more efficient resource utilization and better distribution of test workload. The CTMBWO comprises three main stages: preprocessing the dataset, selecting relevant features, and evaluating the classification performance of the model. The novel feature selection method can effectively improve the performance of SDP. This study performs experiments on two software defect datasets (PROMISE, NASA) and shows the method's classification performance using four detailed evaluation metrics, Accuracy, F1-score, MCC, AUC and Recall. The results indicate that the approach presented in this paper achieves outstanding classification performance on both datasets and has significant improvement over the baseline models.
引用
收藏
页码:4879 / 4898
页数:20
相关论文
共 29 条
[11]   Genetic Evolution-Based Feature Selection for Software Defect Prediction Using SVMs [J].
Goyal, Somya .
JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (11)
[12]   Comparison of Machine Learning Techniques for Software Quality Prediction [J].
Goyal, Somya .
INTERNATIONAL JOURNAL OF KNOWLEDGE AND SYSTEMS SCIENCE, 2020, 11 (02) :20-40
[13]   Fed-OLF: Federated Oversampling Learning Framework for Imbalanced Software Defect Prediction Under Privacy Protection [J].
Hu, Xiaowen ;
Zheng, Ming ;
Zhu, Rui ;
Zhang, Xuan ;
Jin, Zhi .
IEEE TRANSACTIONS ON RELIABILITY, 2025,
[14]  
Karim S., 2017, 2017 IEEE INT C CYB
[15]   Cost-sensitive decision tree ensembles for effective imbalanced classification [J].
Krawczyk, Bartosz ;
Wozniak, Michal ;
Schaefer, Gerald .
APPLIED SOFT COMPUTING, 2014, 14 :554-562
[16]   Bug severity classification in software using ant colony optimization based feature weighting technique [J].
Kukkar, Ashima ;
Kumar, Yugal ;
Sharma, Ashutosh ;
Sandhu, Jasminder Kaur .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 230
[17]  
Malhotra R, 2025, NEURAL COMPUT APPL, DOI [10.1007/s00521-024-10902-y, 10.1007/s00521-024-10902-y, DOI 10.1007/S00521-024-10902-Y]
[18]   A novel software defect prediction model using two-phase grey wolf optimisation for feature selection [J].
Malhotra, Ruchika ;
Khan, Kishwar .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (09) :12185-12207
[19]   FAST, ACCURATE ALGORITHM FOR NUMERICAL-SIMULATION OF LEVY STABLE STOCHASTIC-PROCESSES [J].
MANTEGNA, RN .
PHYSICAL REVIEW E, 1994, 49 (05) :4677-4683
[20]   Optimal feature selection for support vector machines [J].
Nguyen, Minh Hoai ;
de la Torre, Fernando .
PATTERN RECOGNITION, 2010, 43 (03) :584-591