A Feature Selection Method for Software Defect Prediction Based on Improved Beluga Whale Optimization Algorithm

被引:0
作者
Qiu, Shaoming [1 ]
He, Jingjie [1 ]
Wang, Yan [1 ]
E, Bicong [1 ]
机构
[1] Dalian Univ, Sch Informat Engn, Dalian 116622, Peoples R China
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2025年 / 83卷 / 03期
关键词
Software defect prediction; feature selection; beluga optimization algorithm; triangular wandering strategy; cauchy mutation; reverse learning;
D O I
10.32604/cmc.2025.061532
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Software defect prediction (SDP) aims to find a reliable method to predict defects in specific software projects and help software engineers allocate limited resources to release high-quality software products. Software defect prediction can be effectively performed using traditional features, but there are some redundant or irrelevant features in them (the presence or absence of this feature has little effect on the prediction results). These problems can be solved using feature selection. However, existing feature selection methods have shortcomings such as insignificant dimensionality reduction effect and low classification accuracy of the selected optimal feature subset. In order to reduce the impact of these shortcomings, this paper proposes a new feature selection method Cubic Traverse Ma Beluga whale optimization algorithm (CTMBWO) based on the improved Beluga whale optimization algorithm (BWO). The goal of this study is to determine how well the CTMBWO can extract the features that are most important for correctly predicting software defects, improve the accuracy of fault prediction, reduce the number of the selected feature and mitigate the risk of overfitting, thereby achieving more efficient resource utilization and better distribution of test workload. The CTMBWO comprises three main stages: preprocessing the dataset, selecting relevant features, and evaluating the classification performance of the model. The novel feature selection method can effectively improve the performance of SDP. This study performs experiments on two software defect datasets (PROMISE, NASA) and shows the method's classification performance using four detailed evaluation metrics, Accuracy, F1-score, MCC, AUC and Recall. The results indicate that the approach presented in this paper achieves outstanding classification performance on both datasets and has significant improvement over the baseline models.
引用
收藏
页码:4879 / 4898
页数:20
相关论文
共 29 条
[1]   Software fault prediction using particle swarm algorithm with genetic algorithm and support vector machine classifier [J].
Alsghaier, Hiba ;
Akour, Mohammed .
SOFTWARE-PRACTICE & EXPERIENCE, 2020, 50 (04) :407-427
[2]   Feature selection using firefly algorithm in software defect prediction [J].
Anbu, M. ;
Mala, G. S. Anandha .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 5) :10925-10934
[3]   A new binary chaos-based metaheuristic algorithm for software defect prediction [J].
Arasteh, Bahman ;
Arasteh, Keyvan ;
Ghaffari, Ali ;
Ghanbarzadeh, Reza .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (07) :10093-10123
[4]  
Breiman L, 2001, MACH LEARN, V45, P5, DOI [10.1186/s12859-018-2419-4, 10.3322/caac.21834]
[5]   Feature selection using binary grey wolf optimizer with elite-based crossover for Arabic text classification [J].
Chantar, Hamouda ;
Mafarja, Majdi ;
Alsawalqah, Hamad ;
Heidari, Ali Asghar ;
Aljarah, Ibrahim ;
Faris, Hossam .
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (16) :12201-12220
[6]   Feature Selection Using Golden Jackal Optimization for Software Fault Prediction [J].
Das, Himansu ;
Prajapati, Sanjay ;
Gourisaria, Mahendra Kumar ;
Pattanayak, Radha Mohan ;
Alameen, Abdalla ;
Kolhar, Manjur .
MATHEMATICS, 2023, 11 (11)
[7]   A novel chaos optimization algorithm [J].
Feng, Junhong ;
Zhang, Jie ;
Zhu, Xiaoshu ;
Lian, Wenwu .
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (16) :17405-17436
[8]  
Fenton NormanE., 1998, Software Metrics: A Rigorous and Practical Approach, V2nd, DOI 10.1201/b17461
[9]   SMOTE for Learning from Imbalanced Data: Progress and Challenges, Marking the 15-year Anniversary [J].
Fernandez, Alberto ;
Garcia, Salvador ;
Herrera, Francisco ;
Chawla, Nitesh V. .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2018, 61 :863-905
[10]  
Goyal Somya, 2021, International Journal of Information Technology, P2185, DOI [10.1007/s41870-021-00804-w, 10.1007/s41870-021-00804-w]