Capsule feature selector for software defect prediction

被引:0
作者
Tang, Yu [1 ,2 ,5 ]
Dai, Qi [3 ]
Du, Ye [1 ,2 ,4 ,5 ]
Zheng, Tian-shuai [1 ,2 ,5 ]
Li, Mei-hong [1 ,2 ,4 ,5 ]
机构
[1] Beijing Jiaotong Univ, Sch Cyberspace Sci & Technol, 3,Shangyuan Village, Beijing 100044, Peoples R China
[2] Beijing Jiaotong Univ, Beijing Key Lab Secur & Privacy Intelligent Transp, Beijing 100044, Peoples R China
[3] North China Univ Sci & Technol, Coll Sci, Tangshan 063210, Peoples R China
[4] Beijing Jiaotong Univ, Beijing Lab Natl Econ Secur Early Warning Engn, Beijing 100044, Peoples R China
[5] Beijing Jiaotong Univ, Tangshan Res Inst, Tangshan 063210, Peoples R China
关键词
Feature selection; Subspace; Capsule feature selector; Binary capsule operator; Software defect prediction; OPTIMIZATION;
D O I
10.1007/s11227-025-06949-w
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The performance of predictive models in software defect prediction is vulnerable to redundant features. Feature selection methods prove effective in reducing the spatial dimensionality of features. However, traditional approaches in feature selection encounter challenges in efficiently delivering the optimal subset of features. We introduce a capsule feature selector (CFS) for software defect prediction, drawing inspiration from capsule networks. Initially, our method outlines four essential computational processes within the capsule feature selector: capsule structure, sorting rules, search rules, and transfer rules. Subsequently, we uniformly place the binary capsule operator into the capsule basic units, employing search rules to refine the search process. Sorting rules are applied to select the optimal binary capsule operator for each individual capsule basic unit. Lastly, the optimal binary capsule operator from each capsule basic unit is transmitted to the capsule region for fitness value comparison via the transfer rules. This process facilitates the selection of the optimal subset of features. Comparative experiments conducted on 15 publicly available software defect datasets demonstrate that CFS outperforms the other eight advanced feature selection algorithms. This indicates that CFS exhibits superior performance. The experimental findings are further substantiated through nonparametric statistical analysis.
引用
收藏
页数:38
相关论文
共 56 条
[1]   Chaotic binary Group Search Optimizer for feature selection [J].
Abualigah, Laith ;
Diabat, Ali .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 192
[2]   Evaluating the impact of feature selection consistency in software prediction [J].
Ali, Asad ;
Gravino, Carmine .
SCIENCE OF COMPUTER PROGRAMMING, 2022, 213
[3]   Examining the performance of kernel methods for software defect prediction based on support vector machine [J].
Azzeh, Mohammad ;
Elsheikh, Yousef ;
Nassif, Ali Bou ;
Angelis, Lefteris .
SCIENCE OF COMPUTER PROGRAMMING, 2023, 226
[4]   Data quality issues in software fault prediction: a systematic literature review [J].
Bhandari, Kirti ;
Kumar, Kuldeep ;
Sangal, Amrit Lal .
ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (08) :7839-7908
[5]   An improved hybrid chameleon swarm algorithm for feature selection in medical diagnosis [J].
Braik, Malik Shehadeh ;
Hammouri, Abdelaziz I. ;
Awadallah, Mohammed A. ;
Al-Betar, Mohammed Azmi ;
Khtatneh, Khalaf .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 85
[6]   EEG-based sleep staging via self-attention based capsule network with Bi-LSTM model [J].
Chen, Jin ;
Han, Zhihui ;
Qiao, Heyuan ;
Li, Chang ;
Peng, Hu .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86
[7]   Software defect prediction based on nested-stacking and heterogeneous feature selection [J].
Chen, Li-qiong ;
Wang, Can ;
Song, Shi-long .
COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (04) :3333-3348
[8]   Block feature selection based on NSGA-II applied to fault diagnosis of gearboxes [J].
Chen, Xianhua ;
Tian, Zhigang ;
Rao, Meng .
ADVANCED ENGINEERING INFORMATICS, 2023, 57
[9]   Introducing the fractional-order Darwinian PSO [J].
Couceiro, Micael S. ;
Rocha, Rui P. ;
Fonseca Ferreira, N. M. ;
Tenreiro Machado, J. A. .
SIGNAL IMAGE AND VIDEO PROCESSING, 2012, 6 (03) :343-350
[10]   Multi-label feature selection by strongly relevant label gain and label mutual aid [J].
Dai, Jianhua ;
Huang, Weiyi ;
Zhang, Chucai ;
Liu, Jie .
PATTERN RECOGNITION, 2024, 145