TSFNFS: two-stage-fuzzy-neighborhood feature selection with binary whale optimization algorithm

被引:12
作者
Sun, Lin [1 ,4 ]
Wang, Xinya [1 ,3 ]
Ding, Weiping [2 ]
Xu, Jiucheng [1 ]
Meng, Huili [1 ]
机构
[1] Henan Normal Univ, Coll Comp & Informat Engn, Xinxiang 453007, Henan, Peoples R China
[2] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[3] Henan Prov Construct Grp Co Ltd, Zhengzhou 450002, Peoples R China
[4] Engn Lab Intelligence Business & Internet Things, Xinxiang 453007, Henan, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; Fuzzy neighborhood; Whale optimization algorithm; Fitness function; NEIGHBORHOOD ROUGH SETS; UNCERTAINTY MEASURES; ENTROPY; REDUCTION; INFORMATION;
D O I
10.1007/s13042-022-01653-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The optimal global feature subset cannot be found easily due to the high cost, and most swarm intelligence optimization-based feature selection methods are inefficient in handling high-dimensional data. In this study, a two-stage feature selection model based on fuzzy neighborhood rough sets (FNRS) and binary whale optimization algorithm (BWOA) is developed. First, to denote the fuzziness of samples for mixed data with symbolic and numerical features, fuzzy neighborhood similarity is presented to study the similarity matrix and fuzzy membership degree, and the lower and upper approximations can be developed to present new FNRS model. Fuzzy neighborhood-based uncertainty measures such as dependence degree, knowledge granularity, and entropy measures are studied. From the viewpoints of algebra and information, fuzzy knowledge granularity conditional entropy is presented to form a preselected feature reduction set in the first stage. Second, the cosine curve change is added to develop a new control factor, which slows down the convergence rate of BWOA in the early iteration to fully explore the global, and accelerates the convergence rate in the late iteration. Integrating dependence degree with fuzzy knowledge granularity conditional entropy, a new fitness function is designed for selecting an optimal feature subset in this second stage. Two strategies are fused to avoid BWOA falling into the local optimum: the population partition strategy with the adaptive neighborhood search radius to divide the whale population and the local interference strategy of the elite subgroup to adjust the whale position update. Finally, a two-stage feature selection algorithm is designed, where the Fisher score algorithm is employed to preliminarily delete those redundancy features of high-dimensional datasets. Experiments on six UCI datasets and five gene expression datasets show that our algorithm is valid compared to other related algorithms.
引用
收藏
页码:609 / 631
页数:23
相关论文
共 56 条
[1]   IWOA: An improved whale optimization algorithm for optimization problems [J].
Bozorgi, Seyed Mostafa ;
Yazdani, Samaneh .
JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2019, 6 (03) :243-259
[2]   An enhanced whale optimization algorithm for large scale optimization problems [J].
Chakraborty, Sanjoy ;
Saha, Apu Kumar ;
Chakraborty, Ratul ;
Saha, Moumita .
KNOWLEDGE-BASED SYSTEMS, 2021, 233
[3]   A hybrid whale optimization algorithm for global optimization [J].
Chakraborty, Sanjoy ;
Saha, Apu Kumar ;
Sharma, Sushmita ;
Chakraborty, Ratul ;
Debnath, Sudhan .
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (1) :431-467
[4]   Attribute Reduction for Heterogeneous Data Based on the Combination of Classical and Fuzzy Rough Set Models [J].
Chen, Degang ;
Yang, Yanyan .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2014, 22 (05) :1325-1334
[5]   Feature Subset Selection Based on Variable Precision Neighborhood Rough Sets [J].
Chen, Yingyue ;
Chen, Yumin .
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01) :572-581
[6]   Gene selection for tumor classification using neighborhood rough sets and entropy measures [J].
Chen, Yumin ;
Zhang, Zunjun ;
Zheng, Jianzhong ;
Ma, Ying ;
Xue, Yu .
JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 67 :59-68
[7]   Multigranulation Supertrust Model for Attribute Reduction [J].
Ding, Weiping ;
Pedrycz, Witold ;
Triguero, Isaac ;
Cao, Zehong ;
Lin, Chin-Teng .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2021, 29 (06) :1395-1408
[8]   ESSAWOA: Enhanced Whale Optimization Algorithm integrated with Salp Swarm Algorithm for global optimization [J].
Fan, Qian ;
Chen, Zhenjian ;
Zhang, Wei ;
Fang, Xuhua .
ENGINEERING WITH COMPUTERS, 2022, 38 (SUPPL 1) :797-814
[9]  
[樊鑫 Fan Xin], 2020, [计算机科学, Computer Science], V47, P87
[10]  
[方波 Fang Bo], 2019, [计算机科学, Computer Science], V46, P157