A new population initialization of metaheuristic algorithms based on hybrid fuzzy rough set for high-dimensional gene data feature selection

被引:7
作者
Guo, Xuanming [1 ]
Hu, Jiao [1 ]
Yu, Helong [2 ]
Wang, Mingjing [3 ]
Yang, Bo [1 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Peoples R China
[2] Jilin Agr Univ, Coll Informat Technol, Changchun 130118, Peoples R China
[3] Wenzhou Univ Technol, Sch Data Sci & Artificial Intelligence, Wenzhou 325000, Peoples R China
基金
中国国家自然科学基金;
关键词
Population initialization; Hybrid fuzzy rough set; Whale optimization algorithm; Gene data feature selection; Multiclass classification; MOTH-FLAME OPTIMIZATION; PARTICLE SWARM OPTIMIZATION; ATTRIBUTE REDUCTION; CLASSIFICATION; INFORMATION; CANCER; DISEASES;
D O I
10.1016/j.compbiomed.2023.107538
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In the realm of modern medicine and biology, vast amounts of genetic data with high complexity are available. However, dealing with such high-dimensional data poses challenges due to increased processing complexity and size. Identifying critical genes to reduce data dimensionality is essential. The filter-wrapper hybrid method is a commonly used approach in feature selection. Most of these methods employ filters such as MRMR and ReliefF, but the performance of these simple filters is limited. Rough set methods, on the other hand, are a type of filter method that outperforms traditional filters. Simultaneously, many studies have pointed out the crucial importance of good initialization strategies for the performance of the metaheuristic algorithm (a type of wrapper-based method). Combining these two points, this paper proposes a novel filter-wrapper hybrid method for high-dimensional feature selection. To be specific, we utilize the variant of bWOA (binary Whale Optimization Algorithm) based on Hybrid Fuzzy Rough Set to perform attribute reduction, and the reduced attributes are used as prior knowledge to initialize the population. We then employ metaheuristics for further feature selection based on this initialized population. We conducted experiments using five different algorithms on 14 UCI datasets. The experiment results show that after applying the initialization method proposed in this article, the performance of five enhanced algorithms, has shown significant improvement. Particularly, the improved bMFO using our initialization method: fuzzy_bMFO outperformed six currently advanced algorithms, indicating that our initialization method for metaheuristic algorithms is suitable for high-dimensional feature selection tasks.
引用
收藏
页数:24
相关论文
共 113 条
[61]  
Maji Pradipta, 2014, Scalable Pattern Recognition Algorithms: Applications in Computational Biology and Bioinformatics, P171
[62]   Simultaneous feature selection and classification using kernel-penalized support vector machines [J].
Maldonado, Sebastian ;
Weber, Richard ;
Basak, Jayanta .
INFORMATION SCIENCES, 2011, 181 (01) :115-128
[63]   The Whale Optimization Algorithm [J].
Mirjalili, Seyedali ;
Lewis, Andrew .
ADVANCES IN ENGINEERING SOFTWARE, 2016, 95 :51-67
[64]   Moth-flame optimization algorithm: A novel nature-inspired heuristic paradigm [J].
Mirjalili, Seyedali .
KNOWLEDGE-BASED SYSTEMS, 2015, 89 :228-249
[65]   The Ant Lion Optimizer [J].
Mirjalili, Seyedali .
ADVANCES IN ENGINEERING SOFTWARE, 2015, 83 :80-98
[66]  
Pant M, 2009, STUD COMPUT INTELL, V203, P101
[67]   Feature analysis through information granulation and fuzzy sets [J].
Pedrycz, W ;
Vukovich, G .
PATTERN RECOGNITION, 2002, 35 (04) :825-834
[68]   Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy [J].
Peng, HC ;
Long, FH ;
Ding, C .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (08) :1226-1238
[69]   A novel optimal bipartite consensus control scheme for unknown multi-agent systems via model-free reinforcement learning [J].
Peng, Zhinan ;
Hu, Jiangping ;
Shi, Kaibo ;
Luo, Rui ;
Huang, Rui ;
Ghosh, Bijoy Kumar ;
Huang, Jiuke .
APPLIED MATHEMATICS AND COMPUTATION, 2020, 369
[70]  
Piatetsky-Shapiro G., 2003, ACM SIGKDD Explor. Newsl., V5, P1, DOI [DOI 10.1145/980972.980974, 10.1145/980972.980974]