A new population initialization of metaheuristic algorithms based on hybrid fuzzy rough set for high-dimensional gene data feature selection

Cited by: 7
Authors
Guo, Xuanming [1]
Hu, Jiao [1]
Yu, Helong [2]
Wang, Mingjing [3]
Yang, Bo [1]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Peoples R China
[2] Jilin Agr Univ, Coll Informat Technol, Changchun 130118, Peoples R China
[3] Wenzhou Univ Technol, Sch Data Sci & Artificial Intelligence, Wenzhou 325000, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Population initialization; Hybrid fuzzy rough set; Whale optimization algorithm; Gene data feature selection; Multiclass classification; MOTH-FLAME OPTIMIZATION; PARTICLE SWARM OPTIMIZATION; ATTRIBUTE REDUCTION; CLASSIFICATION; INFORMATION; CANCER; DISEASES;
DOI
10.1016/j.compbiomed.2023.107538
Chinese Library Classification
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Modern medicine and biology generate vast amounts of highly complex genetic data. The high dimensionality of such data increases processing cost and complexity, so identifying critical genes to reduce dimensionality is essential. Filter-wrapper hybrid methods are commonly used for feature selection. Most of them rely on simple filters such as mRMR and ReliefF, whose performance is limited. Rough set methods are a class of filter methods that outperform these traditional filters. At the same time, many studies have shown that a good initialization strategy is crucial to the performance of metaheuristic algorithms (a type of wrapper-based method). Combining these two observations, this paper proposes a novel filter-wrapper hybrid method for high-dimensional feature selection. Specifically, a variant of bWOA (binary Whale Optimization Algorithm) based on a Hybrid Fuzzy Rough Set performs attribute reduction, and the reduced attributes serve as prior knowledge to initialize the population. Metaheuristics then perform further feature selection starting from this initialized population. We conducted experiments using five different algorithms on 14 UCI datasets. The results show that the performance of all five algorithms improved significantly once they were enhanced with the proposed initialization method. In particular, fuzzy_bMFO, the bMFO variant that uses our initialization method, outperformed six advanced algorithms, indicating that the proposed initialization method for metaheuristic algorithms is suitable for high-dimensional feature selection tasks.
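The abstract does not spell out how the reduced attributes seed the population, so the following is only a minimal sketch of one plausible reading: features kept by the attribute-reduction step are switched on with a high probability, and all other features with a low one. The function name `init_population`, the parameters `p_prior` and `p_other`, and their default values are illustrative assumptions, not the authors' scheme.

```python
import numpy as np

def init_population(n_agents, n_features, prior_idx,
                    p_prior=0.9, p_other=0.05, seed=0):
    """Build a binary population biased toward a prior feature subset.

    Assumption: `prior_idx` holds the indices of the features retained by
    a filter step (for example, a rough-set attribute reduction). The
    probabilities p_prior and p_other are hypothetical, not from the paper.
    """
    rng = np.random.default_rng(seed)
    # Per-feature probability of being selected in an initial agent.
    probs = np.full(n_features, p_other, dtype=float)
    probs[list(prior_idx)] = p_prior
    # Each row is one agent; 1 means the feature is selected.
    draws = rng.random((n_agents, n_features))
    return (draws < probs).astype(np.int8)

# Example: 10 agents over 1000 genes, with 3 genes kept by the filter.
population = init_population(10, 1000, prior_idx=[4, 17, 256])
```

Any binary metaheuristic (bWOA, bMFO, and so on) could then start its search from `population` instead of a uniformly random one.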
Pages: 24