Multi-strategy fusion binary SHO guided by Pearson correlation coefficient for feature selection with cancer gene expression data

被引:0
作者
Wang, Yu-Cai [1 ]
Song, Hao-Ming [1 ]
Wang, Jie-Sheng [1 ]
Ma, Xin-Ru [1 ]
Song, Yu-Wei [1 ]
Qi, Yu-Liang [1 ]
机构
[1] Univ Sci & Technol Liaoning, Sch Elect & Informat Engn, Anshan, Liaoning, Peoples R China
关键词
Feature selection; Cancer gene expression; Sea-horse optimizer; Multi-strategy fusion; Pearson correlation coefficient; WRAPPER; ALGORITHMS;
D O I
10.1016/j.eij.2025.100639
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cancer gene expression data is extensively utilized to address the challenges of cancer subtype diagnosis. However, this data is often characterized by high-dimensional, multi-text and multi-classification, which requires an effective feature selection (FS) method. A multi-strategy fusion binary sea-horse optimizer guided by Pearson correlation coefficient was proposed for FS with cancer gene expression data. For the multi-strategy fusion, the rest strategy is introduced in the sea-horse motor behavior stage. Subsequently, a search strategy based on symbiotic organisms of sea horses is designed for the predation stage. Finally, the elementary function dynamic weight strategy is proposed. Multi-strategy fusion enables the sea-horse optimizer (SHO) to perform dynamic exploitation and exploration in the early stage of iteration, expand the search scope initially, and narrow the search scope in the middle and later stages of the algorithm, so as to avoid the algorithm falling into the local optimal and increase the possibility of the algorithm jumping out of the local optimal, and avoid the blind search caused by elite influence. In the FS part, Pearson correlation coefficient guided strategy is proposed firstly to add or delete features. Then eight binary algorithms are derived from S-type and V-type transfer functions. The simulation experiment was divided into four parts. Firstly, the CEC-2022 test functions were used to test the performance of the multi-strategy fusion SHO, from which the best variant TanASSHO was selected, and then compared with other nine swarm intelligent optimization algorithms. Performance tests of various algorithm variants on 18 UCI datasets show that V1PTASSHO is the most effective binary version. Finally, V1PTASSHO was compared with other nine swarm intelligent optimization algorithms on 18 cancer gene expression datasets. The results demonstrate that V1PTASSHO effectively reduces feature subsets, improve classification accuracy and obtain lower fitness value. Friedman test and Wilcoxon rank sum test were used for statistical analysis to verify the effectiveness of the proposed algorithm.
引用
收藏
页数:34
相关论文
共 61 条
  • [1] Abdollahzadeh Benyamin, 2024, Clust. Comput.
  • [2] Abinash M. J., 2018, Intelligent Engineering Informatics. Proceedings of the 6th International Conference on FICTA. Advances in Intelligent Systems and Computing (AISC 695), P311, DOI 10.1007/978-981-10-7566-7_31
  • [3] Reptile Search Algorithm (RSA): A nature-inspired meta-heuristic optimizer
    Abualigah, Laith
    Abd Elaziz, Mohamed
    Sumari, Putra
    Geem, Zong Woo
    Gandomi, Amir H.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 191
  • [4] The Arithmetic Optimization Algorithm
    Abualigah, Laith
    Diabat, Ali
    Mirjalili, Seyedali
    Elaziz, Mohamed Abd
    Gandomi, Amir H.
    [J]. COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2021, 376
  • [5] A genetic algorithm-based multi-objective optimization of an artificial neural network classifier for breast cancer diagnosis
    Ahmad, Fadzil
    Isa, Nor Ashidi Mat
    Hussain, Zakaria
    Sulaiman, Siti Noraini
    [J]. NEURAL COMPUTING & APPLICATIONS, 2013, 23 (05) : 1427 - 1435
  • [6] Ahrari A., 2022, P IEEE WORLD C COMP, P18
  • [7] Alipio M., 2020, THE J, DOI 10.2139/ssrn.3571484
  • [8] A wrapper-based feature selection approach to investigate potential biomarkers for early detection of breast cancer
    Alnowami, Majdi R.
    Abolaban, Fouad A.
    Taha, Eslam
    [J]. JOURNAL OF RADIATION RESEARCH AND APPLIED SCIENCES, 2022, 15 (01) : 104 - 110
  • [9] AN INTRODUCTION TO KERNEL AND NEAREST-NEIGHBOR NONPARAMETRIC REGRESSION
    ALTMAN, NS
    [J]. AMERICAN STATISTICIAN, 1992, 46 (03) : 175 - 185
  • [10] Hippopotamus optimization algorithm: a novel nature-inspired optimization algorithm
    Amiri, Mohammad Hussein
    Hashjin, Nastaran Mehrabi
    Montazeri, Mohsen
    Mirjalili, Seyedali
    Khodadadi, Nima
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01)