Improved intelligent water drop-based hybrid feature selection method for data

Cited by: 12
Authors
Alhenawi, Esra'a [1, 2]
Al-Sayyed, Rizik [2]
Hudaib, Amjad [2]
Mirjalili, Seyedali [3, 4]
Affiliations
[1] Al Ahliyya Amman Univ, Software Engn Dept, Amman, Jordan
[2] Univ Jordan, King Abdullah II School Informat Technol, Amman, Jordan
[3] Torrens Univ Australia, Ctr Artificial Intelligence Res & Optimizat, Fortitude Valley, Brisbane, Qld 4006, Australia
[4] Obuda Univ, Univ Res & Innovat Ctr, Budapest, Hungary
Keywords
Machine learning; Intelligent water drop algorithm; Hybrid feature selection; High dimensional datasets; Medical applications; OPTIMIZATION; ALGORITHM; ENSEMBLE; SEARCH
DOI
10.1016/j.compbiolchem.2022.107809
Chinese Library Classification (CLC) number
Q [Biological sciences]
Discipline classification code
07; 0710; 09
Abstract
Classifying microarray datasets, which usually contain many noisy genes that degrade classifier performance and lower classification accuracy, is a competitive research topic. Feature selection (FS) is one of the most practical ways to find an optimal subset of genes that increases classification accuracy for diagnostic and prognostic prediction of cancer from microarray datasets. There is therefore a continuing need for more efficient FS methods that select only an optimal or close-to-optimal subset of features to improve classification performance. In this paper, we propose a hybrid FS method for microarray data processing that combines an ensemble filter with an Improved Intelligent Water Drop (IIWD) algorithm as a wrapper. The IIWD algorithm modifies the original IWD algorithm in two ways: it adds one of three local search (LS) algorithms, Tabu search (TS), Novel LS algorithm (NLSA), or Hill Climbing (HC), to each IWD iteration, and it uses a correlation coefficient filter as the heuristic undesirability (HUD) for next-node selection. The effect of adding each LS algorithm is evaluated by comparing the proposed ensemble filter-IIWD wrapper without any LS algorithm, named PHFS-IWD, against the variants that add TS, NLSA, or HC, named PHFS-IWDTS, PHFS-IWDNLSA, and PHFS-IWDHC, respectively. A Naive Bayes (NB) classifier and five microarray datasets are used to evaluate and compare the proposed hybrid FS methods. Results show that using an LS algorithm in each iteration of the IWD algorithm improves the F-score by an average of 5% compared with PHFS-IWD. In addition, PHFS-IWDNLSA improves the F-score by an average of 4.15% over PHFS-IWDTS and 5.67% over PHFS-IWDHC, while PHFS-IWDTS outperforms PHFS-IWDHC by an average of 1.6%. Furthermore, compared with six of the most recent state-of-the-art FS methods, the proposed hybrid FS methods improve accuracy by an average of 8.92% on three of the five datasets and reduce the number of genes by 58.5% on all five datasets.
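The abstract names two ingredients that are easy to picture in code: a correlation-based heuristic undesirability (HUD) for choosing the next feature, and a hill-climbing local search that refines a candidate feature subset. The following Python sketch illustrates both ideas under stated assumptions and is not the authors' implementation. The abstract gives no formulas, so the HUD definition used here (mean absolute Pearson correlation with the already selected features, plus a small constant) is an assumption, and `toy_fitness` is a hypothetical stand-in for the classifier-based score (for example, an NB F-score) that a real wrapper would compute.

```python
from statistics import mean


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0  # a constant column carries no correlation signal
    return cov / (vx * vy) ** 0.5


def hud(candidate, selected, columns, eps=1e-3):
    """Assumed HUD: mean |correlation| between the candidate feature and
    the features already selected. Higher means more redundant."""
    if not selected:
        return eps
    corrs = [abs(pearson(columns[candidate], columns[j])) for j in selected]
    return eps + mean(corrs)


def hill_climb(mask, fitness, max_passes=10):
    """Flip one bit of the feature mask at a time and keep a flip only
    if it strictly improves the fitness; stop when a pass finds nothing."""
    best = list(mask)
    best_fit = fitness(best)
    for _ in range(max_passes):
        improved = False
        for i in range(len(best)):
            best[i] = 1 - best[i]          # try toggling feature i
            score = fitness(best)
            if score > best_fit:
                best_fit = score
                improved = True
            else:
                best[i] = 1 - best[i]      # revert the flip
        if not improved:
            break
    return best, best_fit


def toy_fitness(mask):
    """Hypothetical fitness: features 0 and 2 are 'informative', and each
    selected feature costs 0.1. A real wrapper would score a classifier."""
    return sum(mask[i] for i in (0, 2)) - 0.1 * sum(mask)


if __name__ == "__main__":
    columns = {0: [1, 2, 3, 4], 1: [2, 4, 6, 8], 2: [1, -1, 1, -1]}
    print("HUD of feature 1 given {0}:", hud(1, [0], columns))
    print("HUD of feature 2 given {0}:", hud(2, [0], columns))
    print("Refined subset:", hill_climb([0, 1, 0, 1], toy_fitness))
```

In this sketch a feature that duplicates an already selected one receives a higher HUD and would be less likely to be chosen next, while the hill-climbing step plays the role of the per-iteration LS refinement described for PHFS-IWDHC.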
Pages: 14