Improved intelligent water drop-based hybrid feature selection method for data

Cited by: 12
Authors
Alhenawi, Esra'a [1, 2]
Al-Sayyed, Rizik [2 ]
Hudaib, Amjad [2 ]
Mirjalili, Seyedali [3, 4]
Affiliations
[1] Al Ahliyya Amman Univ, Software Engn Dept, Amman, Jordan
[2] Univ Jordan, King Abdullah II Sch Informat Technol, Amman, Jordan
[3] Torrens Univ Australia, Ctr Artificial Intelligence Res & Optimizat, Fortitude Valley, Brisbane, Qld 4006, Australia
[4] Univ Res, Obuda Univ, Innovat Ctr, Budapest, Hungary
Keywords
Machine learning; Intelligent water drop algorithm; Hybrid feature selection; High dimensional datasets; Medical applications; OPTIMIZATION; ALGORITHM; ENSEMBLE; SEARCH;
DOI
10.1016/j.compbiolchem.2022.107809
Chinese Library Classification number
Q [Biological Sciences];
Subject classification code
07 ; 0710 ; 09 ;
Abstract
Classifying microarray datasets, which usually contain many noisy genes that degrade classifier performance and lower classification accuracy, is a competitive research topic. Feature selection (FS) is one of the most practical ways of finding an optimal subset of genes that increases classification accuracy for diagnostic and prognostic prediction of tumors from microarray datasets. Consequently, there is a continuing need for more efficient FS methods that select only an optimal or close-to-optimal subset of features to improve classification performance. In this paper, we propose a hybrid FS method for microarray data processing that combines an ensemble filter with an Improved Intelligent Water Drop (IIWD) algorithm as a wrapper. The IIWD algorithm adds one of three local search (LS) algorithms, Tabu Search (TS), a Novel LS Algorithm (NLSA), or Hill Climbing (HC), to each iteration of IWD, and uses a correlation coefficient filter as the heuristic undesirability (HUD) for next-node selection in the original IWD algorithm. The effect of adding each of the three LS algorithms to the proposed IIWD algorithm was evaluated by comparing the performance of the proposed ensemble filter and IIWD-based wrapper without any LS algorithm (named PHFS-IWD) against its performance with TS, NLSA, or HC added (named PHFS-IWDTS, PHFS-IWDNLSA, and PHFS-IWDHC, respectively). A Naive Bayes (NB) classifier and five microarray datasets were used to evaluate and compare the proposed hybrid FS methods. Results show that using LS algorithms in each iteration of the IWD algorithm improves the F-score by an average of 5% compared with PHFS-IWD. Also, PHFS-IWDNLSA improves the F-score by an average of 4.15% over PHFS-IWDTS and 5.67% over PHFS-IWDHC, while PHFS-IWDTS outperforms PHFS-IWDHC by an average of 1.6%.
In addition, the proposed hybrid FS methods improve accuracy by an average of 8.92% on three of the five datasets and reduce the number of genes by 58.5% on all five datasets compared with six recent state-of-the-art FS methods.
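The abstract names Hill Climbing as one of the local search steps applied in each IWD iteration. As a rough illustration only, and not the authors' implementation, the sketch below refines a feature subset by greedy single-feature flips. The names `hill_climb` and `toy_fitness` are hypothetical, and the toy fitness is an assumed stand-in for a classifier-based score such as the NB F-score.

```python
from typing import Callable, FrozenSet, Iterable


def hill_climb(
    start: Iterable[int],
    n_features: int,
    fitness: Callable[[FrozenSet[int]], float],
    max_steps: int = 100,
) -> FrozenSet[int]:
    """Greedy hill climbing over feature subsets (illustrative sketch)."""
    current = frozenset(start)
    current_score = fitness(current)
    for _ in range(max_steps):
        best_neighbor, best_score = current, current_score
        # A neighbor differs from the current subset by exactly one feature:
        # the symmetric difference adds f if absent and removes it if present.
        for f in range(n_features):
            neighbor = current ^ {f}
            score = fitness(neighbor)
            if score > best_score:
                best_neighbor, best_score = neighbor, score
        if best_neighbor == current:
            break  # no neighbor improves the score: local optimum reached
        current, current_score = best_neighbor, best_score
    return current


def toy_fitness(subset: FrozenSet[int]) -> float:
    # Assumption for illustration: features 0 and 3 are the informative ones,
    # and every selected feature carries a small cost, so smaller subsets win.
    informative = {0, 3}
    return len(subset & informative) - 0.1 * len(subset)


refined = hill_climb({1, 2}, n_features=5, fitness=toy_fitness)
print(sorted(refined))
```

In a wrapper setting, the subset produced by one IWD iteration would presumably play the role of `start`, and `fitness` would train and score a classifier on the selected genes; both of those connections are assumptions here rather than details taken from the paper.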
Pages: 14