Feature Selection of Microarray Data Using Simulated Kalman Filter with Mutation

被引:5
|
作者
Zamri, Nurhawani Ahmad [1 ]
Aziz, Nor Azlina Ab [1 ]
Bhuvaneswari, Thangavel [1 ]
Aziz, Nor Hidayati Abdul [1 ]
Ghazali, Anith Khairunnisa [1 ]
机构
[1] Multimedia Univ, Fac Engn & Technol, Melaka 75450, Malaysia
关键词
feature selection; simulated Kalman filter; microarray data; classification; mutation; GENE-EXPRESSION DATA; CANCER; CLASSIFICATION; PREDICTION; OPTIMIZATION; INTELLIGENCE; DISCOVERY; SYSTEM; TESTS;
D O I
10.3390/pr11082409
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Microarrays have been proven to be beneficial for understanding the genetics of disease. They are used to assess many different types of cancers. Machine learning algorithms, like the artificial neural network (ANN), can be trained to determine whether a microarray sample is cancerous or not. The classification is performed using the features of DNA microarray data, which are composed of thousands of gene values. However, most of the gene values have been proven to be uninformative and redundant. Meanwhile, the number of the samples is significantly smaller in comparison to the number of genes. Therefore, this paper proposed the use of a simulated Kalman filter with mutation (SKF-MUT) for the feature selection of microarray data to enhance the classification accuracy of ANN. The algorithm is based on a metaheuristics optimization algorithm, inspired by the famous Kalman filter estimator. The mutation operator is proposed to enhance the performance of the original SKF in the selection of microarray features. Eight different benchmark datasets were used, which comprised: diffuse large b-cell lymphomas (DLBCL); prostate cancer; lung cancer; leukemia cancer; "small, round blue cell tumor" (SRBCT); brain tumor; nine types of human tumors; and 11 types of human tumors. These consist of both binary and multiclass datasets. The accuracy is taken as the performance measurement by considering the confusion matrix. Based on the results, SKF-MUT effectively selected the number of features needed, leading toward a higher classification accuracy ranging from 95% to 100%.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] A New Evolutionary Ensemble Learning of Multimodal Feature Selection from Microarray Data
    Nekouie, Nadia
    Romoozi, Morteza
    Esmaeili, Mahdi
    NEURAL PROCESSING LETTERS, 2023, 55 (05) : 6753 - 6780
  • [2] Feature selection using the Kalman filter for classification of multivariate data
    Wu, W
    Rutan, SC
    Baldovin, A
    Massart, DL
    ANALYTICA CHIMICA ACTA, 1996, 335 (1-2) : 11 - 22
  • [3] A New Filter Feature Selection Based on Criteria Fusion for Gene Microarray Data
    Ke, Wenjun
    Wu, Chunxue
    Wu, Yan
    Xiong, Neal N.
    IEEE ACCESS, 2018, 6 : 61065 - 61076
  • [4] A Comprehensive Survey of Recent Hybrid Feature Selection Methods in Cancer Microarray Gene Expression Data
    Almazrua, Halah
    Alshamlan, Hala
    IEEE ACCESS, 2022, 10 : 71427 - 71449
  • [5] Multimodal feature selection from microarray data based on Dempster-Shafer evidence fusion
    Nekouie, Nadia
    Romoozi, Morteza
    Esmaeili, Mahdi
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (11) : 12591 - 12621
  • [6] A Robust and Efficient Feature Selection Algorithm for Microarray Data
    Bari, Mehrab Ghanat
    Salekin, Sirajul
    Zhang, Jianqiu
    MOLECULAR INFORMATICS, 2017, 36 (04)
  • [7] Feature Selection Using Simulated Kalman Filter (SKF) for Prediction of Body Fat Percentage
    Zamri, Nurhawani Bt Ahmad
    Bhuvaneswari, Thangavel
    Ab Aziz, Nor Azlina Bt
    Aziz, Nor Hidayati Bt Abdul
    ICOMS 2018: 2018 INTERNATIONAL CONFERENCE ON MATHEMATICS AND STATISTICS, 2018, : 23 - 27
  • [8] Feature Selection using Binary Simulated Kalman Filter for Peak Classification of EEG Signals
    Muhammad, Badaruddin
    Jusof, Mohd Falfazli Mat
    Shapiai, Mohd Ibrahim
    Adam, Asrul
    Yusof, Zulkifli Md
    Azmi, Kamil Zakwan Mohd
    Aziz, Nor Hidayati Abdul
    Ibrahim, Zuwairie
    Mokhtar, Norrima
    2018 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, MODELLING AND SIMULATION (ISMS), 2018, : 1 - 6
  • [9] A hybrid feature selection algorithm for microarray data
    Zheng, Yuefeng
    Li, Ying
    Wang, Gang
    Chen, Yupeng
    Xu, Qian
    Fan, Jiahao
    Cui, Xueting
    JOURNAL OF SUPERCOMPUTING, 2020, 76 (05) : 3494 - 3526
  • [10] A novel filter feature selection method for paired microarray expression data analysis
    Cao, Zhongbo
    Wang, Yan
    Sun, Ying
    Du, Wei
    Liang, Yanchun
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 12 (04) : 363 - 386