Feature Selection of Microarray Data Using Simulated Kalman Filter with Mutation

被引:5
|
作者
Zamri, Nurhawani Ahmad [1 ]
Aziz, Nor Azlina Ab [1 ]
Bhuvaneswari, Thangavel [1 ]
Aziz, Nor Hidayati Abdul [1 ]
Ghazali, Anith Khairunnisa [1 ]
机构
[1] Multimedia Univ, Fac Engn & Technol, Melaka 75450, Malaysia
关键词
feature selection; simulated Kalman filter; microarray data; classification; mutation; GENE-EXPRESSION DATA; CANCER; CLASSIFICATION; PREDICTION; OPTIMIZATION; INTELLIGENCE; DISCOVERY; SYSTEM; TESTS;
D O I
10.3390/pr11082409
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Microarrays have been proven to be beneficial for understanding the genetics of disease. They are used to assess many different types of cancers. Machine learning algorithms, like the artificial neural network (ANN), can be trained to determine whether a microarray sample is cancerous or not. The classification is performed using the features of DNA microarray data, which are composed of thousands of gene values. However, most of the gene values have been proven to be uninformative and redundant. Meanwhile, the number of the samples is significantly smaller in comparison to the number of genes. Therefore, this paper proposed the use of a simulated Kalman filter with mutation (SKF-MUT) for the feature selection of microarray data to enhance the classification accuracy of ANN. The algorithm is based on a metaheuristics optimization algorithm, inspired by the famous Kalman filter estimator. The mutation operator is proposed to enhance the performance of the original SKF in the selection of microarray features. Eight different benchmark datasets were used, which comprised: diffuse large b-cell lymphomas (DLBCL); prostate cancer; lung cancer; leukemia cancer; "small, round blue cell tumor" (SRBCT); brain tumor; nine types of human tumors; and 11 types of human tumors. These consist of both binary and multiclass datasets. The accuracy is taken as the performance measurement by considering the confusion matrix. Based on the results, SKF-MUT effectively selected the number of features needed, leading toward a higher classification accuracy ranging from 95% to 100%.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Using a genetic algorithm and a perceptron for feature selection and supervised class learning in DNA microarray data
    Karzynski, M
    Mateos, A
    Herrero, J
    Dopazo, J
    ARTIFICIAL INTELLIGENCE REVIEW, 2003, 20 (1-2) : 39 - 51
  • [22] A Filter Based Feature Selection Algorithm Using Null Space of Covariance Matrix for DNA Microarray Gene Expression Data
    Sharma, Alok
    Imoto, Seiya
    Miyano, Satoru
    CURRENT BIOINFORMATICS, 2012, 7 (03) : 289 - 294
  • [23] Metaheuristic approach for an enhanced mRMR filter method for classification using drug response microarray data
    Mohamed, Nur Shazila
    Zainudin, Suhaila
    Othman, Zulaiha Ali
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 90 : 224 - 231
  • [24] Linear regression-based feature selection for microarray data classification
    Hasan, Md Abid
    Hasan, Md Kamrul
    Mottalib, M. Abdul
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 11 (02) : 167 - 179
  • [25] Comparing Multiobjective Evolutionary Algorithms for Cancer Data Microarray Feature Selection
    Sol Dussaut, Julieta
    Javier Vidal, Pablo
    Ponzoni, Ignacio
    Carolina Olivera, Ana
    2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 149 - 156
  • [26] Prominent feature selection of microarray data
    Liu, Yihui
    PROGRESS IN NATURAL SCIENCE-MATERIALS INTERNATIONAL, 2009, 19 (10) : 1365 - 1371
  • [27] Feature Selection and Classification of Microarray Data using MapReduce based ANOVA and K-Nearest Neighbor
    Kumar, Mukesh
    Rath, Nitish Kumar
    Swain, Amitav
    Rath, Santanu Kumar
    ELEVENTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2015/INDIA ELEVENTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2015/NDIA ELEVENTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2015, 2015, 54 : 301 - 310
  • [28] A hybrid feature selection method for DNA microarray data
    Chuang, Li-Yeh
    Yang, Cheng-Huei
    Wu, Kuo-Chuan
    Yang, Cheng-Hong
    COMPUTERS IN BIOLOGY AND MEDICINE, 2011, 41 (04) : 228 - 237
  • [29] Feature selection using differential evolution for microarray data classification
    Prajapati S.
    Das H.
    Gourisaria M.K.
    Discover Internet of Things, 2023, 3 (01):
  • [30] A Top-r Feature Selection Algorithm for Microarray Gene Expression Data
    Sharma, Alok
    Imoto, Seiya
    Miyano, Satoru
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (03) : 754 - 764