Feature Selection of Microarray Data Using Simulated Kalman Filter with Mutation

Cited by: 5
Authors
Zamri, Nurhawani Ahmad [1]
Aziz, Nor Azlina Ab [1]
Bhuvaneswari, Thangavel [1]
Aziz, Nor Hidayati Abdul [1]
Ghazali, Anith Khairunnisa [1]
Affiliations
[1] Multimedia Univ, Fac Engn & Technol, Melaka 75450, Malaysia
Keywords
feature selection; simulated Kalman filter; microarray data; classification; mutation; GENE-EXPRESSION DATA; CANCER; CLASSIFICATION; PREDICTION; OPTIMIZATION; INTELLIGENCE; DISCOVERY; SYSTEM; TESTS
DOI
10.3390/pr11082409
Chinese Library Classification
TQ [Chemical Industry]
Discipline classification code
0817
Abstract
Microarrays have proven beneficial for understanding the genetics of disease and are used to assess many types of cancer. Machine learning algorithms such as the artificial neural network (ANN) can be trained to determine whether a microarray sample is cancerous. The classification uses the features of DNA microarray data, which comprise thousands of gene values. However, most of these gene values have been shown to be uninformative and redundant, and the number of samples is much smaller than the number of genes. This paper therefore proposes a simulated Kalman filter with mutation (SKF-MUT) for feature selection of microarray data to enhance the classification accuracy of the ANN. SKF is a metaheuristic optimization algorithm inspired by the Kalman filter estimator; the mutation operator is introduced to improve the performance of the original SKF in selecting microarray features. Eight benchmark datasets were used: diffuse large B-cell lymphoma (DLBCL); prostate cancer; lung cancer; leukemia; small round blue cell tumor (SRBCT); brain tumor; nine types of human tumors; and 11 types of human tumors. They include both binary and multiclass datasets. Accuracy, computed from the confusion matrix, is the performance measure. In the reported results, SKF-MUT effectively selected the features needed, yielding classification accuracies ranging from 95% to 100%.
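As a rough illustration of two ideas named in the abstract, the sketch below computes accuracy from a multiclass confusion matrix and applies a generic bit-flip mutation to a binary feature mask. The abstract does not specify the paper's mutation operator or its feature encoding, so `bit_flip_mutation`, the boolean-mask representation, and every parameter name here are assumptions made for illustration only, not the authors' SKF-MUT implementation.

```python
import random
from typing import List, Sequence


def confusion_matrix(y_true: Sequence[int], y_pred: Sequence[int],
                     n_classes: int) -> List[List[int]]:
    """Count samples per (true class, predicted class) pair."""
    matrix = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_pred):
        matrix[t][p] += 1
    return matrix


def accuracy_from_confusion(matrix: Sequence[Sequence[int]]) -> float:
    """Accuracy = correctly classified samples (diagonal) / all samples."""
    total = sum(sum(row) for row in matrix)
    if total == 0:
        raise ValueError("confusion matrix contains no samples")
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return correct / total


def bit_flip_mutation(mask: Sequence[bool], rate: float,
                      rng: random.Random) -> List[bool]:
    """Hypothetical operator: flip each feature bit with probability `rate`.

    This is a generic bit-flip mutation, assumed for illustration; it is
    not taken from the paper.
    """
    return [(not bit) if rng.random() < rate else bit for bit in mask]


# Toy three-class example (made-up labels, not data from the paper).
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
cm = confusion_matrix(y_true, y_pred, n_classes=3)
acc = accuracy_from_confusion(cm)

# Mutate a candidate feature mask (True = gene selected).
rng = random.Random(0)
mask = [True, False, False, True, False]
mutated = bit_flip_mutation(mask, rate=0.2, rng=rng)
```

In a wrapper-style feature selection loop, a mask like `mutated` would pick the gene columns passed to the classifier, and the resulting accuracy would serve as the fitness value; that wiring is likewise an assumption about the general approach rather than a detail stated in the abstract.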
Pages: 15