Using a genetic algorithm and a perceptron for feature selection and supervised class learning in DNA microarray data

Cited by: 20
Authors
Karzynski, M [1]
Mateos, A [1]
Herrero, J [1]
Dopazo, J [1]
Affiliation
[1] Ctr Nacl Invest Oncol, Bioinformat Unit, Madrid 28029, Spain
Keywords
clustering; dimensionality reduction; feature selection; gene expression; genetic algorithm; perceptron; SOTA; weights; GROWING NEURAL-NETWORK; CLUSTERING ANALYSIS; EXPRESSION DATA; CLASSIFICATION; PREDICTION; CANCER;
DOI
10.1023/A:1026032530166
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Class prediction and feature selection are key in the context of diagnostic applications of DNA microarrays. Microarray data are noisy and typically comprise a small number of samples and a large number of genes. Perceptrons can be an efficient tool for accurate classification of microarray data. Nevertheless, the large input layers required for the direct application of perceptrons and the small number of samples available for training hamper their use. Two strategies allow a perceptron to be used with a favourable balance between the number of training samples and the size of the input layer: (a) reducing the dimensionality of the data set from thousands of genes to no more than one hundred highly informative average values, and using the weights of the perceptron for feature selection; or (b) using a selection of only a few genes that produce an optimal classification with the perceptron. In the latter case, feature selection is carried out first. A combined approach is also possible. In this manuscript we explore and compare both alternatives. We study the information content of the data at different levels of compression with a very efficient clustering algorithm, the Self-Organizing Tree Algorithm (SOTA). We show how a simple genetic algorithm selects a subset of gene expression values that achieves 100% accuracy in sample classification with maximum efficiency. Finally, the importance of dimensionality reduction is discussed in light of its capacity for reducing noise and redundancies in microarray data.
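Strategy (b) in the abstract, a genetic algorithm that searches for a small gene subset scored by a perceptron, can be illustrated with a minimal sketch. This is not the authors' implementation: the toy data, every function name, the GA operators (truncation selection, union crossover, single-gene swap mutation), and all parameter values are assumptions chosen only to keep the example short and self-contained.

```python
import random

def train_perceptron(X, y, epochs=100):
    """Classic perceptron rule for labels in {-1, +1}; returns (weights, bias)."""
    w, b = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        mistakes = 0
        for xi, yi in zip(X, y):
            if yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b) <= 0:
                w = [wj + yi * xj for wj, xj in zip(w, xi)]
                b += yi
                mistakes += 1
        if mistakes == 0:  # training set is separated, stop early
            break
    return w, b

def accuracy(w, b, X, y):
    """Fraction of samples placed on the correct side of the hyperplane."""
    hits = sum(yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b) > 0
               for xi, yi in zip(X, y))
    return hits / len(y)

def fitness(genes, train, valid):
    """Validation accuracy of a perceptron that sees only the chosen genes."""
    pick = lambda X: [[row[g] for g in genes] for row in X]
    (X_tr, y_tr), (X_va, y_va) = train, valid
    w, b = train_perceptron(pick(X_tr), y_tr)
    return accuracy(w, b, pick(X_va), y_va)

def crossover(a, b, k, rng):
    """Child draws k distinct genes from the union of both parents."""
    return rng.sample(sorted(set(a) | set(b)), k)

def mutate(genes, n_genes, rng):
    """Swap one selected gene for a gene not currently in the subset."""
    child = list(genes)
    unused = [g for g in range(n_genes) if g not in child]
    child[rng.randrange(len(child))] = rng.choice(unused)
    return child

def ga_select(train, valid, n_genes, k=3, pop_size=20, generations=15, seed=0):
    """Evolve gene subsets of size k; returns (best subset, its fitness)."""
    rng = random.Random(seed)
    pop = [rng.sample(range(n_genes), k) for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(pop, key=lambda g: fitness(g, train, valid), reverse=True)
        elite = ranked[: pop_size // 2]  # truncation selection; the best survive
        children = [
            mutate(crossover(rng.choice(elite), rng.choice(elite), k, rng), n_genes, rng)
            for _ in range(pop_size - len(elite))
        ]
        pop = elite + children
    best = max(pop, key=lambda g: fitness(g, train, valid))
    return best, fitness(best, train, valid)

def make_toy_data(n_samples, n_genes, informative, rng):
    """Hypothetical data: one gene tracks the class label, the rest are noise."""
    X, y = [], []
    for i in range(n_samples):
        label = 1 if i % 2 == 0 else -1
        row = [rng.gauss(0.0, 1.0) for _ in range(n_genes)]
        row[informative] = 2.0 * label + rng.gauss(0.0, 0.3)
        X.append(row)
        y.append(label)
    return X, y

data_rng = random.Random(1)
train = make_toy_data(20, 30, informative=5, rng=data_rng)
valid = make_toy_data(10, 30, informative=5, rng=data_rng)
genes, acc = ga_select(train, valid, n_genes=30)
print("selected genes:", genes, "validation accuracy:", acc)
```

Scoring subsets on a single validation split, as this sketch does, tends to give optimistic accuracy estimates when samples are few; a real analysis would more likely use cross-validation inside the fitness function and a separate test set for the final estimate.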
Pages: 39-51
Number of pages: 13
Source: Artificial Intelligence Review, 2003, 20: 39-51