Null space based feature selection method for gene expression data

被引:61
作者
Sharma, Alok [1 ,2 ]
Imoto, Seiya [1 ]
Miyano, Satoru [1 ]
Sharma, Vandana [3 ]
机构
[1] Univ Tokyo, Ctr Human Genome, Inst Med Sci, Lab DNA Informat Anal,Minato Ku, Tokyo 1088639, Japan
[2] Univ S Pacific, Sch Phys & Engn, Suva, Fiji
[3] CWM Hosp, Suva, Fiji
关键词
Feature selection; Null space; DNA microarray gene expression data; Classification accuracy; Biological significance; CLASSIFICATION; CANCER; PREDICTION; ALGORITHM;
D O I
10.1007/s13042-011-0061-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is quite an important process in gene expression data analysis. Feature selection methods discard unimportant genes from several thousands of genes for finding important genes or pathways for the target biological phenomenon like cancer. The obtained gene subset is used for statistical analysis for prediction such as survival as well as functional analysis for understanding biological characteristics. In this paper we propose a null space based feature selection method for gene expression data in terms of supervised classification. The proposed method discards the redundant genes by applying the information of null space of scatter matrices. We derive the method theoretically and demonstrate its effectiveness on several DNA gene expression datasets. The method is easy to implement and computationally efficient.
引用
收藏
页码:269 / 276
页数:8
相关论文
共 30 条
  • [1] Arif Muhammad, 2010, Journal of Biomedical Science & Engineering, V3, P380, DOI 10.4236/jbise.2010.34053
  • [2] MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia
    Armstrong, SA
    Staunton, JE
    Silverman, LB
    Pieters, R
    de Boer, ML
    Minden, MD
    Sallan, SE
    Lander, ES
    Golub, TR
    Korsmeyer, SJ
    [J]. NATURE GENETICS, 2002, 30 (01) : 41 - 47
  • [3] Evolutionary rough feature selection in gene expression data
    Banerjee, Mohua
    Mitra, Sushmita
    Banka, Haider
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2007, 37 (04): : 622 - 632
  • [4] Classifying cognitive states of brain activity via one-class neural networks with feature selection by genetic algorithms
    Boehm, Omer
    Hardoon, David R.
    Manevitz, Larry M.
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2011, 2 (03) : 125 - 134
  • [5] A new LDA-based face recognition system which can solve the small sample size problem
    Chen, LF
    Liao, HYM
    Ko, MT
    Lin, JC
    Yu, GJ
    [J]. PATTERN RECOGNITION, 2000, 33 (10) : 1713 - 1726
  • [6] Cong G., 2005, P 2005 ACM SIGMOD IN, P670, DOI DOI 10.1145/1066157.1066234
  • [7] First InP/InGaAs PNPHBT grown by metal organic chemical vapor deposition
    Cui, DL
    Hsu, S
    Pavlidis, D
    [J]. 2001 INTERNATIONAL CONFERENCE ON INDIUM PHOSPHIDE AND RELATED MATERIALS, CONFERENCE PROCEEDINGS, 2001, : 224 - 227
  • [8] Duda R. O., 1973, Pattern Classification and Scene Analysis, V3
  • [9] Comparison of discrimination methods for the classification of tumors using gene expression data
    Dudoit, S
    Fridlyand, J
    Speed, TP
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) : 77 - 87
  • [10] Sensitivity Versus Accuracy in Multiclass Problems Using Memetic Pareto Evolutionary Neural Networks
    Fernandez Caballero, Juan Carlos
    Jose Martinez, Francisco
    Hervas, Cesar
    Antonio Gutierrez, Pedro
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2010, 21 (05): : 750 - 770