MaskedPainter: Feature selection for microarray data analysis

被引:11
|
作者
Apiletti, Daniele [1 ]
Baralis, Elena [1 ]
Bruno, Giulia [1 ]
Fiori, Alessandro [1 ]
机构
[1] Politecn Torino, Dipartimento Automat & Informat, I-10129 Turin, Italy
关键词
Feature selection; microarray analysis; tumor classification; data mining; GENE SELECTION; COLON-CANCER; CLASSIFICATION; EXPRESSION; PREDICTION;
D O I
10.3233/IDA-2012-0546
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Selecting a small number of discriminative genes from thousands is a fundamental task in microarray data analysis. An effective feature selection allows biologists to investigate only a subset of genes instead of the entire set, thus avoiding insignificant, noisy, and redundant features. This paper presents the Masked Painter feature selection method for gene expression data. The proposed method measures the ability of each gene to classify samples belonging to different classes and ranks genes by computing an overlap score. A density based technique is exploited to smooth the effects of outliers in the overlap score computation. Analogously to other approaches, the number of selected genes can be set by the user. However, our algorithm may automatically detect the minimum set of genes that yields the best classification coverage of training set samples. The effectiveness of our approach has been demonstrated through an empirical study on public microarray datasets with different characteristics. Experimental results show that the proposed approach yields a higher classification accuracy with respect to widely used feature selection techniques.
引用
收藏
页码:717 / 737
页数:21
相关论文
共 50 条
  • [41] Feature selection in independent component subspace for microarray data classification
    Zheng, Chun-Hou
    huang, De-S Huang
    Shang, Li
    NEUROCOMPUTING, 2006, 69 (16-18) : 2407 - 2410
  • [42] Feature selection using differential evolution for microarray data classification
    Prajapati S.
    Das H.
    Gourisaria M.K.
    Discover Internet of Things, 2023, 3 (01):
  • [43] A hybrid feature selection approach for microarray gene expression data
    Tan, Feng
    Fu, Xuezheng
    Wang, Hao
    Zhang, Yanqing
    Bourgeois, Anu
    COMPUTATIONAL SCIENCE - ICCS 2006, PT 2, PROCEEDINGS, 2006, 3992 : 678 - 685
  • [44] Robust Feature Selection for Microarray Data Based on Multicriterion Fusion
    Yang, Feng
    Mao, K. Z.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2011, 8 (04) : 1080 - 1092
  • [45] Feature Selection Using Counting Grids: Application to Microarray Data
    Lovato, Pietro
    Bicego, Manuele
    Cristani, Marco
    Jojic, Nebojsa
    Perina, Alessandro
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2012, 7626 : 629 - 637
  • [46] An enhanced feature selection filter for classification of microarray cancer data
    Mazumder, Dilwar Hussain
    Veilumuthu, Ramachandran
    ETRI JOURNAL, 2019, 41 (03) : 358 - 370
  • [47] Parallel classification and feature selection in microarray data using SPRINT
    Mitchell, Lawrence
    Sloan, Terence M.
    Mewissen, Muriel
    Ghazal, Peter
    Forster, Thorsten
    Piotrowski, Michal
    Trew, Arthur
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2014, 26 (04): : 854 - 865
  • [48] Modified PSO based Feature Selection for Microarray Data Classification
    Mohapatra, Puspanjali
    Chakravarty, S.
    2015 IEEE POWER, COMMUNICATION AND INFORMATION TECHNOLOGY CONFERENCE (PCITC-2015), 2015, : 703 - 709
  • [49] The application of feature selection methods to analyze the tissue microarray data
    Lin, Weipeng
    Liu, Kunhong
    Liu, Guoyan
    Proceedings of 4th International Workshop on Advanced Computational Intelligence, IWACI 2011, 2011, : 455 - 460
  • [50] Integrating Biological Information for Feature Selection in Microarray Data Classification
    Fang, Ong Huey
    Mustapha, Norwati
    Sulaiman, Md. Nasir
    2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATIONS: ICCEA 2010, PROCEEDINGS, VOL 2, 2010, : 330 - 334