Interval-valued analysis for discriminative gene selection and tissue sample classification using microarray data

被引:12
作者
Qi, Yunsong [1 ]
Yang, Xibei [1 ]
机构
[1] Jiangsu Univ Sci & Technol, Sch Comp Sci & Engn, Zhenjiang 212003, Jiangsu, Peoples R China
关键词
Microarray; Gene selection; Classification; Rough sets; Interval-valued decision table; ACUTE LYMPHOBLASTIC-LEUKEMIA; RNA-SEQ; MOLECULAR CLASSIFICATION; DIFFERENTIAL EXPRESSION; MUTUAL INFORMATION; ROUGH SETS; CANCER; SIMILARITY; PREDICTION; NORMALIZATION;
D O I
10.1016/j.ygeno.2012.09.004
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
An important application of gene expression data is to classify samples in a variety of diagnostic fields. However, high dimensionality and a small number of noisy samples pose significant challenges to existing classification methods. Focused on the problems of overfitting and sensitivity to noise of the dataset in the classification of microarray data, we propose an interval-valued analysis method based on a rough set technique to select discriminative genes and to use these genes to classify tissue samples of microarray data. We first select a small subset of genes based on interval-valued rough set by considering the preference-ordered domains of the gene expression data, and then classify test samples into certain classes with a term of similar degree. Experiments show that the proposed method is able to reach high prediction accuracies with a small number of selected genes and its performance is robust to noise. (C) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:38 / 48
页数:11
相关论文
共 57 条
[1]  
[Anonymous], 2010, LECT NOTES ELECT ENG
[2]   Improving the performance of fuzzy rule-based classification systems with interval-valued fuzzy sets and genetic amplitude tuning [J].
Antonio Sanz, Jose ;
Fernandez, Alberto ;
Bustince, Humberto ;
Herrera, Francisco .
INFORMATION SCIENCES, 2010, 180 (19) :3674-3685
[3]   Simple and flexible classification of gene expression microarrays via Swirls and Ripples [J].
Baker, Stuart G. .
BMC BIOINFORMATICS, 2010, 11
[4]   Evolutionary rough feature selection in gene expression data [J].
Banerjee, Mohua ;
Mitra, Sushmita ;
Banka, Haider .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2007, 37 (04) :622-632
[5]   Regularized logistic regression without a penalty term: An application to cancer classification with microarray data [J].
Bielza, Concha ;
Robles, Victor ;
Larranaga, Pedro .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (05) :5110-5118
[6]  
Bolón-Canedo V, 2010, IEEE IJCNN
[7]   Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments [J].
Bullard, James H. ;
Purdom, Elizabeth ;
Hansen, Kasper D. ;
Dudoit, Sandrine .
BMC BIOINFORMATICS, 2010, 11
[8]   INTERFERON GAMMA INHIBITS APOPTOTIC CELL-DEATH IN B-CELL CHRONIC LYMPHOCYTIC-LEUKEMIA [J].
BUSCHLE, M ;
CAMPANA, D ;
CARDING, SR ;
RICHARD, C ;
HOFFBRAND, AV ;
BRENNER, MK .
JOURNAL OF EXPERIMENTAL MEDICINE, 1993, 177 (01) :213-218
[9]   An evolutionary approach for gene selection and classification of microarray data based on SVM error-bound theories [J].
Debnath, Rameswar ;
Kurita, Takio .
BIOSYSTEMS, 2010, 100 (01) :39-46
[10]   Comparison of performance of five common classifiers represented as boundary methods: Euclidean Distance to Centroids, Linear Discriminant Analysis, Quadratic Discriminant Analysis, Learning Vector Quantization and Support Vector Machines, as dependent on data structure [J].
Dixon, Sarah J. ;
Brereton, Richard G. .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2009, 95 (01) :1-17