Gene selection;
Classification;
Dedicated genetic algorithm;
Linear discriminant analysis;
SUPPORT VECTOR MACHINE;
CANCER CLASSIFICATION;
DISCRIMINANT-ANALYSIS;
EXPRESSION DATA;
MOLECULAR CLASSIFICATION;
TISSUE CLASSIFICATION;
BIOMARKER DISCOVERY;
PREDICTION;
TUMOR;
VALIDATION;
D O I:
10.1016/j.neucom.2010.03.024
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
In supervised classification of Microarray data, gene selection aims at identifying a (small) subset of informative genes from the initial data in order to obtain high predictive accuracy. This paper introduces a new embedded approach to this difficult task where a genetic algorithm (GA) is combined with Fisher's linear discriminant analysis (LDA). This LDA-based GA algorithm has the major characteristic that the GA uses not only a LDA classifier in its fitness function, but also LDA's discriminant coefficients in its dedicated crossover and mutation operators. Computational experiments on seven public datasets show that under an unbiased experimental protocol, the proposed algorithm is able to reach high prediction accuracies with a small number of selected genes. (C) 2010 Elsevier B.V. All rights reserved.