Parameters selection in gene selection using Gaussian kernel support vector machines by genetic algorithm

被引:0
作者
Mao Yong
Zhou Xiao-bo
Pi Dao-ying
Sun You-xian
Wong Stephen T.C.
机构
[1] Zhejiang University,National Laboratory of Industrial Control Technology, Institute of Modern Control Engineering
[2] Harvard University,Harvard Center for Neurodegeneration and Repair, Harvard Medical School and Brigham and Women's Hospital, Harvard Medical School
关键词
Gene selection; Support vector machine (SVM); Recursive feature elimination (RFE); Genetic algorithm (GA); Parameter selection; A; Q789; R73;
D O I
10.1007/BF02888487
中图分类号
学科分类号
摘要
In microarray-based cancer classification, gene selection is an important issue owing to the large number of variables and small number of samples as well as its non-linearity. It is difficult to get satisfying results by using conventional linear statistical methods. Recursive feature elimination based on support vector machine (SVM RFE) is an effective algorithm for gene selection and cancer classification, which are integrated into a consistent framework. In this paper, we propose a new method to select parameters of the aforementioned algorithm implemented with Gaussian kernel SVMs as better alternatives to the common practice of selecting the apparently best parameters by using a genetic algorithm to search for a couple of optimal parameter. Fast implementation issues for this method are also discussed for pragmatic reasons. The proposed method was tested on two representative hereditary breast cancer and acute leukaemia datasets. The experimental results indicate that the proposed method performs well in selecting genes and achieves high classification accuracies with these genes.
引用
收藏
页码:961 / 973
页数:12
相关论文
共 82 条
[1]  
Alizadeh A.A.(2000)Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling Nature 403 503-511
[2]  
Eisen M.B.(2002)Choosing kernel parameters for support vector machines Machine Learning 46 131-159
[3]  
Davis R.E.(2002)Comparison of discrimination methods for the classification of tumors using gene expression data Journal of the American Statistical Association 97 77-87
[4]  
Ma C.(2003)An accelerated procedure for recursive feature ranking on microarray data Neural Networks 16 641-648
[5]  
Lossos I.S.(1999)Molecular classification of cancer: class discovery and class prediction by gene expression monitoring Science 286 531-537
[6]  
Rosenwald A.(2002)Gene selection for cancer classification using support vector machines Machine Learning 46 389-422
[7]  
Boldrick J.C.(2001)Gene expression profiles in hereditary breast cancer The New England Journal of Medicine 344 539-548
[8]  
Sabet H.(2000)Multivariate measurement of gene expression relations Geonomics 67 201-209
[9]  
Tran T.(2002)Strong feature sets from small samples Journal of Computational Biology 9 127-146
[10]  
Yu X.(2003)Gene selection: a Bayesian variable selection approach Bioinformatics 19 90-97