Gene selection in cancer classification using PSO/SVM and GA/SVM hybrid algorithms

被引:156
作者
Alba, Enrique [1 ]
Garcia-Nieto, Jose [1 ]
Jourdan, Laetitia [2 ]
Talbi, El-Ghazali [2 ]
机构
[1] Univ Malaga, Dept Lenguajes & Ciencias Computac, E-29071 Malaga, Spain
[2] Univ Lille 1, INRIA Futurs, LIFL, F-59655 Villeneuve Dascq, France
来源
2007 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-10, PROCEEDINGS | 2007年
关键词
D O I
10.1109/CEC.2007.4424483
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work we compare the use of a Particle Swarm Optimization (PSO) and a Genetic Algorithm (GA) (both augmented with Support Vector Machines SVM) for the classification of high dimensional Microarray Data. Both algorithms are used for finding small samples of informative genes amongst thousands of them. A SVM classifier with 10-fold cross-validation is applied in order to validate and evaluate the provided solutions. A first contribution is to prove that PSOSVM is able to find interesting genes and to provide classification competitive performance. Specifically, a new version of PSO, called Geometric PSO, is empirically evaluated for the first time in this work using a binary representation in Hamming space. In this sense, a comparison of this approach with a new GA(SVM) and also with other existing methods of literature is provided. A second important contribution consists in the actual discovery of new and challenging results on six public datasets identifying significant in the development of a variety of cancers (leukemia, breast, colon, ovarian, prostate, and lung).
引用
收藏
页码:284 / +
页数:2
相关论文
共 32 条
  • [1] Alba E, 2002, LECT NOTES COMPUT SC, V2400, P927
  • [2] Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
    Alon, U
    Barkai, N
    Notterman, DA
    Gish, K
    Ybarra, S
    Mack, D
    Levine, AJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) : 6745 - 6750
  • [3] Cahon S., 2003, P INT PAR DISTR PROC, P144
  • [4] CHANG CC, 2002, LIBSVM LIBR SUPPORT
  • [5] Clerc M., 2005, BINARY PARTICLE SWAR
  • [6] CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
  • [7] Deb K., 2003, 2003001 KANGAL
  • [8] FIX, 1951, 4 US AIR FORC SCH AV
  • [9] Support vector machine classification and validation of cancer tissue samples using microarray expression data
    Furey, TS
    Cristianini, N
    Duffy, N
    Bednarski, DW
    Schummer, M
    Haussler, D
    [J]. BIOINFORMATICS, 2000, 16 (10) : 906 - 914
  • [10] Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring
    Golub, TR
    Slonim, DK
    Tamayo, P
    Huard, C
    Gaasenbeek, M
    Mesirov, JP
    Coller, H
    Loh, ML
    Downing, JR
    Caligiuri, MA
    Bloomfield, CD
    Lander, ES
    [J]. SCIENCE, 1999, 286 (5439) : 531 - 537