Pattern classification in DNA microarray data of multiple tumor types

被引:33
作者
Lin, Tsun-Chen
Liu, Ru-Sheng
Chen, Chien-Yu
Chao, Ya-Ting
Chen, Shu-Yuan
机构
[1] Yuan Ze Univ, Dept Comp Engn & Sci, Tao Yuan 32026, Taiwan
[2] Yuan Ze Univ, Grad Sch Biotechnol & Bioinformat, Tao Yuan 32026, Taiwan
[3] Natl Taiwan Univ, Dept Bioind Mechatron Engn, Taipei 106, Taiwan
关键词
gene expression profiling; cancer classification; genetic algorithm; silhouette statistics;
D O I
10.1016/j.patcog.2006.01.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a genetic algorithm with silhouette statistics as discriminant function (GASS) for gene selection and pattern recognition. The proposed method evaluates gene expression patterns for discriminating heterogeneous cancers. Distance metrics and classification rules have also been analyzed to design a GASS with high classification accuracy. Moreover, the proposed method is compared to previously published methods. Various experimental results show that our method is effective for classifying the NCI60, the GCM and the SRBCTs datasets. Moreover, GASS outperforms other existing methods in both the leave-one-out cross validations and the independent test for novel data. (c) 2006 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:2426 / 2438
页数:13
相关论文
共 23 条
[1]   Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling [J].
Alizadeh, AA ;
Eisen, MB ;
Davis, RE ;
Ma, C ;
Lossos, IS ;
Rosenwald, A ;
Boldrick, JG ;
Sabet, H ;
Tran, T ;
Yu, X ;
Powell, JI ;
Yang, LM ;
Marti, GE ;
Moore, T ;
Hudson, J ;
Lu, LS ;
Lewis, DB ;
Tibshirani, R ;
Sherlock, G ;
Chan, WC ;
Greiner, TC ;
Weisenburger, DD ;
Armitage, JO ;
Warnke, R ;
Levy, R ;
Wilson, W ;
Grever, MR ;
Byrd, JC ;
Botstein, D ;
Brown, PO ;
Staudt, LM .
NATURE, 2000, 403 (6769) :503-511
[2]  
[Anonymous], 2003, Statistical Analysis of Gene Expression Microarray Data. Interdisciplinary Statistics
[3]  
BENDOR A, 2000, AGL200013 AG LAB
[4]   Evolutionary algorithms for finding optimal gene sets in microarray prediction [J].
Deutsch, JM .
BIOINFORMATICS, 2003, 19 (01) :45-52
[5]   Comparison of discrimination methods for the classification of tumors using gene expression data [J].
Dudoit, S ;
Fridlyand, J ;
Speed, TP .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) :77-87
[6]   Support vector machine classification and validation of cancer tissue samples using microarray expression data [J].
Furey, TS ;
Cristianini, N ;
Duffy, N ;
Bednarski, DW ;
Schummer, M ;
Haussler, D .
BIOINFORMATICS, 2000, 16 (10) :906-914
[7]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537
[8]  
Hall MA, 1998, AUST COMP S, V20, P181
[9]   The hallmarks of cancer [J].
Hanahan, D ;
Weinberg, RA .
CELL, 2000, 100 (01) :57-70
[10]  
Kaufman L., 1990, FINDING GROUPS DATA