A genetic algorithm approach for discovering diagnostic patterns in molecular measurement data

被引:0
作者
Schaffer, JD [1 ]
Janevski, A [1 ]
Simpson, MR [1 ]
机构
[1] Philips Res USA, Briarcliff Manor, NY 10510 USA
来源
PROCEEDINGS OF THE 2005 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY | 2005年
关键词
gene expression; classification; molecular diagnostics; microarray; genetic algorithm;
D O I
暂无
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The objective of this work is the development of an algorithm that, after training, will be able to discriminate between disease classes in molecular data. The system proposed uses a genetic algorithm (GA) to achieve this discrimination. We apply our method to three publicly available data sets. Two of the data sets are based on microarray data that allow the simultaneous measurement of the expression levels of genes under different disease states. The third data set is based on serum proteomic pattern diagnostics of ovarian cancer using high-resolution mass spectrometry to extract a set of biomarker classifiers. We show how our methodology rinds an abundance of different feature models, automatically selecting a subset of discriminatory features, whose classification accuracy is comparable to other approaches considered. This raises questions about how to choose among the many competing models, while simultaneously estimating the prediction accuracy of the chosen models.
引用
收藏
页码:392 / 399
页数:8
相关论文
共 29 条
[1]   Selection bias in gene extraction on the basis of microarray gene-expression data [J].
Ambroise, C ;
McLachlan, GJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) :6562-6566
[2]  
[Anonymous], FDN GENETIC ALGORITH
[3]  
BAGGERLY KA, 2005, CANC INFORM
[4]   Is cross-validation valid for small-sample microarray classification? [J].
Braga-Neto, UM ;
Dougherty, ER .
BIOINFORMATICS, 2004, 20 (03) :374-380
[5]   Breast cancer diagnosis using self-organizing map for sonography [J].
Chen, DR ;
Chang, RF ;
Huang, YL .
ULTRASOUND IN MEDICINE AND BIOLOGY, 2000, 26 (03) :405-411
[6]   High-resolution serum proteomic features for ovarian cancer detection [J].
Conrads, TP ;
Fusaro, VA ;
Ross, S ;
Johann, D ;
Rajapakse, V ;
Hitt, BA ;
Steinberg, SM ;
Kohn, EC ;
Fishman, DA ;
Whiteley, G ;
Barrett, JC ;
Liotta, LA ;
Petricoin, EF ;
Veenstra, TD .
ENDOCRINE-RELATED CANCER, 2004, 11 (02) :163-178
[7]   Statistical tests for differential expression in cDNA microarray experiments [J].
Cui, XQ ;
Churchill, GA .
GENOME BIOLOGY, 2003, 4 (04)
[8]  
Dudoit S, 2002, STAT SINICA, V12, P111
[9]  
ESHELMAN LJ, 1993, PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON GENETIC ALGORITHMS, P9
[10]   Judging the quality of gene expression-based clustering methods using gene annotation [J].
Gibbons, FD ;
Roth, FP .
GENOME RESEARCH, 2002, 12 (10) :1574-1581