A neural network-based similarity index for clustering DNA microarray data

Cited by: 27
Authors
Sawa, T
Ohno-Machado, L
Affiliations
[1] Harvard Univ, Sch Med, Div Hlth Sci & Technol, Cambridge, MA 02139 USA
[2] MIT, Cambridge, MA 02139 USA
[3] Brigham & Womens Hosp, Decis Syst Grp, Boston, MA 02115 USA
Keywords
neural networks; machine learning; DNA microarray; cluster analysis
DOI
10.1016/S0010-4825(02)00032-X
Chinese Library Classification (CLC) number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
A common approach to the analysis of gene expression data is to define clusters of genes that have similar expression. A critical step in cluster analysis is the determination of similarity between the expression levels of two genes. We introduce a neural network-based similarity index as a non-linear similarity index and compare the results with other proximity measures for Saccharomyces cerevisiae gene expression data. We show that the clusters obtained using Euclidean distance, correlation coefficients, and mutual information were not significantly different. The clusters formed with the neural network-based index were more in agreement with those defined by functional categories and common regulatory motifs. (C) 2002 Elsevier Science Ltd. All rights reserved.
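The three conventional proximity measures named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the equal-width binning used to estimate mutual information, and the default of 3 bins are all assumptions made for illustration. The neural network-based index is not specified in this record and is therefore not sketched.

```python
import math
from collections import Counter


def euclidean_distance(x, y):
    """Euclidean distance between two expression profiles of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def pearson_correlation(x, y):
    """Pearson correlation coefficient (undefined for constant profiles)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)


def discretize(values, bins):
    """Assign each value to one of `bins` equal-width bins (assumed scheme)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins or 1.0  # avoid zero width for constant profiles
    return [min(int((v - lo) / width), bins - 1) for v in values]


def mutual_information(x, y, bins=3):
    """Mutual information in bits, estimated from binned profiles."""
    bx, by = discretize(x, bins), discretize(y, bins)
    n = len(x)
    count_x, count_y = Counter(bx), Counter(by)
    count_xy = Counter(zip(bx, by))
    # Sum of p(i, j) * log2(p(i, j) / (p(i) * p(j))) over observed bin pairs.
    return sum(
        (c / n) * math.log2(c * n / (count_x[i] * count_y[j]))
        for (i, j), c in count_xy.items()
    )


if __name__ == "__main__":
    # Hypothetical expression levels for two genes across five conditions.
    gene_a = [0.1, 0.5, 1.2, 2.0, 2.4]
    gene_b = [0.3, 1.1, 2.5, 4.1, 4.9]
    print("Euclidean distance:", euclidean_distance(gene_a, gene_b))
    print("Pearson correlation:", pearson_correlation(gene_a, gene_b))
    print("Mutual information (bits):", mutual_information(gene_a, gene_b))
```

Euclidean distance is a dissimilarity (smaller means more similar), whereas the correlation coefficient and mutual information are similarities, so a clustering routine would typically need one of them converted before the measures can be used interchangeably.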
Pages: 1-15
Number of pages: 15