Novel algorithm for coexpression detection in time-varying microarray data sets

被引:7
作者
Yin, Zong-Xian [1 ]
Chiang, Jung-Hsien [2 ]
机构
[1] So Taiwan Univ, Dept Multimedia & Entertainment Sci, Tainan 71005, Taiwan
[2] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
关键词
pattern analysis; time series analysis; bioinformatics; data mining; clustering; gene expression;
D O I
10.1109/TCBB.2007.1052
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
When analyzing the results of microarray experiments, biologists generally use unsupervised categorization tools. However, such tools regard each time point as an independent dimension and utilize the euclidean distance to compute the similarities between expressions. Furthermore, some of these methods require the number of clusters to be determined in advance, which is clearly impossible in the case of a new data set. Therefore, this study proposes a novel scheme, designated the Variation-based Coexpression Detection (VCD) algorithm, to analyze the trends of expressions based on their variation over time. The proposed algorithm has two advantages. First, it is unnecessary to determine the number of clusters in advance since the algorithm automatically detects those genes whose profiles are grouped together and creates patterns for these groups. Second, the algorithm features a new measurement criterion for calculating the degree of change of the expressions between adjacent time points and evaluating their trend similarities. Three real-world microarray data sets are employed to evaluate the performance of the proposed algorithm.
引用
收藏
页码:120 / 135
页数:16
相关论文
共 35 条
[1]   Attribute clustering for grouping, selection, and classification of gene expression data [J].
Au, WH ;
Chan, KCC ;
Wong, AKC ;
Wang, Y .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2005, 2 (02) :83-101
[2]   Clustering gene expression patterns [J].
Ben-Dor, A ;
Shamir, R ;
Yakhini, Z .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1999, 6 (3-4) :281-297
[3]  
Bezdek J. C., 1981, Pattern recognition with fuzzy objective function algorithms
[4]   A new fuzzy cover approach to clustering [J].
Chiang, JH ;
Yue, SH ;
Yin, ZX .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2004, 12 (02) :199-208
[5]  
Cho H, 2004, SIAM PROC S, P114
[6]   Robust clustering methods: A unified view [J].
Dave, RN ;
Krishnapuram, R .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1997, 5 (02) :270-293
[7]  
Dhillon I. S., 2002, Proceedings 2002 IEEE International Conference on Data Mining. ICDM 2002, P131, DOI 10.1109/ICDM.2002.1183895
[8]   Diametrical clustering for identifying anti-correlated gene clusters [J].
Dhillon, IS ;
Marcotte, EM ;
Roshan, U .
BIOINFORMATICS, 2003, 19 (13) :1612-1619
[9]   Cluster analysis and display of genome-wide expression patterns [J].
Eisen, MB ;
Spellman, PT ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :14863-14868
[10]   Optimization methods for fuzzy clustering [J].
Fu, GY .
FUZZY SETS AND SYSTEMS, 1998, 93 (03) :301-309