Iterative cluster analysis of protein interaction data

Cited by: 145
Authors
Arnau, V
Mars, S
Marín, I
Affiliations
[1] Univ Valencia, Dept Informat, E-46100 Valencia, Spain
[2] Univ Valencia, Dept Genet, E-46100 Valencia, Spain
DOI
10.1093/bioinformatics/bti021
Chinese Library Classification number
Q5 [Biochemistry];
Subject classification code
071010 ; 081704 ;
Abstract
Motivation: To develop fast hierarchical clustering tools for cases in which the distances among the elements of a set are constrained, causing frequent distance ties, as happens in protein interaction data. Results: We present the program UVCLUSTER, which iteratively explores distance datasets using hierarchical clustering. Once the user selects a group of proteins, UVCLUSTER converts the set of primary distances among them (i.e. the minimum number of steps, or interactions, required to connect two proteins) into secondary distances that measure the strength of the connection between each pair of proteins when the interactions for all the proteins in the group are considered. We show that this novel strategy has advantages over conventional clustering methods for exploring protein-protein interaction data. UVCLUSTER easily incorporates information from the largest available interaction datasets to generate comprehensive primary distance tables. The versatility, simplicity of use and high speed of UVCLUSTER on standard personal computers suggest that it can be a benchmark analytical tool for interactome data analysis.
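The primary distance defined in the abstract (the minimum number of interactions needed to connect two proteins) can be illustrated with a breadth-first search over an interaction graph. The following is a minimal sketch, not UVCLUSTER's actual code: the function name `primary_distances`, the toy protein names and the edge list are all hypothetical, and the conversion to secondary distances is not attempted because the abstract does not specify it.

```python
from collections import deque


def primary_distances(interactions, proteins):
    """Return shortest-path step counts between every ordered pair in `proteins`.

    Hypothetical helper for illustration; not part of UVCLUSTER.
    """
    # Build an undirected adjacency map from the interaction pairs.
    adjacency = {}
    for a, b in interactions:
        adjacency.setdefault(a, set()).add(b)
        adjacency.setdefault(b, set()).add(a)

    distances = {}
    for source in proteins:
        # Breadth-first search yields the minimum number of steps from source.
        steps = {source: 0}
        queue = deque([source])
        while queue:
            current = queue.popleft()
            for neighbour in adjacency.get(current, ()):
                if neighbour not in steps:
                    steps[neighbour] = steps[current] + 1
                    queue.append(neighbour)
        for target in proteins:
            if target != source:
                # None marks pairs with no connecting path.
                distances[(source, target)] = steps.get(target)
    return distances


# Assumed toy network; the protein names are placeholders.
toy_interactions = [("A", "B"), ("B", "C"), ("C", "D"), ("B", "D")]
toy_distances = primary_distances(toy_interactions, ["A", "B", "C", "D"])
```

Because primary distances are small integers, many pairs in such a table share the same value. In the toy network, four of the six unordered pairs are at distance 1 and the other two at distance 2, which is the kind of distance tie the abstract refers to.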
Pages: 364-378
Number of pages: 15