Biological network comparison using graphlet degree distribution

Cited by: 552
Authors
Przulj, Natasa [1]
Affiliations
[1] Univ Calif Irvine, Dept Comp Sci, Irvine, CA 92697 USA
DOI
10.1093/bioinformatics/btl301
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: Analogous to biological sequence comparison, comparing cellular networks is an important problem that could provide insight into biological understanding and therapeutics. For technical reasons, comparing large networks is computationally infeasible, and thus heuristics, such as the degree distribution, clustering coefficient, diameter, and relative graphlet frequency distribution, have been sought. It is easy to demonstrate that two networks are different by simply showing a short list of properties in which they differ. It is much harder to show that two networks are similar, as it requires demonstrating their similarity in all of their exponentially many properties. Clearly, it is computationally prohibitive to analyze all network properties, but the larger the number of constraints we impose in determining network similarity, the more likely it is that the networks will truly be similar.

Results: We introduce a new systematic measure of a network's local structure that imposes a large number of similarity constraints on networks being compared. In particular, we generalize the degree distribution, which measures the number of nodes 'touching' k edges, into distributions measuring the number of nodes 'touching' k graphlets, where graphlets are small connected non-isomorphic subgraphs of a large network. Our new measure of network local structure consists of 73 graphlet degree distributions of graphlets with 2-5 nodes, but it is easily extensible to a greater number of constraints (i.e. graphlets), if necessary, and the extensions are limited only by the available CPU. Furthermore, we show a way to combine the 73 graphlet degree distributions into a network 'agreement' measure, which is a number between 0 and 1, where 1 means that the networks have identical distributions and 0 means that they are far apart. Based on this new network agreement measure, we show that almost all of the 14 eukaryotic PPI networks, including human, resulting from various high-throughput experimental techniques, as well as from curated databases, are better modeled by geometric random graphs than by Erdős-Rényi, random scale-free, or Barabási-Albert scale-free networks.
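The generalization described in the abstract can be illustrated with a minimal Python sketch covering only two of the 73 distributions: nodes touching k edges (the ordinary degree distribution) and nodes touching k triangles. The function names, the 1/k scaling, and the Euclidean distance with a 1/sqrt(2) factor are assumptions chosen for illustration; the abstract does not spell out the agreement formula, so this should not be read as the paper's exact procedure.

```python
from collections import Counter
from itertools import combinations
from math import sqrt


def adjacency(edges):
    """Build an undirected adjacency map from an edge list, ignoring self-loops."""
    adj = {}
    for u, v in edges:
        if u == v:
            continue
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def edge_counts(adj):
    """Number of edges each node touches (its ordinary degree)."""
    return {v: len(nbrs) for v, nbrs in adj.items()}


def triangle_counts(adj):
    """Number of triangles each node touches."""
    return {
        v: sum(1 for a, b in combinations(nbrs, 2) if b in adj[a])
        for v, nbrs in adj.items()
    }


def distribution(counts):
    """Map k to the number of nodes that touch the graphlet exactly k times."""
    return Counter(counts.values())


def normalized(dist):
    """Assumed scaling: damp large k by 1/k, drop k = 0, then normalize to sum 1."""
    scaled = {k: n / k for k, n in dist.items() if k > 0}
    total = sum(scaled.values())
    return {k: s / total for k, s in scaled.items()} if total else {}


def agreement(dist_g, dist_h):
    """Assumed per-graphlet agreement in [0, 1]: 1 minus a scaled Euclidean distance."""
    ng, nh = normalized(dist_g), normalized(dist_h)
    keys = set(ng) | set(nh)
    distance = sqrt(sum((ng.get(k, 0.0) - nh.get(k, 0.0)) ** 2 for k in keys))
    return 1.0 - distance / sqrt(2)


# Toy networks: a triangle with one pendant node, and a 4-node path.
g = adjacency([(0, 1), (1, 2), (0, 2), (2, 3)])
h = adjacency([(0, 1), (1, 2), (2, 3)])

per_graphlet = [
    agreement(distribution(edge_counts(g)), distribution(edge_counts(h))),
    agreement(distribution(triangle_counts(g)), distribution(triangle_counts(h))),
]
# One possible way to combine per-graphlet scores into a single number.
overall = sum(per_graphlet) / len(per_graphlet)
print(per_graphlet, overall)
```

A full implementation would repeat the counting step for every graphlet with 2-5 nodes and average all 73 scores; the two counters here only show the pattern, and the arithmetic mean is one of several reasonable ways to combine them.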
Pages: E177-E183
Page count: 7