Estimating the size of the human interactome

被引:547
作者
Stumpf, Michael P. H. [1 ,2 ]
Thorne, Thomas [1 ]
de Silva, Eric [1 ]
Stewart, Ronald [1 ]
An, Hyeong Jun [3 ]
Lappe, Michael [3 ]
Wiuf, Carsten [4 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Div Mol Biosci, London SW7 2AZ, England
[2] Univ London Imperial Coll Sci Technol & Med, Inst Math Sci, London SW7 2AZ, England
[3] Aarhus Univ, Bioinformat Res Ctr, DK-8000 Aarhus, Denmark
[4] Max Planck Inst Mol Genet, D-14195 Berlin, Germany
基金
英国生物技术与生命科学研究理事会; 英国惠康基金;
关键词
evolutionary systems biology; network inference; network sampling theory; network evolution;
D O I
10.1073/pnas.0708078105
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
After the completion of the human and other genome projects it emerged that the number of genes in organisms as diverse as fruit flies, nematodes, and humans does not reflect our perception of their relative complexity. Here, we provide reliable evidence that the size of protein interaction networks in different organisms appears to correlate much better with their apparent biological complexity. We develop a stable and powerful, yet simple, statistical procedure to estimate the size of the whole network from subnet data. This approach is then applied to a range of eukaryotic organisms for which extensive protein interaction data have been collected and we estimate the number of interactions in humans to be approximate to 650,000. We find that the human interaction network is one order of magnitude bigger than the Drosophila melanogaster interactome and approximate to 3 times bigger than in Caenorhabditis elegans.
引用
收藏
页码:6959 / 6964
页数:6
相关论文
共 43 条
[1]  
[Anonymous], 1998, INTRO BOOTSTRAP
[2]  
[Anonymous], 1979, Theoretical statistics
[3]  
[Anonymous], 1975, REPRINTING MONOGRAPH
[4]   Gaining confidence in high-throughput protein interaction networks [J].
Bader, JS ;
Chaudhuri, A ;
Rothberg, JM ;
Chant, J .
NATURE BIOTECHNOLOGY, 2004, 22 (01) :78-85
[5]   The transcriptional landscape of the mammalian genome [J].
Carninci, P ;
Kasukawa, T ;
Katayama, S ;
Gough, J ;
Frith, MC ;
Maeda, N ;
Oyama, R ;
Ravasi, T ;
Lenhard, B ;
Wells, C ;
Kodzius, R ;
Shimokawa, K ;
Bajic, VB ;
Brenner, SE ;
Batalov, S ;
Forrest, ARR ;
Zavolan, M ;
Davis, MJ ;
Wilming, LG ;
Aidinis, V ;
Allen, JE ;
Ambesi-Impiombato, X ;
Apweiler, R ;
Aturaliya, RN ;
Bailey, TL ;
Bansal, M ;
Baxter, L ;
Beisel, KW ;
Bersano, T ;
Bono, H ;
Chalk, AM ;
Chiu, KP ;
Choudhary, V ;
Christoffels, A ;
Clutterbuck, DR ;
Crowe, ML ;
Dalla, E ;
Dalrymple, BP ;
de Bono, B ;
Della Gatta, G ;
di Bernardo, D ;
Down, T ;
Engstrom, P ;
Fagiolini, M ;
Faulkner, G ;
Fletcher, CF ;
Fukushima, T ;
Furuno, M ;
Futaki, S ;
Gariboldi, M .
SCIENCE, 2005, 309 (5740) :1559-1563
[6]   Evolution of the protein repertoire [J].
Chothia, C ;
Gough, J ;
Vogel, C ;
Teichmann, SA .
SCIENCE, 2003, 300 (5626) :1701-1703
[7]   The animal in the genome: comparative genomics and evolution [J].
Copley, Richard R. .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2008, 363 (1496) :1453-1461
[8]   Complex networks and simple models in biology [J].
de Silva, E ;
Stumpf, MPH .
JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2005, 2 (05) :419-430
[9]   The effects of incomplete protein interaction data on structural and evolutionary inferences [J].
de Silva, Eric ;
Thorne, Thomas ;
Ingram, Piers ;
Agrafioti, Ino ;
Swire, Jonathan ;
Wiuf, Carsten ;
Stumpf, Michael P. H. .
BMC BIOLOGY, 2006, 4 (1)
[10]   A simple physical model for scaling in protein-protein interaction networks [J].
Deeds, EJ ;
Ashenberg, O ;
Shakhnovich, EI .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (02) :311-316