Reconstruction of the experimentally supported human protein interactome: what can we learn?

被引:25
作者
Klapa, Maria I. [1 ]
Tsafou, Kalliopi [1 ,2 ]
Theodoridis, Evangelos [3 ]
Tsakalidis, Athanasios [3 ]
Moschonas, Nicholas K. [2 ]
机构
[1] Fdn Res & Technol Hellas FORTH ICE HT, Metab Engn & Syst Biol Lab, Inst Chem Engn Sci, Patras, Greece
[2] Univ Patras, Sch Med, Dept Gen Biol, GR-26110 Patras, Greece
[3] Univ Patras, Comp Engn & Informat Dept, Patras, Greece
来源
BMC SYSTEMS BIOLOGY | 2013年 / 7卷
关键词
Human protein interactome analysis; Human protein-protein interaction (PPI) databases; Network biology; PPI network reconstruction; INTERACTION DATABASE; INTERACTION NETWORK; MAP; IDENTIFICATION; FEATURES; TOOLS;
D O I
10.1186/1752-0509-7-96
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Understanding the topology and dynamics of the human protein-protein interaction (PPI) network will significantly contribute to biomedical research, therefore its systematic reconstruction is required. Several meta-databases integrate source PPI datasets, but the protein node sets of their networks vary depending on the PPI data combined. Due to this inherent heterogeneity, the way in which the human PPI network expands via multiple dataset integration has not been comprehensively analyzed. We aim at assembling the human interactome in a global structured way and exploring it to gain insights of biological relevance. Results: First, we defined the UniProtKB manually reviewed human "complete" proteome as the reference protein-node set and then we mined five major source PPI datasets for direct PPIs exclusively between the reference proteins. We updated the protein and publication identifiers and normalized all PPIs to the UniProt identifier level. The reconstructed interactome covers approximately 60% of the human proteome and has a scale-free structure. No apparent differentiating gene functional classification characteristics were identified for the unrepresented proteins. The source dataset integration augments the network mainly in PPIs. Polyubiquitin emerged as the highest-degree node, but the inclusion of most of its identified PPIs may be reconsidered. The high number (>300) of connections of the subsequent fifteen proteins correlates well with their essential biological role. According to the power-law network structure, the unrepresented proteins should mainly have up to four connections with equally poorly-connected interactors. Conclusions: Reconstructing the human interactome based on the a priori definition of the protein nodes enabled us to identify the currently included part of the human "complete" proteome, and discuss the role of the proteins within the network topology with respect to their function. As the network expansion has to comply with the scale-free theory, we suggest that the core of the human interactome has essentially emerged. Thus, it could be employed in systems biology and biomedical research, despite the considerable number of currently unrepresented proteins. The latter are probably involved in specialized physiological conditions, justifying the scarcity of related PPI information, and their identification can assist in designing relevant functional experiments and targeted text mining algorithms.
引用
收藏
页数:13
相关论文
共 47 条
  • [11] Functional organization of the yeast proteome by systematic analysis of protein complexes
    Gavin, AC
    Bösche, M
    Krause, R
    Grandi, P
    Marzioch, M
    Bauer, A
    Schultz, J
    Rick, JM
    Michon, AM
    Cruciat, CM
    Remor, M
    Höfert, C
    Schelder, M
    Brajenovic, M
    Ruffner, H
    Merino, A
    Klein, K
    Hudak, M
    Dickson, D
    Rudi, T
    Gnau, V
    Bauch, A
    Bastuck, S
    Huhse, B
    Leutwein, C
    Heurtier, MA
    Copley, RR
    Edelmann, A
    Querfurth, E
    Rybin, V
    Drewes, G
    Raida, M
    Bouwmeester, T
    Bork, P
    Seraphin, B
    Kuster, B
    Neubauer, G
    Superti-Furga, G
    [J]. NATURE, 2002, 415 (6868) : 141 - 147
  • [12] Disentangling function from topology to infer the network properties of disease genes
    Ghersi, Dario
    Singh, Mona
    [J]. BMC SYSTEMS BIOLOGY, 2013, 7
  • [13] A protein interaction map of Drosophila melanogaster
    Giot, L
    Bader, JS
    Brouwer, C
    Chaudhuri, A
    Kuang, B
    Li, Y
    Hao, YL
    Ooi, CE
    Godwin, B
    Vitols, E
    Vijayadamodar, G
    Pochart, P
    Machineni, H
    Welsh, M
    Kong, Y
    Zerhusen, B
    Malcolm, R
    Varrone, Z
    Collis, A
    Minto, M
    Burgess, S
    McDaniel, L
    Stimpson, E
    Spriggs, F
    Williams, J
    Neurath, K
    Ioime, N
    Agee, M
    Voss, E
    Furtak, K
    Renzulli, R
    Aanensen, N
    Carrolla, S
    Bickelhaupt, E
    Lazovatsky, Y
    DaSilva, A
    Zhong, J
    Stanyon, CA
    Finley, RL
    White, KP
    Braverman, M
    Jarvie, T
    Gold, S
    Leach, M
    Knight, J
    Shimkets, RA
    McKenna, MP
    Chant, J
    Rothberg, JM
    [J]. SCIENCE, 2003, 302 (5651) : 1727 - 1736
  • [14] Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources
    Huang, Da Wei
    Sherman, Brad T.
    Lempicki, Richard A.
    [J]. NATURE PROTOCOLS, 2009, 4 (01) : 44 - 57
  • [15] Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists
    Huang, Da Wei
    Sherman, Brad T.
    Lempicki, Richard A.
    [J]. NUCLEIC ACIDS RESEARCH, 2009, 37 (01) : 1 - 13
  • [16] Roles for the two-hybrid system in exploration of the yeast protein interactome
    Ito, T
    Ota, K
    Kubota, H
    Yamaguchi, Y
    Chiba, T
    Sakuraba, K
    Yoshida, M
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2002, 1 (08) : 561 - 566
  • [17] Global topological features of cancer proteins in the human interactome
    Jonsson, Pall F.
    Bates, Paul A.
    [J]. BIOINFORMATICS, 2006, 22 (18) : 2291 - 2297
  • [18] The ConsensusPathDB interaction database: 2013 update
    Kamburov, Atanas
    Stelzl, Ulrich
    Lehrach, Hans
    Herwig, Ralf
    [J]. NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) : D793 - D800
  • [19] The IntAct molecular interaction database in 2012
    Kerrien, Samuel
    Aranda, Bruno
    Breuza, Lionel
    Bridge, Alan
    Broackes-Carter, Fiona
    Chen, Carol
    Duesbury, Margaret
    Dumousseau, Marine
    Feuermann, Marc
    Hinz, Ursula
    Jandrasits, Christine
    Jimenez, Rafael C.
    Khadake, Jyoti
    Mahadevan, Usha
    Masson, Patrick
    Pedruzzi, Ivo
    Pfeiffenberger, Eric
    Porras, Pablo
    Raghunath, Arathi
    Roechert, Bernd
    Orchard, Sandra
    Hermjakob, Henning
    [J]. NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) : D841 - D846
  • [20] Protein-protein interaction and pathway databases, a graphical review
    Klingstrom, Tomas
    Plewczynski, Dariusz
    [J]. BRIEFINGS IN BIOINFORMATICS, 2011, 12 (06) : 702 - 713