PICKLE 2.0: A human protein-protein interaction meta-database employing data integration via genetic information ontology

被引:39
作者
Gioutlakis, Aris [1 ,2 ]
Klapa, Maria I. [2 ]
Moschonas, Nicholas K. [1 ,2 ]
机构
[1] Univ Patras, Sch Med, Dept Gen Biol, Patras, Greece
[2] Fdn Res & Technol Hellas FORTH ICE HT, Inst Chem Engn Sci, Metab Engn & Syst Biol Lab, Patras, Greece
来源
PLOS ONE | 2017年 / 12卷 / 10期
关键词
CURATION; NETWORK;
D O I
10.1371/journal.pone.0186039
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
It has been acknowledged that source databases recording experimentally supported human protein-protein interactions (PPIs) exhibit limited overlap. Thus, the reconstruction of a comprehensive PPI network requires appropriate integration of multiple heterogeneous primary datasets, presenting the PPIs at various genetic reference levels. Existing PPI meta-databases perform integration via normalization; namely, PPIs are merged after converted to a certain target level. Hence, the node set of the integrated network depends each time on the number and type of the combined datasets. Moreover, the irreversible a priori normalization process hinders the identification of normalization artifacts in the integrated network, which originate from the nonlinearity characterizing the genetic information flow. PICKLE (Protein InteraCtion KnowLedgebasE) 2.0 implements a new architecture for this recently introduced human PPI meta-database. Its main novel feature over the existing meta-databases is its approach to primary PPI dataset integration via genetic information ontology. Building upon the PICKLE principles of using the reviewed human complete proteome (RHCP) of UniProtKB/Swiss-Prot as the reference protein interactor set, and filtering out protein interactions with low probability of being direct based on the available evidence, PICKLE 2.0 first assembles the RHCP genetic information ontology network by connecting the corresponding genes, nucleotide sequences (mRNAs) and proteins (UniProt entries) and then integrates PPI datasets by superimposing them on the ontology network without any a priori transformations. Importantly, this process allows the resulting heterogeneous integrated network to be reversibly normalized to any level of genetic reference without loss of the original information, the latter being used for identification of normalization biases, and enables the appraisal of potential false positive interactions through PPI source database cross-checking. The PICKLE web-based interface (www. pickle. gr) allows for the simultaneous query of multiple entities and provides integrated human PPI networks at either the protein (UniProt) or the gene level, at three PPI filtering modes.
引用
收藏
页数:17
相关论文
共 24 条
  • [1] Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkh131, 10.1093/nar/gkw1099]
  • [2] The BioGRID interaction database: 2017 update
    Chatr-aryamontri, Andrew
    Oughtred, Rose
    Boucher, Lorrie
    Rust, Jennifer
    Chang, Christie
    Kolas, Nadine K.
    O'Donnell, Lara
    Oster, Sara
    Theesfeld, Chandra
    Sellam, Adnane
    Stark, Chris
    Breitkreutz, Bobby-Joe
    Dolinski, Kara
    Tyers, Mike
    [J]. NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) : D369 - D379
  • [3] Dissecting the Human Protein-Protein Interaction Network via Phylogenetic Decomposition
    Chen, Cho-Yi
    Ho, Andy
    Huang, Hsin-Yuan
    Juan, Hsueh-Fen
    Huang, Hsuan-Cheng
    [J]. SCIENTIFIC REPORTS, 2014, 4
  • [4] Towards a Protein-Protein Interaction information extraction system: Recognizing named entities
    Danger, Roxana
    Pla, Ferran
    Molina, Antonio
    Rosso, Paolo
    [J]. KNOWLEDGE-BASED SYSTEMS, 2014, 57 : 104 - 118
  • [5] Biana: a software framework for compiling biological interactions and analyzing networks
    Garcia-Garcia, Javier
    Guney, Emre
    Aragues, Ramon
    Planas-Iglesias, Joan
    Oliva, Baldo
    [J]. BMC BIOINFORMATICS, 2010, 11
  • [6] UniHI 7: an enhanced database for retrieval and interactive analysis of human molecular interaction networks
    Kalathur, Ravi Kiran Reddy
    Pinto, Jose Pedro
    Hernandez-Prieto, Miguel A.
    Machado, Rui S. R.
    Almeida, Dulce
    Chaurasia, Gautam
    Futschik, Matthias E.
    [J]. NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) : D408 - D414
  • [7] ConsensusPathDB-a database for integrating human functional interaction networks
    Kamburov, Atanas
    Wierling, Christoph
    Lehrach, Hans
    Herwig, Ralf
    [J]. NUCLEIC ACIDS RESEARCH, 2009, 37 : D623 - D628
  • [8] Reconstruction of the experimentally supported human protein interactome: what can we learn?
    Klapa, Maria I.
    Tsafou, Kalliopi
    Theodoridis, Evangelos
    Tsakalidis, Athanasios
    Moschonas, Nicholas K.
    [J]. BMC SYSTEMS BIOLOGY, 2013, 7
  • [9] Integrated interactions database: tissue-specific view of the human and model organism interactomes
    Kotlyar, Max
    Pastrello, Chiara
    Sheahan, Nicholas
    Jurisica, Igor
    [J]. NUCLEIC ACIDS RESEARCH, 2016, 44 (D1) : D536 - D541
  • [10] Causal analysis approaches in Ingenuity Pathway Analysis
    Kraemer, Andreas
    Green, Jeff
    Pollard, Jack, Jr.
    Tugendreich, Stuart
    [J]. BIOINFORMATICS, 2014, 30 (04) : 523 - 530