Novel Protein-Protein Interactions Inferred from Literature Context

被引:35
作者
van Haagen, Herman H. H. B. M.
't Hoen, Peter A. C.
Bovo, Alessandro Botelho
de Morree, Antoine
van Mulligen, Erik M.
Chichester, Christine
Kors, Jan A.
den Dunnen, Johan T.
van Ommen, Gert-Jan B.
van der Maarel, Silvere M.
Kern, Vinicius Medina
Mons, Barend
Schuemie, Martijn J.
机构
[1] Biosemantics Association, Department of Human Genetics, Leiden University Medical Center, Leiden
[2] Program in Knowledge Engineering and Management (EGC), Federal University of Santa Catarina (UFSC), Florianópolis
[3] Department of Medical Informatics, Erasmus University Medical Center, Rotterdam
来源
PLOS ONE | 2009年 / 4卷 / 11期
关键词
BIOLOGICAL NETWORKS; CONCEPT PROFILES; INTERACTION MAP; GENE; CALPAIN-3; KNOWLEDGE; DYSFERLIN; IDENTIFICATION; DISCOVERY; COMPONENT;
D O I
10.1371/journal.pone.0007894
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
We have developed a method that predicts Protein-Protein Interactions (PPIs) based on the similarity of the context in which proteins appear in literature. This method outperforms previously developed PPI prediction algorithms that rely on the conjunction of two protein names in MEDLINE abstracts. We show significant increases in coverage (76% versus 32%) and sensitivity (66% versus 41% at a specificity of 95%) for the prediction of PPIs currently archived in 6 PPI databases. A retrospective analysis shows that PPIs can efficiently be predicted before they enter PPI databases and before their interaction is explicitly described in the literature. The practical value of the method for discovery of novel PPIs is illustrated by the experimental confirmation of the inferred physical interaction between CAPN3 and PARVB, which was based on frequent co-occurrence of both proteins with concepts like Z-disc, dysferlin, and alpha-actinin. The relationships between proteins predicted by our method are broader than PPIs, and include proteins in the same complex or pathway. Dependent on the type of relationships deemed useful, the precision of our method can be as high as 90%. The full set of predicted interactions is available in a downloadable matrix and through the webtool Nermal, which lists the most likely interaction partners for a given protein. Our framework can be used for prioritizing potential interaction partners, hitherto undiscovered, for follow-up studies and to aid the generation of accurate protein interaction maps.
引用
收藏
页数:8
相关论文
共 47 条
  • [1] Integrin-linked kinase, a novel component of the cardiac mechanical stretch sensor, controls contractility in the zebrafish heart
    Bendig, Garnet
    Grimmler, Matthias
    Huttner, Inken G.
    Wessels, Georgia
    Dahme, Tillman
    Just, Steffen
    Trano, Nicole
    Katus, Hugo A.
    Fishman, Mark C.
    Rottbauer, Wolfgang
    [J]. GENES & DEVELOPMENT, 2006, 20 (17) : 2361 - 2372
  • [2] BENHUR A, 2006, CHOOSING NEGATIVE EX
  • [3] The Unified Medical Language System (UMLS): integrating biomedical terminology
    Bodenreider, O
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D267 - D270
  • [4] Boutet Emmanuel, 2007, V406, P89
  • [5] MINT: the molecular INTeraction database
    Chatr-aryamontri, Andrew
    Ceol, Arnaud
    Palazzi, Luisa Montecchi
    Nardelli, Giuliano
    Schneider, Maria Victoria
    Castagnoli, Luisa
    Cesareni, Gianni
    [J]. NUCLEIC ACIDS RESEARCH, 2007, 35 : D572 - D574
  • [6] Gene name ambiguity of eukaryotic nomenclatures
    Chen, LF
    Liu, HF
    Friedman, C
    [J]. BIOINFORMATICS, 2005, 21 (02) : 248 - 256
  • [7] A survey of current work in biomedical text mining
    Cohen, AM
    Hersh, WR
    [J]. BRIEFINGS IN BIOINFORMATICS, 2005, 6 (01) : 57 - 71
  • [8] Identification of putative in vivo substrates of calpain 3 by comparative proteomics of overexpressing transgenic and nontransgenic mice
    Cohen, Niaz
    Kudryashova, Elena
    Kramerova, Irina
    Anderson, Louise V. B.
    Beckmann, Jacques S.
    Bushby, Katherine
    Spencer, Melissa J.
    [J]. PROTEOMICS, 2006, 6 (22) : 6075 - 6084
  • [9] A protein interaction map of Drosophila melanogaster
    Giot, L
    Bader, JS
    Brouwer, C
    Chaudhuri, A
    Kuang, B
    Li, Y
    Hao, YL
    Ooi, CE
    Godwin, B
    Vitols, E
    Vijayadamodar, G
    Pochart, P
    Machineni, H
    Welsh, M
    Kong, Y
    Zerhusen, B
    Malcolm, R
    Varrone, Z
    Collis, A
    Minto, M
    Burgess, S
    McDaniel, L
    Stimpson, E
    Spriggs, F
    Williams, J
    Neurath, K
    Ioime, N
    Agee, M
    Voss, E
    Furtak, K
    Renzulli, R
    Aanensen, N
    Carrolla, S
    Bickelhaupt, E
    Lazovatsky, Y
    DaSilva, A
    Zhong, J
    Stanyon, CA
    Finley, RL
    White, KP
    Braverman, M
    Jarvie, T
    Gold, S
    Leach, M
    Knight, J
    Shimkets, RA
    McKenna, MP
    Chant, J
    Rothberg, JM
    [J]. SCIENCE, 2003, 302 (5651) : 1727 - 1736
  • [10] Predicting biological networks from genomic data
    Harrington, Eoghan D.
    Jensen, Lars J.
    Bork, Peer
    [J]. FEBS LETTERS, 2008, 582 (08) : 1251 - 1258