Functional protein representations from biological networks enable diverse cross-species inference

被引:20
作者
Fan, Jason [1 ,2 ]
Cannistra, Anthony [3 ]
Fried, Inbar [4 ]
Lim, Tim [5 ]
Schaffner, Thomas [6 ]
Crovella, Mark [5 ]
Hescott, Benjamin [7 ]
Leiserson, Mark D. M. [1 ,2 ]
机构
[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
[2] Univ Maryland, Ctr Bioinformat & Computat Biol, College Pk, MD 20742 USA
[3] Univ Washington, Dept Biol, Seattle, WA 98195 USA
[4] Univ N Carolina, Med Sch, Chapel Hill, NC 27515 USA
[5] Boston Univ, Dept Comp Sci, Boston, MA 02215 USA
[6] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
[7] Northeastern Univ, Coll Comp & Informat Sci, Boston, MA 02115 USA
关键词
GLOBAL ALIGNMENT; ALGORITHMS; SIMILARITY; LETHALITY; KERNELS;
D O I
10.1093/nar/gkz132
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Transferring many biological applications, but is complicated by divergent and convergent evolution. Many current approaches interaction network data to transfer knowledge across species, exemplified by network alignment methods. While these techniques do well, they are limited in scope, creating metrics to address one specific problem or task. We take a different approach by creating an environment where multiple knowledge transfer tasks can be performed using the same protein representations. Specifically, our kernel-based method, MUNK, integrates sequence and network structure to create functional protein representations, embedding proteins from different species in the same vector space. First we show proteins in different species that are close in MUNK-space are functionally similar. Next, we use these representations to share knowledge of synthetic lethal interactions between species. Importantly, we find that the results using MUNK-representations are at least as accurate as existing algorithms for these tasks. Finally, we generalize the notion of a phenolog ('orthologous phenotype') to use functionally similar proteins (i.e. those with similar representations). We demonstrate the utility of this broadened notion by using it to identify known phenologs and novel nonobvious ones supported by current research.
引用
收藏
页数:13
相关论文
共 61 条
  • [1] Resolving the Ortholog Conjecture: Orthologs Tend to Be Weakly, but Significantly, More Similar in Function than Paralogs
    Altenhoff, Adrian M.
    Studer, Romain A.
    Robinson-Rechavi, Marc
    Dessimoz, Christophe
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (05)
  • [2] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [3] Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkw1099, 10.1093/nar/gkh131]
  • [4] Gene Ontology: tool for the unification of biology
    Ashburner, M
    Ball, CA
    Blake, JA
    Botstein, D
    Butler, H
    Cherry, JM
    Davis, AP
    Dolinski, K
    Dwight, SS
    Eppig, JT
    Harris, MA
    Hill, DP
    Issel-Tarver, L
    Kasarskis, A
    Lewis, S
    Matese, JC
    Richardson, JE
    Ringwald, M
    Rubin, GM
    Sherlock, G
    [J]. NATURE GENETICS, 2000, 25 (01) : 25 - 29
  • [5] Bengio Y, 2006, SEMISUPERVISED LEARN
  • [6] Exploring genetic interactions and networks with yeast
    Boone, Charles
    Bussey, Howard
    Andrews, Brenda J.
    [J]. NATURE REVIEWS GENETICS, 2007, 8 (06) : 437 - 449
  • [7] Unequal evolutionary conservation of human protein interactions in interologous networks.
    Brown, Kevin V.
    Jurisica, Igor
    [J]. GENOME BIOLOGY, 2007, 8 (05)
  • [8] Amelioration of IFN-γ and TNF-α-Induced Intestinal Epithelial Barrier Dysfunction by Berberine via Suppression of MLCK-MLC Phosphorylation Signaling Pathway
    Cao, Min
    Wang, Pei
    Sun, Chunhong
    He, Wen
    Wang, Fengjun
    [J]. PLOS ONE, 2013, 8 (05):
  • [9] The BioGRID interaction database: 2017 update
    Chatr-aryamontri, Andrew
    Oughtred, Rose
    Boucher, Lorrie
    Rust, Jennifer
    Chang, Christie
    Kolas, Nadine K.
    O'Donnell, Lara
    Oster, Sara
    Theesfeld, Chandra
    Sellam, Adnane
    Stark, Chris
    Breitkreutz, Bobby-Joe
    Dolinski, Kara
    Tyers, Mike
    [J]. NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) : D369 - D379
  • [10] SGD:: Saccharomyces Genome Database
    Cherry, JM
    Adler, C
    Ball, C
    Chervitz, SA
    Dwight, SS
    Hester, ET
    Jia, YK
    Juvik, G
    Roe, T
    Schroeder, M
    Weng, SA
    Botstein, D
    [J]. NUCLEIC ACIDS RESEARCH, 1998, 26 (01) : 73 - 79