Minimum curvilinearity to enhance topological prediction of protein interactions by network embedding

被引:81
作者
Cannistraci, Carlo Vittorio [1 ,2 ]
Alanis-Lobato, Gregorio [1 ,2 ]
Ravasi, Timothy [1 ,2 ]
机构
[1] KAUST, Computat Biosci Res Ctr, Comp Elect & Math Sci & Engn Div, Biol & Environm Sci & Engn Div,Integrat Syst Biol, Thuwal 239556900, Saudi Arabia
[2] Univ Calif San Diego, Dept Med, Div Med Genet, San Diego, CA 92093 USA
关键词
GENERALITY; REDUCTION;
D O I
10.1093/bioinformatics/btt208
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Most functions within the cell emerge thanks to protein-protein interactions (PPIs), yet experimental determination of PPIs is both expensive and time-consuming. PPI networks present significant levels of noise and incompleteness. Predicting interactions using only PPI-network topology (topological prediction) is difficult but essential when prior biological knowledge is absent or unreliable. Methods: Network embedding emphasizes the relations between network proteins embedded in a low-dimensional space, in which protein pairs that are closer to each other represent good candidate interactions. To achieve network denoising, which boosts prediction performance, we first applied minimum curvilinear embedding (MCE), and then adopted shortest path (SP) in the reduced space to assign likelihood scores to candidate interactions. Furthermore, we introduce (i) a new valid variation of MCE, named non-centred MCE (ncMCE); (ii) two automatic strategies for selecting the appropriate embedding dimension; and (iii) two new randomized procedures for evaluating predictions. Results: We compared our method against several unsupervised and supervisedly tuned embedding approaches and node neighbourhood techniques. Despite its computational simplicity, ncMCE-SP was the overall leader, outperforming the current methods in topological link prediction. Conclusion: Minimum curvilinearity is a valuable non-linear framework that we successfully applied to the embedding of protein networks for the unsupervised prediction of novel PPIs. The rationale for our approach is that biological and evolutionary information is imprinted in the non-linear patterns hidden behind the protein network topology, and can be exploited for predicting new protein links. The predicted PPIs represent good candidates for testing in high-throughput experiments or for exploitation in systems biology tools such as those used for network-based inference and prediction of disease-related functional modules.
引用
收藏
页码:199 / 209
页数:11
相关论文
共 31 条
  • [11] Exploiting indirect neighbours and topological weight to predict protein function from protein-protein interactions
    Chua, Hon Nian
    Sung, Wing-Kin
    Wong, Limsoon
    [J]. BIOINFORMATICS, 2006, 22 (13) : 1623 - 1630
  • [12] Hinton G.E., 2003, Adv. Neural Inform. Process. Syst., V15, DOI DOI 10.5555/2968618.2968725
  • [13] Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources
    Huang, Da Wei
    Sherman, Brad T.
    Lempicki, Richard A.
    [J]. NATURE PROTOCOLS, 2009, 4 (01) : 44 - 57
  • [14] Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists
    Huang, Da Wei
    Sherman, Brad T.
    Lempicki, Richard A.
    [J]. NUCLEIC ACIDS RESEARCH, 2009, 37 (01) : 1 - 13
  • [15] Jolliffe I., 2002, PRINCIPAL COMPONENT, DOI [10.1007/978-1-4757-1904-8_7, 10.1016/0169-7439(87)80084-9]
  • [16] Geometric De-noising of Protein-Protein Interaction Networks
    Kuchaiev, Oleksii
    Rasajski, Marija
    Higham, Desmond J.
    Przulj, Natasa
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (08)
  • [17] Controllability of complex networks
    Liu, Yang-Yu
    Slotine, Jean-Jacques
    Barabasi, Albert-Laszlo
    [J]. NATURE, 2011, 473 (7346) : 167 - 173
  • [18] Modeling interactome: scale-free or geometric?
    Przulj, N
    Corneil, DG
    Jurisica, I
    [J]. BIOINFORMATICS, 2004, 20 (18) : 3508 - 3515
  • [19] Use and misuse of the gene ontology annotations
    Rhee, Seung Yon
    Wood, Valerie
    Dolinski, Kara
    Draghici, Sorin
    [J]. NATURE REVIEWS GENETICS, 2008, 9 (07) : 509 - 515
  • [20] Construction of reliable protein-protein interaction networks with a new interaction generality measure
    Saito, R
    Suzuki, H
    Hayashizaki, Y
    [J]. BIOINFORMATICS, 2003, 19 (06) : 756 - 763