Minimum curvilinearity to enhance topological prediction of protein interactions by network embedding

被引:81
作者
Cannistraci, Carlo Vittorio [1 ,2 ]
Alanis-Lobato, Gregorio [1 ,2 ]
Ravasi, Timothy [1 ,2 ]
机构
[1] KAUST, Computat Biosci Res Ctr, Comp Elect & Math Sci & Engn Div, Biol & Environm Sci & Engn Div,Integrat Syst Biol, Thuwal 239556900, Saudi Arabia
[2] Univ Calif San Diego, Dept Med, Div Med Genet, San Diego, CA 92093 USA
关键词
GENERALITY; REDUCTION;
D O I
10.1093/bioinformatics/btt208
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Most functions within the cell emerge thanks to protein-protein interactions (PPIs), yet experimental determination of PPIs is both expensive and time-consuming. PPI networks present significant levels of noise and incompleteness. Predicting interactions using only PPI-network topology (topological prediction) is difficult but essential when prior biological knowledge is absent or unreliable. Methods: Network embedding emphasizes the relations between network proteins embedded in a low-dimensional space, in which protein pairs that are closer to each other represent good candidate interactions. To achieve network denoising, which boosts prediction performance, we first applied minimum curvilinear embedding (MCE), and then adopted shortest path (SP) in the reduced space to assign likelihood scores to candidate interactions. Furthermore, we introduce (i) a new valid variation of MCE, named non-centred MCE (ncMCE); (ii) two automatic strategies for selecting the appropriate embedding dimension; and (iii) two new randomized procedures for evaluating predictions. Results: We compared our method against several unsupervised and supervisedly tuned embedding approaches and node neighbourhood techniques. Despite its computational simplicity, ncMCE-SP was the overall leader, outperforming the current methods in topological link prediction. Conclusion: Minimum curvilinearity is a valuable non-linear framework that we successfully applied to the embedding of protein networks for the unsupervised prediction of novel PPIs. The rationale for our approach is that biological and evolutionary information is imprinted in the non-linear patterns hidden behind the protein network topology, and can be exploited for predicting new protein links. The predicted PPIs represent good candidates for testing in high-throughput experiments or for exploitation in systems biology tools such as those used for network-based inference and prediction of disease-related functional modules.
引用
收藏
页码:199 / 209
页数:11
相关论文
共 31 条
  • [21] Interaction generality, a measurement to assess the reliability of a protein-protein interaction
    Saito, R
    Suzuki, H
    Hayashizaki, Y
    [J]. NUCLEIC ACIDS RESEARCH, 2002, 30 (05) : 1163 - 1168
  • [22] A NONLINEAR MAPPING FOR DATA STRUCTURE ANALYSIS
    SAMMON, JW
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1969, C 18 (05) : 401 - &
  • [23] Shaw B, 2009, P 26 INT C MACH LEAR
  • [24] Tree preserving embedding
    Shieh, Albert D.
    Hashimoto, Tatsunori B.
    Airoldi, Edoardo M.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2011, 108 (41) : 16916 - 16921
  • [25] The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored
    Szklarczyk, Damian
    Franceschini, Andrea
    Kuhn, Michael
    Simonovic, Milan
    Roth, Alexander
    Minguez, Pablo
    Doerks, Tobias
    Stark, Manuel
    Muller, Jean
    Bork, Peer
    Jensen, Lars J.
    von Mering, Christian
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 : D561 - D568
  • [26] A global geometric framework for nonlinear dimensionality reduction
    Tenenbaum, JB
    de Silva, V
    Langford, JC
    [J]. SCIENCE, 2000, 290 (5500) : 2319 - +
  • [27] van der Maaten L, 2008, J MACH LEARN RES, V9, P2579
  • [28] Local multidimensional scaling
    Venna, Jarkko
    Kaski, Samuel
    [J]. NEURAL NETWORKS, 2006, 19 (6-7) : 889 - 899
  • [29] Venna J, 2010, J MACH LEARN RES, V11, P451
  • [30] Using manifold embedding for assessing and predicting protein interactions from high-throughput experimental data
    You, Zhu-Hong
    Lei, Ying-Ke
    Gui, Jie
    Huang, De-Shuang
    Zhou, Xiaobo
    [J]. BIOINFORMATICS, 2010, 26 (21) : 2744 - 2751