From Function to Interaction: A New Paradigm for Accurately Predicting Protein Complexes Based on Protein-to-Protein Interaction Networks

被引:21
作者
Xu, Bin [1 ]
Guan, Jihong [1 ]
机构
[1] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
基金
中国国家自然科学基金;
关键词
Protein complex; protein-protein interaction networks; functional similarity; prediction; SEMANTIC SIMILARITY; MODULES; IDENTIFICATION; ONTOLOGY;
D O I
10.1109/TCBB.2014.2306825
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Identification of protein complexes is critical to understand complex formation and protein functions. Recent advances in high-throughput experiments have provided large data sets of protein-protein interactions (PPIs). Many approaches, based on the assumption that complexes are dense subgraphs of PPI networks (PINs in short), have been proposed to predict complexes using graph clustering methods. In this paper, we introduce a novel from-function-to-interaction paradigm for protein complex detection. As proteins perform biological functions by forming complexes, we first cluster proteins using biology process (BP) annotations from gene ontology (GO). Then, we map the resulting protein clusters onto a PPI network (PIN in short), extract connected subgraphs consisting of clustered proteins from the PPI network and expand each connected subgraph with protein nodes that have rich links to the proteins in the subgraph. Such expanded subgraphs are taken as predicted complexes. We apply the proposed method (called CPredictor) to two PPI data sets of S. cerevisiae for predicting protein complexes. Experimental results show that CPredictor outperforms the existing methods. The outstanding precision of CPredictor proves that the from-function-to-interaction paradigm provides a new and effective way to computational detection of protein complexes.
引用
收藏
页码:616 / 627
页数:12
相关论文
共 39 条
[1]   CFinder:: locating cliques and overlapping modules in biological networks [J].
Adamcsek, B ;
Palla, G ;
Farkas, IJ ;
Derényi, I ;
Vicsek, T .
BIOINFORMATICS, 2006, 22 (08) :1021-1023
[2]   Development and implementation of an algorithm for detection of protein complexes in large interaction networks [J].
Altaf-Ul-Amin, Md ;
Shinbo, Yoko ;
Mihara, Kenji ;
Kurokawa, Ken ;
Kanaya, Shigehiko .
BMC BIOINFORMATICS, 2006, 7 (1)
[3]   An automated method for finding molecular complexes in large protein interaction networks [J].
Bader, GD ;
Hogue, CW .
BMC BIOINFORMATICS, 2003, 4 (1)
[4]  
Bartel P.L., 1997, YEAST 2 HYBRID SYSTE
[5]   The Gene Ontology in 2010: extensions and refinements The Gene Ontology Consortium [J].
Berardini, Tanya Z. ;
Li, Donghui ;
Huala, Eva ;
Bridges, Susan ;
Burgess, Shane ;
McCarthy, Fiona ;
Carbon, Seth ;
Lewis, Suzanna E. ;
Mungall, Christopher J. ;
Abdulla, Amina ;
Wood, Valerie ;
Feltrin, Erika ;
Valle, Giorgio ;
Chisholm, Rex L. ;
Fey, Petra ;
Gaudet, Pascale ;
Kibbe, Warren ;
Basu, Siddhartha ;
Bushmanova, Yulia ;
Eilbeck, Karen ;
Siegele, Deborah A. ;
McIntosh, Brenley ;
Renfro, Daniel ;
Zweifel, Adrienne ;
Hu, James C. ;
Ashburner, Michael ;
Tweedie, Susan ;
Alam-Faruque, Yasmin ;
Apweiler, Rolf ;
Auchinchloss, Andrea ;
Bairoch, Amos ;
Barrell, Daniel ;
Binns, David ;
Blatter, Marie-Claude ;
Bougueleret, Lydie ;
Boutet, Emmanuel ;
Breuza, Lionel ;
Bridge, Alan ;
Browne, Paul ;
Chan, Wei Mun ;
Coudert, Elizabeth ;
Daugherty, Louise ;
Dimmer, Emily ;
Eberhardt, Ruth ;
Estreicher, Anne ;
Famiglietti, Livia ;
Ferro-Rojas, Serenella ;
Feuermann, Marc ;
Foulger, Rebecca ;
Gruaz-Gumowski, Nadine .
NUCLEIC ACIDS RESEARCH, 2010, 38 :D331-D335
[6]  
Chen B., 2012, PROC IEEE INT C BIOI, P1
[7]   Toward a comprehensive atlas of the physical interactome of Saccharomyces cerevisiae [J].
Collins, Sean R. ;
Kemmeren, Patrick ;
Zhao, Xue-Chu ;
Greenblatt, Jack F. ;
Spencer, Forrest ;
Holstege, Frank C. P. ;
Weissman, Jonathan S. ;
Krogan, Nevan J. .
MOLECULAR & CELLULAR PROTEOMICS, 2007, 6 (03) :439-450
[8]   An efficient algorithm for large-scale detection of protein families [J].
Enright, AJ ;
Van Dongen, S ;
Ouzounis, CA .
NUCLEIC ACIDS RESEARCH, 2002, 30 (07) :1575-1584
[9]   A Max-Flow-Based Approach to the Identification of Protein Complexes Using Protein Interaction and Microarray Data [J].
Feng, Jianxing ;
Jiang, Rui ;
Jiang, Tao .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2011, 8 (03) :621-634
[10]   Proteome survey reveals modularity of the yeast cell machinery [J].
Gavin, AC ;
Aloy, P ;
Grandi, P ;
Krause, R ;
Boesche, M ;
Marzioch, M ;
Rau, C ;
Jensen, LJ ;
Bastuck, S ;
Dümpelfeld, B ;
Edelmann, A ;
Heurtier, MA ;
Hoffman, V ;
Hoefert, C ;
Klein, K ;
Hudak, M ;
Michon, AM ;
Schelder, M ;
Schirle, M ;
Remor, M ;
Rudi, T ;
Hooper, S ;
Bauer, A ;
Bouwmeester, T ;
Casari, G ;
Drewes, G ;
Neubauer, G ;
Rick, JM ;
Kuster, B ;
Bork, P ;
Russell, RB ;
Superti-Furga, G .
NATURE, 2006, 440 (7084) :631-636