Binding Site Prediction for Protein-Protein Interactions and Novel Motif Discovery using Re-occurring Polypeptide Sequences

被引:28
作者
Amos-Binks, Adam [1 ]
Patulea, Catalin [3 ]
Pitre, Sylvain [1 ]
Schoenrock, Andrew [1 ]
Gui, Yuan [2 ]
Green, James R. [3 ]
Golshani, Ashkan [2 ]
Dehne, Frank [1 ]
机构
[1] Carleton Univ, Sch Comp Sci, Ottawa, ON K1S 5B6, Canada
[2] Carleton Univ, Dept Biol, Ottawa, ON K1S 5B6, Canada
[3] Carleton Univ, Dept Syst & Comp Engn, Ottawa, ON K1S 5B6, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
DOMAIN INTERACTIONS; DATABASE; FAMILIES;
D O I
10.1186/1471-2105-12-225
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: While there are many methods for predicting protein-protein interaction, very few can determine the specific site of interaction on each protein. Characterization of the specific sequence regions mediating interaction (binding sites) is crucial for an understanding of cellular pathways. Experimental methods often report false binding sites due to experimental limitations, while computational methods tend to require data which is not available at the proteome-scale. Here we present PIPE-Sites, a novel method of protein specific binding site prediction based on pairs of re-occurring polypeptide sequences, which have been previously shown to accurately predict protein-protein interactions. PIPE-Sites operates at high specificity and requires only the sequences of query proteins and a database of known binary interactions with no binding site data, making it applicable to binding site prediction at the proteome-scale. Results: PIPE-Sites was evaluated using a dataset of 265 yeast and 423 human interacting proteins pairs with experimentally-determined binding sites. We found that PIPE-Sites predictions were closer to the confirmed binding site than those of two existing binding site prediction methods based on domain-domain interactions, when applied to the same dataset. Finally, we applied PIPE-Sites to two datasets of 2347 yeast and 14,438 human novel interacting protein pairs predicted to interact with high confidence. An analysis of the predicted interaction sites revealed a number of protein subsequences which are highly re-occurring in binding sites and which may represent novel binding motifs. Conclusions: PIPE-Sites is an accurate method for predicting protein binding sites and is applicable to the proteome-scale. Thus, PIPE-Sites could be useful for exhaustive analysis of protein binding patterns in whole proteomes as well as discovery of novel binding motifs. PIPE-Sites is available online at http://pipe-sites.cgmlab.org/.
引用
收藏
页数:13
相关论文
共 41 条
[1]   The BioGRID interaction database:: 2008 update [J].
Breitkreutz, Bobby-Joe ;
Stark, Chris ;
Reguly, Teresa ;
Boucher, Lorrie ;
Breitkreutz, Ashton ;
Livstone, Michael ;
Oughtred, Rose ;
Lackner, Daniel H. ;
Bahler, Jurg ;
Wood, Valerie ;
Dolinski, Kara ;
Tyers, Mike .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D637-D640
[2]   DOMINO: a database of domain-peptide interactions [J].
Ceol, Arnaud ;
Chatr-aryamontri, Andrew ;
Santonico, Elena ;
Sacco, Roberto ;
Castagnoli, Luisa ;
Cesareni, Gianni .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D557-D560
[3]   Genetic and physical maps of Saccharomyces cerevisiae [J].
Cherry, JM ;
Ball, C ;
Weng, S ;
Juvik, G ;
Schmidt, R ;
Adler, C ;
Dunn, B ;
Dwight, S ;
Riles, L ;
Mortimer, RK ;
Botstein, D .
NATURE, 1997, 387 (6632) :67-73
[4]   The Pfam protein families database [J].
Finn, Robert D. ;
Mistry, Jaina ;
Tate, John ;
Coggill, Penny ;
Heger, Andreas ;
Pollington, Joanne E. ;
Gavin, O. Luke ;
Gunasekaran, Prasad ;
Ceric, Goran ;
Forslund, Kristoffer ;
Holm, Liisa ;
Sonnhammer, Erik L. L. ;
Eddy, Sean R. ;
Bateman, Alex .
NUCLEIC ACIDS RESEARCH, 2010, 38 :D211-D222
[5]   A fast method to predict protein interaction sites from sequences [J].
Gallet, X ;
Charloteaux, B ;
Thomas, A ;
Brasseur, R .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 302 (04) :917-926
[6]   Functional organization of the yeast proteome by systematic analysis of protein complexes [J].
Gavin, AC ;
Bösche, M ;
Krause, R ;
Grandi, P ;
Marzioch, M ;
Bauer, A ;
Schultz, J ;
Rick, JM ;
Michon, AM ;
Cruciat, CM ;
Remor, M ;
Höfert, C ;
Schelder, M ;
Brajenovic, M ;
Ruffner, H ;
Merino, A ;
Klein, K ;
Hudak, M ;
Dickson, D ;
Rudi, T ;
Gnau, V ;
Bauch, A ;
Bastuck, S ;
Huhse, B ;
Leutwein, C ;
Heurtier, MA ;
Copley, RR ;
Edelmann, A ;
Querfurth, E ;
Rybin, V ;
Drewes, G ;
Raida, M ;
Bouwmeester, T ;
Bork, P ;
Seraphin, B ;
Kuster, B ;
Neubauer, G ;
Superti-Furga, G .
NATURE, 2002, 415 (6868) :141-147
[7]   On the number of protein-protein interactions in the yeast proteome [J].
Grigoriev, A .
NUCLEIC ACIDS RESEARCH, 2003, 31 (14) :4157-4161
[8]   Predicting domain-domain interactions using a parsimony approach [J].
Guimaraes, Katia S. ;
Jothi, Raja ;
Zotenko, Elena ;
Przytycka, Teresa M. .
GENOME BIOLOGY, 2006, 7 (11)
[9]   Genome-wide inference of protein interaction sites: lessons from the yeast high-quality negative protein-protein interaction dataset [J].
Guo, Jie ;
Wu, Xiaomei ;
Zhang, Da-Yong ;
Lin, Kui .
NUCLEIC ACIDS RESEARCH, 2008, 36 (06) :2002-2011
[10]   Biochemical and structural characterization of Cren7, a novel chromatin protein conserved among Crenarchaea [J].
Guo, Li ;
Feng, Yingang ;
Zhang, Zhenfeng ;
Yao, Hongwei ;
Luo, Yuanming ;
Wang, Jinfeng ;
Huang, Li .
NUCLEIC ACIDS RESEARCH, 2008, 36 (04) :1129-1137