Prescont: Predicting protein-protein interfaces utilizing four residue properties

被引:34
作者
Zellner, Hermann [1 ]
Staudigel, Martin [2 ]
Trenner, Thomas [2 ]
Bittkowski, Meik [2 ]
Wolowski, Vincent [2 ]
Icking, Christian [2 ]
Merkl, Rainer [1 ]
机构
[1] Univ Regensburg, Inst Biophys & Phys Biochem, D-93040 Regensburg, Germany
[2] Univ Hagen, Fac Math & Comp Sci, D-58084 Hagen, Germany
关键词
support vector machine; machine learning; protein complexes; residue classification; INTERACTION-SITE PREDICTION; BINDING HOT-SPOTS; HYDROPHOBIC PATCHES; SECONDARY STRUCTURE; CONSERVATION; SEQUENCE; ACCURACY; IDENTIFICATION; RECOGNITION; FREQUENCIES;
D O I
10.1002/prot.23172
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
An important task of computational biology is to identify those parts of a polypeptide chain, which are involved in interactions with other proteins. For this purpose, we have developed the program PresCont, which predicts in a robust manner amino acids that constitute protein-protein interfaces (PPIs). PresCont reaches state-of-the-art classification quality on the basis of only four residue properties that can be readily deduced from the 3D structure of an individual protein and a multiple sequence alignment (MSA) composed of homologs. The core of PresCont is a support vector machine, which assesses solvent-accessible surface area, hydrophobicity, conservation, and the local environment of each amino acid on the protein surface. For training and performance testing, we compiled three nonoverlapping datasets consisting of permanently formed or transient complexes, respectively. A comparison with SPPIDER, ProMate, and meta-PPISP showed that PresCont compares favorably with these highly sophisticated programs, and that its prediction quality is less dependent on the type of protein complex being considered. This balance is due to a mutual compensation of classification weaknesses observed for individual properties: For PPIs of permanent complexes, solvent-accessible surface and hydrophobicity contribute most to classification quality, for PPIs of transient complexes, the assessment of the local environment is most significant. Moreover, we show that for permanent complexes a segmentation of PPIs into core and rim residues has only a moderate influence on prediction quality. PresCont is available as a web service at . Proteins 2012; (C) 2011 Wiley Periodicals, Inc.
引用
收藏
页码:154 / 168
页数:15
相关论文
共 89 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] Data growth and its impact on the SCOP database: new developments
    Andreeva, Antonina
    Howorth, Dave
    Chandonia, John-Marc
    Brenner, Steven E.
    Hubbard, Tim J. P.
    Chothia, Cyrus
    Murzin, Alexey G.
    [J]. NUCLEIC ACIDS RESEARCH, 2008, 36 : D419 - D425
  • [3] The Universal Protein Resource (UniProt) in 2010
    Apweiler, Rolf
    Martin, Maria Jesus
    O'Donovan, Claire
    Magrane, Michele
    Alam-Faruque, Yasmin
    Antunes, Ricardo
    Barrell, Daniel
    Bely, Benoit
    Bingley, Mark
    Binns, David
    Bower, Lawrence
    Browne, Paul
    Chan, Wei Mun
    Dimmer, Emily
    Eberhardt, Ruth
    Fedotov, Alexander
    Foulger, Rebecca
    Garavelli, John
    Huntley, Rachael
    Jacobsen, Julius
    Kleen, Michael
    Laiho, Kati
    Leinonen, Rasko
    Legge, Duncan
    Lin, Quan
    Liu, Wudong
    Luo, Jie
    Orchard, Sandra
    Patient, Samuel
    Poggioli, Diego
    Pruess, Manuela
    Corbett, Matt
    di Martino, Giuseppe
    Donnelly, Mike
    van Rensburg, Pieter
    Bairoch, Amos
    Bougueleret, Lydie
    Xenarios, Ioannis
    Altairac, Severine
    Auchincloss, Andrea
    Argoud-Puy, Ghislaine
    Axelsen, Kristian
    Baratin, Delphine
    Blatter, Marie-Claude
    Boeckmann, Brigitte
    Bolleman, Jerven
    Bollondi, Laurent
    Boutet, Emmanuel
    Quintaje, Silvia Braconi
    Breuza, Lionel
    [J]. NUCLEIC ACIDS RESEARCH, 2010, 38 : D142 - D148
  • [4] Structural analysis of a set of proteins resulting from a bacterial genomics project
    Badger, J
    Sauder, JM
    Adams, JM
    Antonysamy, S
    Bain, K
    Bergseid, MG
    Buchanan, SG
    Buchanan, MD
    Batiyenko, Y
    Christopher, JA
    Emtage, S
    Eroshkina, A
    Feil, I
    Furlong, EB
    Gajiwala, KS
    Gao, X
    He, D
    Hendle, J
    Huber, A
    Hoda, K
    Kearins, P
    Kissinger, C
    Laubert, B
    Lewis, HA
    Lin, J
    Loomis, K
    Lorimer, D
    Louie, G
    Maletic, M
    Marsh, CD
    Miller, I
    Molinari, J
    Muller-Dieckmann, HJ
    Newman, JM
    Noland, BW
    Pagarigan, B
    Park, F
    Peat, TS
    Post, KW
    Radojicic, S
    Ramos, A
    Romero, R
    Rutter, ME
    Sanderson, WE
    Schwinn, KD
    Tresser, J
    Winhoven, J
    Wright, TA
    Wu, L
    Xu, J
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 60 (04) : 787 - 796
  • [5] Dissecting subunit interfaces in homodimeric proteins
    Bahadur, RP
    Chakrabarti, P
    Rodier, F
    Janin, J
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 53 (03) : 708 - 719
  • [6] A dissection of specific and non-specific protein - Protein interfaces
    Bahadur, RP
    Chakrabarti, P
    Rodier, F
    Janin, J
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2004, 336 (04) : 943 - 955
  • [7] Anatomy of hot spots in protein interfaces
    Bogan, AA
    Thorn, KS
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1998, 280 (01) : 1 - 9
  • [8] Shelling the Voronoi interface of protein-protein complexes reveals patterns of residue conservation, dynamics, and composition
    Bouvier, Benjamin
    Gruenberg, Raik
    Nilges, Michael
    Cazals, Frederic
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2009, 76 (03) : 677 - 692
  • [9] Insights into protein-protein interfaces using a Bayesian network prediction method
    Bradford, James R.
    Needham, Chris J.
    Bulpitt, Andrew J.
    Westhead, David R.
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2006, 362 (02) : 365 - 386
  • [10] Improved prediction of protein-protein binding sites using a support vector machines approach
    Bradford, JR
    Westhead, DR
    [J]. BIOINFORMATICS, 2005, 21 (08) : 1487 - 1494