HotSprint: database of computational hot spots in protein interfaces

被引:91
作者
Guney, Emre [1 ,2 ]
Tuncbag, Nurcan [1 ,2 ]
Keskin, Ozlem [1 ,2 ]
Gursoy, Attila [1 ,2 ]
机构
[1] Koc Univ, Ctr Comp Biol & Bioinformat, TR-34450 Istanbul, Turkey
[2] Koc Univ, Coll Engn, TR-34450 Istanbul, Turkey
关键词
D O I
10.1093/nar/gkm813
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We present a new database of computational hot spots in protein interfaces: HotSprint. Hot spots are residues comprising only a small fraction of interfaces yet accounting for the majority of the binding energy. HotSprint contains data for 35 776 protein interfaces among 49 512 protein interfaces extracted from the multi-chain structures in Protein Data Bank (PDB) as of February 2006. The conserved residues in interfaces with certain buried accessible solvent area (ASA) and complex ASA thresholds are flagged as computational hot spots. The predicted hot spots are observed to correlate with the experimental hot spots with an accuracy of 76. Several machine-learning methods (SVM, Decision Trees and Decision Lists) are also applied to predict hot spots, results reveal that our empirical approach performs better than the others. A web interface for the HotSprint database allows users to browse and query the hot spots in protein interfaces. HotSprint is available at http://prism.ccbb.ku.edu.tr/hotsprint; and it provides information for interface residues that are functionally and structurally important as well as the evolutionary history and solvent accessibility of residues in interfaces.
引用
收藏
页码:D662 / D666
页数:5
相关论文
共 30 条
[21]   Protein-protein interactions: Structurally conserved residues distinguish between binding sites and exposed protein surfaces [J].
Ma, BY ;
Elkayam, T ;
Wolfson, H ;
Nussinov, R .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (10) :5772-5777
[22]   Protein-protein interaction hotspots carved into sequences [J].
Ofran, Yanay ;
Rost, Burkhard .
PLOS COMPUTATIONAL BIOLOGY, 2007, 3 (07) :1169-1176
[23]   Prediction of functional sites by analysis of sequence and structure conservation [J].
Panchenko, AR ;
Kondrashov, F ;
Bryant, S .
PROTEIN SCIENCE, 2004, 13 (04) :884-892
[24]   A branch-and-bound algorithm for the inference of ancestral amino-acid sequences when the replacement rate varies among sites: Application to the evolution of five gene families [J].
Pupko, T ;
Pe'er, I ;
Hasegawa, M ;
Graur, D ;
Friedman, N .
BIOINFORMATICS, 2002, 18 (08) :1116-1123
[25]   Anchor residues in protein-protein interactions [J].
Rajamani, D ;
Thiel, S ;
Vajda, S ;
Camacho, CJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (31) :11287-11292
[26]   DATABASE OF HOMOLOGY-DERIVED PROTEIN STRUCTURES AND THE STRUCTURAL MEANING OF SEQUENCE ALIGNMENT [J].
SANDER, C ;
SCHNEIDER, R .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1991, 9 (01) :56-68
[27]   RASMOL - BIOMOLECULAR GRAPHICS FOR ALL [J].
SAYLE, RA ;
MILNERWHITE, EJ .
TRENDS IN BIOCHEMICAL SCIENCES, 1995, 20 (09) :374-376
[28]   ASEdb: a database of alanine mutations and their effects on the free energy of binding in protein interactions [J].
Thorn, KS ;
Bogan, AA .
BIOINFORMATICS, 2001, 17 (03) :284-285
[29]  
WELLS JA, 1991, METHOD ENZYMOL, V202, P390
[30]   Multiple modes of peptide recognition by the PTB domain of the cell fate determinant Numb [J].
Zwahlen, C ;
Li, SC ;
Kay, LE ;
Pawson, T ;
Forman-Kay, JD .
EMBO JOURNAL, 2000, 19 (07) :1505-1515