Using a library of structural templates to recognise catalytic sites and explore their evolution in homologous families

被引:100
作者
Torrance, JW [1 ]
Bartlett, GJ [1 ]
Porter, CT [1 ]
Thornton, JM [1 ]
机构
[1] EMBL, European Bioinformat Inst, Cambridge CB10 1SDWEL, England
基金
英国生物技术与生命科学研究理事会;
关键词
function prediction; structural template; catalytic site atlas; active site; catalytic residue;
D O I
10.1016/j.jmb.2005.01.044
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Catalytic site structure is normally highly conserved between distantly related enzymes. As a consequence, templates representing catalytic sites have the potential to succeed at function prediction in cases where methods based on sequence or overall structure fail. There are many methods for searching protein structures for matches to structural templates, but few validated template libraries to use with these methods. We present a library of structural templates representing catalytic sites, based on information from the scientific literature. Furthermore, we analyse homologous template families to discover the diversity within families and the utility of templates for active site recognition. Templates representing the catalytic sites of homologous proteins mostly differ by less than I A root mean square deviation, even when the sequence similarity between the two proteins is low. Within these sets of homologues there is usually no discernible relationship between catalytic site structure similarity and sequence similarity. Because of this structural conservation of catalytic sites, the templates can discriminate between matches to related proteins and random matches with over 85% sensitivity and predictive accuracy. Templates based on protein backbone positions are more discriminating than those based on side-chain atoms. These analyses show encouraging prospects for prediction of functional sites in structural genomics structures of unknown function, and will be of use in analyses of convergent evolution and exploring relationships between active site geometry and chemistry. The template library can be queried via a web server at www.ebi.ac.uk/thornton-srv/databases/CSS and is available for download. (c) 2005 Elsevier Ltd. All rights reserved.
引用
收藏
页码:565 / 581
页数:17
相关论文
共 56 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
[Anonymous], 1999, Mathematical Methods of Statistics
[3]   Large-scale assessment of the utility of low-resolution protein structures for biochemical function assignment [J].
Arakaki, AK ;
Zhang, Y ;
Skolnick, J .
BIOINFORMATICS, 2004, 20 (07) :1087-1096
[4]   A GRAPH-THEORETIC APPROACH TO THE IDENTIFICATION OF 3-DIMENSIONAL PATTERNS OF AMINO-ACID SIDE-CHAINS IN PROTEIN STRUCTURES [J].
ARTYMIUK, PJ ;
POIRRETTE, AR ;
GRINDLEY, HM ;
RICE, DW ;
WILLETT, P .
JOURNAL OF MOLECULAR BIOLOGY, 1994, 243 (02) :327-344
[5]   The enolase superfamily: A general strategy for enzyme-catalyzed abstraction of the alpha-protons of carboxylic acids [J].
Babbitt, PC ;
Hasson, MS ;
Wedekind, JE ;
Palmer, DRJ ;
Barrett, WC ;
Reed, GH ;
Rayment, I ;
Ringe, D ;
Kenyon, GL ;
Gerlt, JA .
BIOCHEMISTRY, 1996, 35 (51) :16489-16501
[6]   An algorithm for constraint-based structural template matching: application to 3D templates with statistical analysis [J].
Barker, JA ;
Thornton, JM .
BIOINFORMATICS, 2003, 19 (13) :1644-1649
[7]   CRYSTAL-STRUCTURES AT 2.5 ANGSTROM RESOLUTION OF SERYL-TRANSFER-RNA SYNTHETASE COMPLEXED 2 ANALOGS OF SERYL ADENYLATE [J].
BELRHALI, H ;
YAREMCHUK, A ;
TUKALO, M ;
LARSEN, K ;
BERTHETCOLOMINAS, C ;
LEBERMAN, R ;
BEIJER, B ;
SPROAT, B ;
ALSNIELSEN, J ;
GRUBEL, G ;
LEGRAND, JF ;
LEHMANN, M ;
CUSACK, S .
SCIENCE, 1994, 263 (5152) :1432-1436
[8]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[9]   The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 [J].
Boeckmann, B ;
Bairoch, A ;
Apweiler, R ;
Blatter, MC ;
Estreicher, A ;
Gasteiger, E ;
Martin, MJ ;
Michoud, K ;
O'Donovan, C ;
Phan, I ;
Pilbout, S ;
Schneider, M .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :365-370
[10]   Towards a structural classification of phosphate binding sites in protein-nucleotide complexes: An automated all-against-all structural comparison using geometric matching [J].
Brakoulias, A ;
Jackson, RM .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 56 (02) :250-260