FREAD revisited: Accurate loop structure prediction using a database search algorithm

被引:102
作者
Choi, Yoonjoo [1 ]
Deane, Charlotte M. [1 ]
机构
[1] Univ Oxford, Dept Stat, Oxford OX1 3TG, England
关键词
loop prediction; loop modeling; database search; ab initio; CASP; homology modeling; template based modeling; PROTEIN-STRUCTURE PREDICTION; AB-INITIO CONSTRUCTION; POLYPEPTIDE FRAGMENTS; REPRESENTATION; SEGMENTS; REGIONS; TURNS; MODEL; CHAIN;
D O I
10.1002/prot.22658
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Loops are the most variable regions of protein structure and are, in general, the least accurately predicted. Their prediction has been approached in two ways, ab initio and database search. In recent years, it has been thought that ab initio methods are more powerful. In light of the continued rapid expansion in the number of known protein structures, we have re-evaluated FREAD, a database search method and demonstrate that the power of database search methods may have been underestimated. We found that sequence similarity as quantified by environment specific substitution scores can be used to significantly improve prediction. In fact, FREAD performs appreciably better for an identifiable subset of loops (two thirds of shorter loops and half of the longer loops tested) than the ab initio methods of MODELLER, PLOP, and RAPPER. Within this subset, FREAD's predictive ability is length independent, in general, producing results within 2 angstrom RMSD, compared to an average of over 10 angstrom for loop length 20 for any of the other tested methods. We also benchmarked the prediction protocols on a set of 212 loops from the model structures in CASP 7 and 8. An extended version of FREAD is able to make predictions for 127 of these, it gives the best prediction of the methods tested in 61 of these cases. In examining FREAD's ability to predict in the model environment, we found that whole structure quality did not affect the quality of loop predictions.
引用
收藏
页码:1431 / 1440
页数:10
相关论文
共 50 条
[1]   ICM - A NEW METHOD FOR PROTEIN MODELING AND DESIGN - APPLICATIONS TO DOCKING AND STRUCTURE PREDICTION FROM THE DISTORTED NATIVE CONFORMATION [J].
ABAGYAN, R ;
TOTROV, M ;
KUZNETSOV, D .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 1994, 15 (05) :488-506
[2]  
Alberts B., 1994, MOL BIOL CELL
[3]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[4]   18TH KREBS,HANS LECTURE - KNOWLEDGE-BASED PROTEIN MODELING AND DESIGN [J].
BLUNDELL, T ;
CARNEY, D ;
GARDNER, S ;
HAYES, F ;
HOWLIN, B ;
HUBBARD, T ;
OVERINGTON, J ;
SINGH, DA ;
SIBANDA, BL ;
SUTCLIFFE, M .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 1988, 172 (03) :513-520
[5]   CONFORMATIONAL SAMPLING USING HIGH-TEMPERATURE MOLECULAR-DYNAMICS [J].
BRUCCOLERI, RE ;
KARPLUS, M .
BIOPOLYMERS, 1990, 29 (14) :1847-1862
[6]   PREDICTION OF SECONDARY STRUCTURE BY EVOLUTIONARY COMPARISON - APPLICATION TO THE ALPHA-SUBUNIT OF TRYPTOPHAN SYNTHASE [J].
CRAWFORD, IP ;
NIERMANN, T ;
KIRSCHNER, K .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1987, 2 (02) :118-129
[7]   Ab initio construction of polypeptide fragments: Accuracy of loop decoy discrimination by an all-atom statistical potential and the AMBER force field with the generalized born solvation model [J].
de Bakker, PIW ;
DePristo, MA ;
Burke, DF ;
Blundell, TL .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 51 (01) :21-40
[8]   CODA: A combined algorithm for predicting the structurally variable regions of protein models [J].
Deane, CM ;
Blundell, TL .
PROTEIN SCIENCE, 2001, 10 (03) :599-612
[9]  
Deane CM, 2000, PROTEINS, V40, P135, DOI 10.1002/(SICI)1097-0134(20000701)40:1<135::AID-PROT150>3.3.CO
[10]  
2-T