TASSER_low-zsc: An approach to improve structure prediction using low z-score-ranked templates

被引:3
作者
Pandit, Shashi B. [1 ]
Skolnick, Jeffrey [1 ]
机构
[1] Georgia Inst Technol, Sch Biol, Ctr Study Syst Biol, Atlanta, GA 30318 USA
基金
美国国家卫生研究院;
关键词
structure prediction; threading; TASSER; tertiary structure; PROTEIN-STRUCTURE PREDICTION; MODEL QUALITY ASSESSMENT; FOLD-RECOGNITION; HOMOLOGY DETECTION; SEQUENCE PROFILES; INFORMATION; ALIGNMENT; ALGORITHM; GENOMICS; SERVER;
D O I
10.1002/prot.22791
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In a variety of threading methods, often poorly ranked (low z-score) templates have good alignments. Here, a new method, TASSER_low-zsc that identifies these low z-score ranked templates to improve protein structure prediction accuracy, is described. The approach consists of clustering of threading templates by affinity propagation on the basis of structural similarity (thread_cluster) followed by TASSER modeling, with final models selected by using a TASSER_QA variant. To establish the generality of the approach, templates provided by two threading methods, SP3 and SPARKS(2), are examined. The SP3 and SPARKS(2) benchmark datasets consist of 351 and 357 medium/hard proteins (those with moderate to poor quality templates and/or alignments) of length <= 250 residues, respectively. For SP3 medium and hard targets, using thread cluster, the TM-scores of the best template improve by similar to 4 and 9% over the original set (without low z-score templates) respectively; after TASSER modeling/refinement and ranking, the best model improves by similar to 7 and 9% over the best model generated with the original template set. Moreover, TASSER_low-zsc generates 22% (43%) more foldable medium (hard) targets. Similar improvements are observed with low-ranked templates from SPARKS(2). The template clustering approach could be applied to other modeling methods that utilize multiple templates to improve structure prediction.
引用
收藏
页码:2769 / 2780
页数:12
相关论文
共 57 条
[1]   Iterated profile searches with PSI-BLAST - a tool for discovery in protein databases [J].
Altschul, SF ;
Koonin, EV .
TRENDS IN BIOCHEMICAL SCIENCES, 1998, 23 (11) :444-447
[2]   Protein structure prediction and structural genomics [J].
Baker, D ;
Sali, A .
SCIENCE, 2001, 294 (5540) :93-96
[3]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[4]   A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE [J].
BOWIE, JU ;
LUTHY, R ;
EISENBERG, D .
SCIENCE, 1991, 253 (5016) :164-170
[5]   A machine learning information retrieval approach to protein fold recognition [J].
Cheng, Jianlin ;
Baldi, Pierre .
BIOINFORMATICS, 2006, 22 (12) :1456-1463
[6]   A multi-template combination algorithm for protein comparative modeling [J].
Cheng, Jianlin .
BMC STRUCTURAL BIOLOGY, 2008, 8
[7]   Homology modeling using parametric alignment ensemble generation with consensus and energy-based model selection [J].
Chivian, Dylan ;
Baker, David .
NUCLEIC ACIDS RESEARCH, 2006, 34 (17)
[8]   Identifying native-like protein structures using physics-based potentials [J].
Dominy, BN ;
Brooks, CL .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2002, 23 (01) :147-160
[9]   3D-SHOTGUN: A novel, cooperative, fold-recognition meta-predictor [J].
Fischer, D .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 51 (03) :434-441
[10]  
Fischer D, 2000, Pac Symp Biocomput, P119