MULTICOM: a multi-level combination approach to protein structure prediction and its assessments in CASP8

被引:77
作者
Wang, Zheng [1 ]
Eickholt, Jesse [1 ]
Cheng, Jianlin [1 ,2 ,3 ]
机构
[1] Univ Missouri, Dept Comp Sci, Columbia, MO 65211 USA
[2] Univ Missouri, Inst Informat, Columbia, MO 65211 USA
[3] Univ Missouri, C Bond Life Sci Ctr, Columbia, MO 65211 USA
关键词
MULTIPLE SEQUENCE ALIGNMENT; HIGH-ACCURACY; PROGRESS; MODELS;
D O I
10.1093/bioinformatics/btq058
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Protein structure prediction is one of the most important problems in structural bioinformatics. Here we describe MULTICOM, a multi-level combination approach to improve the various steps in protein structure prediction. In contrast to those methods which look for the best templates, alignments and models, our approach tries to combine complementary and alternative templates, alignments and models to achieve on average better accuracy. Results: The multi-level combination approach was implemented via five automated protein structure prediction servers and one human predictor which participated in the eighth Critical Assessment of Techniques for Protein Structure Prediction (CASP8), 2008. The MULTICOM servers and human predictor were consistently ranked among the top predictors on the CASP8 benchmark. The methods can predict moderate-to high-resolution models for most template-based targets and low-resolution models for some template-free targets. The results show that the multi-level combination of complementary templates, alternative alignments and similar models aided by model quality assessment can systematically improve both template-based and template-free protein modeling.
引用
收藏
页码:882 / 888
页数:7
相关论文
共 48 条
[1]   Protein structure prediction and structural genomics [J].
Baker, D ;
Sali, A .
SCIENCE, 2001, 294 (5540) :93-96
[2]   Assessment of CASP8 structure predictions for template free targets [J].
Ben-David, Moshe ;
Noivirt-Brik, Orly ;
Paz, Aviv ;
Prilusky, Jaime ;
Sussman, Joel L. ;
Levy, Yaakov .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2009, 77 :50-65
[3]   SCRATCH: a protein structure and structural feature prediction server [J].
Cheng, J ;
Randall, AZ ;
Sweredoski, MJ ;
Baldi, P .
NUCLEIC ACIDS RESEARCH, 2005, 33 :W72-W76
[4]   A machine learning information retrieval approach to protein fold recognition [J].
Cheng, Jianlin ;
Baldi, Pierre .
BIOINFORMATICS, 2006, 22 (12) :1456-1463
[5]   Prediction of global and local quality of CASP8 models by MULTICOM series [J].
Cheng, Jianlin ;
Wang, Zheng ;
Tegge, Allison N. ;
Eickholt, Jesse .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2009, 77 :181-184
[6]   A multi-template combination algorithm for protein comparative modeling [J].
Cheng, Jianlin .
BMC STRUCTURAL BIOLOGY, 2008, 8
[7]   Evaluation of template-based models in CASP8 with standard measures [J].
Cozzetto, Domenico ;
Kryshtafovych, Andriy ;
Fidelis, Krzysztof ;
Moult, John ;
Rost, Burkhard ;
Tramontano, Anna .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2009, 77 :18-28
[8]   MUSCLE: multiple sequence alignment with high accuracy and high throughput [J].
Edgar, RC .
NUCLEIC ACIDS RESEARCH, 2004, 32 (05) :1792-1797
[9]   SATCHMO:: sequence alignment and tree construction using hidden Markov models [J].
Edgar, RC ;
Sjölander, K .
BIOINFORMATICS, 2003, 19 (11) :1404-1411
[10]   MODELLER: Generation and refinement of homology-based protein structure models [J].
Fiser, A ;
Sali, A .
MACROMOLECULAR CRYSTALLOGRAPHY, PT D, 2003, 374 :461-491