MUPRED: A tool for bridging the gap between template based methods and sequence profile based methods for protein secondary structure prediction

被引：29

作者：

Bondugula, Rajkumar ^{[1
]}

Xu, Dong ^{[1
]}

机构：

[1] Univ Missouri, Christopher S Bond Life Sci Ctr 271C, Digital Biol Lab, Dept Comp Sci, Columbia, MO 65211 USA

来源：

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS | 2007年 / 66卷 / 03期

关键词：

protein secondary structure prediction; fuzzy nearest neighbor; neural network; hybrid prediction system; sequence profile; template; prediction accuracy assessment;

D O I：

10.1002/prot.21177

中图分类号：

Q5 [生物化学]; Q7 [分子生物学];

学科分类号：

071010 ; 081704 ;

摘要：

Predicting secondary structures from a protein sequence is an important step for characterizing the structural properties of a protein. Existing methods for protein secondary structure prediction can be broadly classified into template based or sequence profile based methods. We propose a novel framework that bridges the gap between the two fundamentally different approaches. Our framework integrates the information from the fuzzy k-nearest neighbor algorithm and position-specific scoring matrices using a neural network. It combines the strengths of the two methods and has a better potential to use the information in both the sequence and structure databases than existing methods. We implemented the framework into a software system MUPRED. MUPRED has achieved three-state prediction accuracy (Q(3)) ranging from 79.2 to 80.14%, depending on which benchmark dataset is used. A higher Q(3) can be achieved if a query protein has a significant sequence identity (> 25%) to a template in PDB. MUPRED also estimates the prediction accuracy at the individual residue level more quantitatively than existing methods. The MUPRED web server and executables are freely available at http://digbio.missouri.edu/mupred. Proteins 2007; 66:664-670. (c) 2006 Wiley-Liss, Inc.

引用

页码：664 / 670

页数：7

共 23 条

[1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].