Triage protein fold prediction

被引:9
作者
He, HX [1 ]
McAllister, G [1 ]
Smith, TF [1 ]
机构
[1] Boston Univ, Dept Biomed Engn, Biomol Engn Res Ctr, Boston, MA 02215 USA
来源
PROTEINS-STRUCTURE FUNCTION AND GENETICS | 2002年 / 48卷 / 04期
关键词
protein fold prediction; threading; HMM; DSM; triage;
D O I
10.1002/prot.10194
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We have constructed, in a completely automated fashion, a new structure template library for threading that represents 358 distinct SCOP folds where each model is mathematically represented as a Hidden Markov model (HMM). Because the large number of models in the library can potentially dilute the prediction measure, a new triage method for fold prediction is employed. In the first step of the triage method, the most probable structural class is predicted using a set of manually constructed, high-level, generalized structural HMMs that represent seven general protein structural classes: all-a, all-P, alpha/beta, alpha+beta, irregular small metal-binding, transmembrane P-barrel, and transmembrane a-helical. In the second step, only those fold models belonging to the determined structural class are selected for the final fold prediction. This triage method gave more predictions as well as more correct predictions compared with a simple prediction method that lacks the initial classification step. Two different schemes of assigning Bayesian model priors are presented and discussed. Proteins 2002;48: 654-663. (C) 2002 Wiley-Liss, Inc. (C) 2002 Wiley-Liss, Inc.
引用
收藏
页码:654 / 663
页数:10
相关论文
共 30 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] BAKIS R, 1976, P ASA M WASH DC APR
  • [3] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [4] Automatic pattern embedding in protein structure models
    Bienkowska, J
    He, HX
    Smith, TF
    [J]. IEEE INTELLIGENT SYSTEMS, 2001, 16 (06) : 21 - 25
  • [5] Bienkowska JR, 2000, PROTEINS, V40, P451, DOI 10.1002/1097-0134(20000815)40:3<451::AID-PROT110>3.0.CO
  • [6] 2-J
  • [7] A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE
    BOWIE, JU
    LUTHY, R
    EISENBERG, D
    [J]. SCIENCE, 1991, 253 (5016) : 164 - 170
  • [8] IDENTIFICATION OF PROTEIN FOLDS - MATCHING HYDROPHOBICITY PATTERNS OF SEQUENCE SETS WITH SOLVENT ACCESSIBILITY PATTERNS OF KNOWN STRUCTURES
    BOWIE, JU
    CLARKE, ND
    PABO, CO
    SAUER, RT
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1990, 7 (03): : 257 - 264
  • [9] AN EMPIRICAL ENERGY FUNCTION FOR THREADING PROTEIN-SEQUENCE THROUGH THE FOLDING MOTIF
    BRYANT, SH
    LAWRENCE, CE
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1993, 16 (01) : 92 - 112
  • [10] Hidden Markov models
    Eddy, SR
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 1996, 6 (03) : 361 - 365