Factors limiting the performance of prediction-based fold recognition methods

被引:0
作者
de la Cruz, X [1 ]
Thornton, JM [1 ]
机构
[1] Univ London Univ Coll, Dept Biochem & Mol Biol, London WC1E 6BT, England
关键词
fold recognition; protein function identification; protein structure prediction; remote homologues; secondary structure and accessibility predictions; sequence annotation; threading;
D O I
暂无
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In the past few years, a new generation of fold recognition methods has been developed, in which the classical sequence information is combined with information obtained from secondary structure and, sometimes, accessibility predictions. The results are promising, indicating that this approach may compete with potential-based methods (Rost B et al., 1997, J Mol Biol 270:471-480). Here we present a systematic study of the different factors contributing to the performance of these methods, in particular when applied to the problem of fold recognition of remote homologues. Our results indicate that secondary structure and accessibility prediction methods have reached an accuracy level where they are not the major factor limiting the accuracy of fold recognition. The pattern degeneracy problem is confirmed as the major source of error of these methods. On the basis of these results, we study three different options to overcome these limitations: normalization schemes, mapping of the coil state into the different zones of the Ramachandran plot, and post-threading graphical analysis.
引用
收藏
页码:750 / 759
页数:10
相关论文
共 41 条
[1]   Seeking an ancient enzyme in Methanococcus jannaschii using ORF, a program based on predicted secondary structure comparisons [J].
Aurora, R ;
Rose, GD .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (06) :2818-2823
[2]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[3]   A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE [J].
BOWIE, JU ;
LUTHY, R ;
EISENBERG, D .
SCIENCE, 1991, 253 (5016) :164-170
[4]   STATISTICS OF SEQUENCE-STRUCTURE THREADING [J].
BRYANT, SH ;
ALTSCHUL, SF .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1995, 5 (02) :236-244
[5]  
Bryant SH, 1996, PROTEINS, V26, P172
[6]   AN EMPIRICAL ENERGY FUNCTION FOR THREADING PROTEIN-SEQUENCE THROUGH THE FOLDING MOTIF [J].
BRYANT, SH ;
LAWRENCE, CE .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1993, 16 (01) :92-112
[7]  
Dayhoff M.O., 1978, ATLAS PROTEIN SEQ ST, V5
[8]  
Feng ZK, 1996, FOLD DES, V1, P123
[9]   ALIGNMENT OF PROTEIN SEQUENCES USING SECONDARY STRUCTURE - A MODIFIED DYNAMIC-PROGRAMMING METHOD [J].
FISCHELGHODSIAN, F ;
MATHIOWITZ, G ;
SMITH, TF .
PROTEIN ENGINEERING, 1990, 3 (07) :577-581
[10]  
Fischer D, 1996, PROTEIN SCI, V5, P947