STATISTICS OF SEQUENCE-STRUCTURE THREADING

被引:108
作者
BRYANT, SH
ALTSCHUL, SF
机构
[1] Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894
关键词
D O I
10.1016/0959-440X(95)80082-4
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The past two years have seen the rapid development of new recognition methods for protein structure prediction. These algorithms 'thread' the sequence of one protein through the known structure of another, looking for an alignment that corresponds to an energetically favorable model structure. Because they are based on energy calculation, rather than evolutionary distance, these methods extend the possibility of structure prediction by comparative modeling to a larger class of new sequences, where similarity to known structures is recognizable by no other means. The strength of the evidence they offer should be judged by objective statistical tests, however, so as to rule out the possibility that favorable scores arise from chance factors such as similarity of length, composition, or the consideration of a large number of alternative alignments. Calculation of objective p-values by analytical means is not yet possible, but it would appear that approximate values may be obtained by simulation, as they are in gapped, global sequence alignment. We propose that the results of threading experiments should include Z-scores relative to the composition-corrected score distribution obtained for shuffled and optimally aligned sequences.
引用
收藏
页码:236 / 244
页数:9
相关论文
共 73 条
[11]   EMPIRICAL AND STRUCTURAL MODELS FOR INSERTIONS AND DELETIONS IN THE DIVERGENT EVOLUTION OF PROTEINS [J].
BENNER, SA ;
COHEN, MA ;
GONNET, GH .
JOURNAL OF MOLECULAR BIOLOGY, 1993, 229 (04) :1065-1082
[12]   CATCHING A COMMON FOLD [J].
BLUNDELL, TL ;
JOHNSON, MS .
PROTEIN SCIENCE, 1993, 2 (06) :877-883
[13]   A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE [J].
BOWIE, JU ;
LUTHY, R ;
EISENBERG, D .
SCIENCE, 1991, 253 (5016) :164-170
[14]   IDENTIFICATION OF PROTEIN FOLDS - MATCHING HYDROPHOBICITY PATTERNS OF SEQUENCE SETS WITH SOLVENT ACCESSIBILITY PATTERNS OF KNOWN STRUCTURES [J].
BOWIE, JU ;
CLARKE, ND ;
PABO, CO ;
SAUER, RT .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1990, 7 (03) :257-264
[15]   INVERTED PROTEIN-STRUCTURE PREDICTION [J].
BOWIE, JU ;
EISENBERG, D .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1993, 3 (03) :437-444
[16]   THE FREQUENCY OF ION-PAIR SUBSTRUCTURES IN PROTEINS IS QUANTITATIVELY RELATED TO ELECTROSTATIC POTENTIAL - A STATISTICAL-MODEL FOR NONBONDED INTERACTIONS [J].
BRYANT, SH ;
LAWRENCE, CE .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1991, 9 (02) :108-119
[17]  
BRYANT SH, 1987, INT J PEPT PROT RES, V29, P46
[18]   AN EMPIRICAL ENERGY FUNCTION FOR THREADING PROTEIN-SEQUENCE THROUGH THE FOLDING MOTIF [J].
BRYANT, SH ;
LAWRENCE, CE .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1993, 16 (01) :92-112
[19]  
CASTONGUAY LA, 1995, PROTEIN SCI, V4, P472
[20]   THE RELATION BETWEEN THE DIVERGENCE OF SEQUENCE AND STRUCTURE IN PROTEINS [J].
CHOTHIA, C ;
LESK, AM .
EMBO JOURNAL, 1986, 5 (04) :823-826