STATISTICS OF SEQUENCE-STRUCTURE THREADING

被引:108
作者
BRYANT, SH
ALTSCHUL, SF
机构
[1] Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894
关键词
D O I
10.1016/0959-440X(95)80082-4
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The past two years have seen the rapid development of new recognition methods for protein structure prediction. These algorithms 'thread' the sequence of one protein through the known structure of another, looking for an alignment that corresponds to an energetically favorable model structure. Because they are based on energy calculation, rather than evolutionary distance, these methods extend the possibility of structure prediction by comparative modeling to a larger class of new sequences, where similarity to known structures is recognizable by no other means. The strength of the evidence they offer should be judged by objective statistical tests, however, so as to rule out the possibility that favorable scores arise from chance factors such as similarity of length, composition, or the consideration of a large number of alternative alignments. Calculation of objective p-values by analytical means is not yet possible, but it would appear that approximate values may be obtained by simulation, as they are in gapped, global sequence alignment. We propose that the results of threading experiments should include Z-scores relative to the composition-corrected score distribution obtained for shuffled and optimally aligned sequences.
引用
收藏
页码:236 / 244
页数:9
相关论文
共 73 条
[1]   RECOGNITION OF DISTANTLY RELATED PROTEINS THROUGH ENERGY CALCULATIONS [J].
ABAGYAN, R ;
FRISHMAN, D ;
ARGOS, P .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1994, 19 (02) :132-140
[2]  
ABOLA EE, 1987, CRYSTALLOGRAPHIC DAT, P107
[3]   A NONLINEAR MEASURE OF SUBALIGNMENT SIMILARITY AND ITS SIGNIFICANCE LEVELS [J].
ALTSCHUL, SF ;
ERICKSON, BW .
BULLETIN OF MATHEMATICAL BIOLOGY, 1986, 48 (5-6) :617-632
[4]   ISSUES IN SEARCHING MOLECULAR SEQUENCE DATABASES [J].
ALTSCHUL, SF ;
BOGUSKI, MS ;
GISH, W ;
WOOTTON, JC .
NATURE GENETICS, 1994, 6 (02) :119-129
[5]  
ALTSCHUL SF, 1985, MOL BIOL EVOL, V2, P526
[6]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[7]  
Argos Patrick, 1994, Current Opinion in Biotechnology, V5, P361, DOI 10.1016/0958-1669(94)90044-2
[8]   THE ERDOS-RENYI STRONG LAW FOR PATTERN-MATCHING WITH A GIVEN PROPORTION OF MISMATCHES [J].
ARRATIA, R ;
WATERMAN, MS .
ANNALS OF PROBABILITY, 1989, 17 (03) :1152-1169
[9]   AN EXTREME VALUE THEORY FOR SEQUENCE MATCHING [J].
ARRATIA, R ;
GORDON, L ;
WATERMAN, M .
ANNALS OF STATISTICS, 1986, 14 (03) :971-993
[10]   `A PHASE TRANSITION FOR THE SCORE IN MATCHING RANDOM SEQUENCES ALLOWING DELETIONS [J].
Arratia, Richard ;
Waterman, Michael S. .
ANNALS OF APPLIED PROBABILITY, 1994, 4 (01) :200-225