What is the probability of a chance prediction of a protein structure with an rmsd of 6 Å?

Cited by: 195
Authors
Reva, BA
Finkelstein, AV
Skolnick, J
Affiliations
[1] Scripps Res Inst, Dept Mol Biol, La Jolla, CA 92037 USA
[2] Russian Acad Sci, Inst Math Problems Biol, Pushchino 142292, Moscow Region, Russia
[3] Russian Acad Sci, Inst Prot Res, Pushchino 142292, Moscow Region, Russia
Source
FOLDING & DESIGN | 1998 / Vol. 3 / Issue 02
Funding
US National Institutes of Health (NIH);
Keywords
rmsd distribution; threading; protein-like structure; protein structure prediction;
DOI
10.1016/S1359-0278(98)00019-4
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: The root mean square deviation (rmsd) between corresponding atoms of two protein chains is a commonly used measure of similarity between two protein structures. The smaller the rmsd between two structures, the more similar the two structures are. In protein structure prediction, one needs to know the rmsd between predicted and experimental structures below which a prediction can be considered successful. Success is obvious only when the rmsd is as small as that for closely homologous proteins (< 3 Å). To estimate the quality of the prediction in the more general case, one has to compare the native structure not only with the predicted one but also with randomly chosen protein-like folds. One can ask: how many such structures must be considered to find a structure with a given rmsd from the native structure? Results: We calculated the rmsd values between native structures of 142 proteins and all compact structures obtained in the threading of these protein chains over 364 non-homologous structures. The rmsd distributions have a Gaussian form, with the average rmsd approximately proportional to the radius of gyration. Conclusions: We estimated the number of protein-like structures required to obtain a structure within an rmsd of 6 Å to be 10^4 to 10^5 for chains of 60-80 residues and 10^11 to 10^12 for chains of 160-200 residues. The probability of obtaining a 6 Å rmsd by chance is so remote that when such structures are obtained from a prediction algorithm, the prediction should be considered quite successful.
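The reasoning in the abstract (a Gaussian rmsd distribution, and the number of random structures needed to reach a given rmsd) can be illustrated with a minimal sketch. It assumes the chance probability is the lower tail of a normal distribution and that the expected number of structures is its reciprocal. The function names and the mean and standard deviation used below are hypothetical placeholders, not values from the paper, and this is not necessarily the authors' exact procedure.

```python
import math


def chance_probability(threshold, mean, sd):
    """Lower-tail probability P(rmsd <= threshold) for a Gaussian
    rmsd distribution with the given mean and standard deviation (in Å).

    Written with erfc so that very small tail probabilities are not
    lost to floating-point cancellation.
    """
    z = (mean - threshold) / (sd * math.sqrt(2.0))
    return 0.5 * math.erfc(z)


def structures_needed(threshold, mean, sd):
    """Expected number of random structures to examine before one falls
    within `threshold` Å of the native structure (simply 1 / p)."""
    p = chance_probability(threshold, mean, sd)
    return math.inf if p == 0.0 else 1.0 / p


if __name__ == "__main__":
    # Hypothetical distribution parameters, chosen only for illustration.
    example_mean, example_sd = 12.0, 1.5
    p = chance_probability(6.0, example_mean, example_sd)
    n = structures_needed(6.0, example_mean, example_sd)
    print(f"P(rmsd <= 6 Å) = {p:.3g}; structures needed = {n:.3g}")
```

Because the average rmsd is reported to grow roughly in proportion to the radius of gyration, a longer chain moves the 6 Å threshold further into the tail, which is consistent with the much larger counts quoted for chains of 160-200 residues.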
Pages: 141-147
Page count: 7