Extension of a local backbone description using a structural alphabet:: A new approach to the sequence-structure relationship

被引:43
作者
de Brevern, AG [1 ]
Valadié, H [1 ]
Hazout, S [1 ]
Etchebest, C [1 ]
机构
[1] Univ Denis DIDEROT Paris 7, INSERM, U436, EBGM, F-75251 Paris, France
关键词
3D local structure prediction; 3D protein topology; probabilistic approach; sequence-structure relationship; structural alphabet; 3D overlapping motifs;
D O I
10.1110/ps.0220502
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Protein Blocks (PBs) comprise a structural alphabet of 16 protein fragments, each 5 Cot long. They make it possible to approximate and correctly predict local protein three-dimensional (3D) structures. We have selected the 72 most frequent sequences of five PBs, which we call Structural Words (SWs). Analysis of four different protein data banks shows that SWs cover 92% of the amino acids in them and provide a good structural approximation for residues (i.e., sequences) 9 Calpha long. We present most of them in a simple network that describes 90% of the overall residues and, interestingly, includes more than 80% of the amino acids present in coils. Analysis of the network shows the specificity and quality of the 3D descriptions as well as a new type of relation between local folds and amino acid distribution. The results show that the 3D structure of these protein data banks can be easily described by a combination of subgraphs included in the network. Finally, a Bayesian probabilistic approach improved the prediction rate by 4%.
引用
收藏
页码:2871 / 2886
页数:16
相关论文
共 80 条
  • [1] Helix capping
    Aurora, R
    Rose, GD
    [J]. PROTEIN SCIENCE, 1998, 7 (01) : 21 - 38
  • [2] Protein structure prediction and structural genomics
    Baker, D
    Sali, A
    [J]. SCIENCE, 2001, 294 (5540) : 93 - 96
  • [3] HELIX GEOMETRY IN PROTEINS
    BARLOW, DJ
    THORNTON, JM
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1988, 201 (03) : 601 - 619
  • [4] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [5] PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES
    BERNSTEIN, FC
    KOETZLE, TF
    WILLIAMS, GJB
    MEYER, EF
    BRICE, MD
    RODGERS, JR
    KENNARD, O
    SHIMANOUCHI, T
    TASUMI, M
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) : 535 - 542
  • [6] Bonneau R, 2001, PROTEINS, P119
  • [7] Ab initio protein structure prediction: Progress and prospects
    Bonneau, R
    Baker, D
    [J]. ANNUAL REVIEW OF BIOPHYSICS AND BIOMOLECULAR STRUCTURE, 2001, 30 : 173 - 189
  • [8] Boutonnet NS, 1998, PROTEINS, V30, P193, DOI 10.1002/(SICI)1097-0134(19980201)30:2<193::AID-PROT9>3.0.CO
  • [9] 2-O
  • [10] The ASTRAL compendium for protein structure and sequence analysis
    Brenner, SE
    Koehl, P
    Levitt, R
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 254 - 256