Constructing amino acid residue substitution classes maximally indicative of local protein structure

被引:0
作者
Thompson, MJ [1 ]
Goldstein, RA [1 ]
机构
[1] UNIV MICHIGAN, DEPT CHEM, DIV BIOPHYS RES, ANN ARBOR, MI 48109 USA
关键词
protein evolution; structure prediction; information theory; amino acid substitution; multiple sequence alignment;
D O I
10.1002/(SICI)1097-0134(199605)25:1<28::AID-PROT3>3.0.CO;2-G
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Using an information theoretic formalism, we optimize classes of amino acid substitution to be maximally indicative of local protein structure. Our statistically-derived classes are loosely identifiable with the heuristic constructions found in previously published work, However, while these other methods provide a more rigid idealization of physicochemically constrained residue substitution, our classes provide substantially more structural information with many fewer parameters. Moreover, these substitution classes are consistent with the paradigmatic view of the sequence-to-structure relationship in globular proteins which holds that the three-dimensional architecture is predominantly determined by the arrangement of hydrophobic and polar side chains with weak constraints on the actual amino acid identities. More specific constraints are imposed on the placement of prolines, glycines, and the charged residues, These substitution classes have been used in highly accurate predictions of residue solvent accessibility. They could also be used in the identification of homologous proteins, the construction and refinement of multiple sequence alignments, and as a means of condensing and codifying the information in multiple sequence alignments for secondary structure prediction and tertiary fold recognition. (C) 1996 Wiley-Liss, Inc.
引用
收藏
页码:28 / 37
页数:10
相关论文
共 66 条
[1]  
ABOLA EE, 1987, BONN CAMBRIDGE CHEST, P107
[2]  
[Anonymous], 1989, Prediction of protein structures and the principles of protein conformation
[3]  
[Anonymous], CURR OPIN STRUCT BIO
[4]  
BAIROCH A, 1994, NUCLEIC ACIDS RES, V22, P3578
[5]   PROSITE - A DICTIONARY OF SITES AND PATTERNS IN PROTEINS [J].
BAIROCH, A .
NUCLEIC ACIDS RESEARCH, 1991, 19 :2241-2245
[6]   FLEXIBLE PROTEIN-SEQUENCE PATTERNS - A SENSITIVE METHOD TO DETECT WEAK STRUCTURAL SIMILARITIES [J].
BARTON, GJ ;
STERNBERG, MJE .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 212 (02) :389-402
[7]   PROTEIN-STRUCTURE PREDICTION [J].
BARTON, GJ ;
RUSSELL, RB .
NATURE, 1993, 361 (6412) :505-506
[8]   BONA-FIDE PREDICTION OF ASPECTS OF PROTEIN CONFORMATION - ASSIGNING INTERIOR AND SURFACE RESIDUES FROM PATTERNS OF VARIATION AND CONSERVATION IN HOMOLOGOUS PROTEIN SEQUENCES [J].
BENNER, SA ;
BADCOE, I ;
COHEN, MA ;
GERLOFF, DL .
JOURNAL OF MOLECULAR BIOLOGY, 1994, 235 (03) :926-958
[9]   PATTERNS OF DIVERGENCE IN HOMOLOGOUS PROTEINS AS INDICATORS OF SECONDARY AND TERTIARY STRUCTURE - A PREDICTION OF THE STRUCTURE OF THE CATALYTIC DOMAIN OF PROTEIN-KINASES [J].
BENNER, SA ;
GERLOFF, D .
ADVANCES IN ENZYME REGULATION, 1991, 31 :121-181
[10]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542