Correlation between sequence hydrophobicity and surface-exposure pattern of database proteins

被引:93
作者
Moelbert, S
Emberly, E
Tang, C [1 ]
机构
[1] NEC Labs Amer, Princeton, NJ 08540 USA
[2] Univ Lausanne, Inst Phys Theor, CH-1015 Lausanne, Switzerland
[3] Rockefeller Univ, Ctr Studies Phys & Biol, New York, NY 10021 USA
关键词
hydrophobicity; protein folding; surface exposure; secondary structure; designability;
D O I
10.1110/ps.03431704
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Hydrophobicity is thought to be one of the primary forces driving the folding of proteins. On average, hydrophobic residues occur preferentially in the core, whereas polar residues tend to occur at the surface of a folded protein. By analyzing the known protein structures, we quantify the degree to which the hydrophobicity sequence of a protein correlates with its pattern of surface exposure. We have assessed the statistical significance of this correlation for several hydrophobicity scales in the literature, and find that the computed correlations are significant but far from optimal. We show that this less than optimal correlation arises primarily from the large degree of mutations that naturally occurring proteins can tolerate. Lesser effects are due in part to forces other than hydrophobicity, and we quantify this by analyzing the surface-exposure distributions of all amino acids. Lastly, we show that our database findings are consistent with those found from an off-lattice hydrophobic-polar model of protein folding.
引用
收藏
页码:752 / 762
页数:11
相关论文
共 42 条
[1]   PRINCIPLES THAT GOVERN FOLDING OF PROTEIN CHAINS [J].
ANFINSEN, CB .
SCIENCE, 1973, 181 (4096) :223-230
[2]  
BRANDEN C, 1999, INTRO PROTEIN STRUCT, P6
[3]  
Chan HS, 2000, PROTEINS, V40, P543, DOI 10.1002/1097-0134(20000901)40:4<543::AID-PROT20>3.0.CO
[4]  
2-O
[5]  
Chan Hue Sun, 2002, Appl Bioinformatics, V1, P121
[6]   HYDROPHOBIC BONDING AND ACCESSIBLE SURFACE-AREA IN PROTEINS [J].
CHOTHIA, C .
NATURE, 1974, 248 (5446) :338-339
[7]   PROTEINS - 1000 FAMILIES FOR THE MOLECULAR BIOLOGIST [J].
CHOTHIA, C .
NATURE, 1992, 357 (6379) :543-544
[8]  
Creighton T. E., 1993, PROTEINS STRUCTURES, P154
[9]   Multiple-sequence information provides protection against mis-specified potential energy functions in the lattice model of proteins [J].
Cui, Y ;
Wong, WH .
PHYSICAL REVIEW LETTERS, 2000, 85 (24) :5242-5245
[10]   Proteins from scratch [J].
DeGrado, WF .
SCIENCE, 1997, 278 (5335) :80-81