Protein flexibility and intrinsic disorder

被引:261
作者
Radivojac, P
Obradovic, Z
Smith, DK
Zhu, G
Vucetic, S
Brown, CJ
Lawson, JD
Dunker, AK
机构
[1] Washington State Univ, Sch Mol Biosci, Pullman, WA 99164 USA
[2] Hong Kong Univ Sci & Technol, Dept Biochem, Kowloon, Hong Kong, Peoples R China
[3] Univ Hong Kong, Dept Biochem, Hong Kong, Hong Kong, Peoples R China
[4] Temple Univ, Ctr Informat Sci & Technol, Philadelphia, PA 19122 USA
关键词
temperature factor; natively unfolded; intrinsically unstructured; flexibility prediction;
D O I
10.1110/ps.03128904
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Comparisons were made among four categories of protein flexibility: (1) low-13-factor ordered regions, (2) high-B-factor ordered regions, (3) short disordered regions, and (4) long disordered regions. Amino acid compositions of the four categories were found to be significantly different from each other, with high-B-factor ordered and short disordered regions being the most similar pair. The high-B-factor (flexible) ordered regions are characterized by a higher average flexibility index, higher average hydrophilicity, higher average absolute net charge, and higher total charge than disordered regions. The low-B-factor regions are significantly enriched in hydrophobic residues and depleted in the total number of charged residues compared to the other three categories. We examined the predictability of the high-B-factor regions and developed a predictor that discriminates between regions of low and high-B-factors. This predictor achieved an accuracy of 70% and a correlation of 0.43 with experimental data, outperforming the 64% accuracy and 0.32 correlation of predictors based solely on flexibility indices. To further clarify the differences between short disordered regions and ordered regions, a predictor of short disordered regions was developed. Its relatively high accuracy of 81% indicates considerable differences between ordered and disordered regions. The distinctive amino acid biases of high-B-factor ordered regions, short disordered regions, and long disordered regions indicate that the sequence determinants for these flexibility categories differ from one another, whereas the significantly-greater-than-chance predictability of these categories from sequence suggest that flexible ordered regions, short disorder, and long disorder are, to a significant degree, encoded at the primary structure level.
引用
收藏
页码:71 / 80
页数:10
相关论文
共 61 条
[1]  
Altman R. B., 1994, PROT PEPT LETT, V2, P120
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]  
[Anonymous], 2012, CRYSTALLOGRAPHY MADE
[4]   CORRECT STRUCTURE PREDICTION [J].
BENNER, SA ;
COHEN, MA ;
GERLOFF, D .
NATURE, 1992, 359 (6398) :781-781
[5]   GenBank [J].
Benson, DA ;
Boguski, MS ;
Lipman, DJ ;
Ostell, J ;
Ouellette, BFF ;
Rapp, BA ;
Wheeler, DL .
NUCLEIC ACIDS RESEARCH, 1999, 27 (01) :12-17
[6]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[7]  
BHASKARAN R, 1988, INT J PEPT PROT RES, V32, P241
[8]   Sequence and structure-based prediction of eukaryotic protein phosphorylation sites [J].
Blom, N ;
Gammeltoft, S ;
Brunak, S .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 294 (05) :1351-1362
[9]   Bagging predictors [J].
Breiman, L .
MACHINE LEARNING, 1996, 24 (02) :123-140
[10]   Evolutionary rate heterogeneity in proteins with long disordered regions [J].
Brown, CJ ;
Takayama, S ;
Campen, AM ;
Vise, P ;
Marshall, TW ;
Oldfield, CJ ;
Williams, CJ ;
Dunker, AK .
JOURNAL OF MOLECULAR EVOLUTION, 2002, 55 (01) :104-110