Understanding the physical properties that control protein crystallization by analysis of large-scale experimental data

被引:106
|
作者
Price, W. Nicholson, II [1 ,2 ]
Chen, Yang [1 ,2 ]
Handelman, Samuel K. [1 ,2 ]
Neely, Helen [1 ,2 ]
Manor, Philip [1 ,2 ]
Karlin, Richard [1 ,2 ]
Nair, Rajesh [1 ,3 ]
Liu, Jinfeng [1 ,3 ]
Baran, Michael [1 ,4 ]
Everett, John [1 ,4 ]
Tong, Saichiu N. [1 ,4 ]
Forouhar, Farhad [1 ,2 ]
Swaminathan, Swarup S. [1 ,2 ]
Acton, Thomas [1 ,4 ]
Xiao, Rong [1 ,4 ]
Luft, Joseph R. [1 ,5 ]
Lauricella, Angela [1 ,5 ]
DeTitta, George T. [1 ,5 ]
Rost, Burkhard [1 ,3 ]
Montelione, Gaetano T. [1 ,4 ,6 ]
Hunt, John F. [1 ,2 ]
机构
[1] Columbia Univ, NE Struct Genom Consortium, New York, NY 10027 USA
[2] Columbia Univ, Dept Biol Sci, New York, NY 10027 USA
[3] Columbia Univ, Dept Biochem & Mol Biophys, New York, NY 10032 USA
[4] Rutgers State Univ, Ctr Adv Biotechnol & Med, Dept Mol Biol & Biochem, Piscataway, NJ 08854 USA
[5] Hauptman Woodward Inst, Buffalo, NY 14203 USA
[6] Univ Med & Dent New Jersey, Robert Wood Johnson Med Sch, Dept Biochem, Piscataway, NJ 08854 USA
基金
美国国家卫生研究院;
关键词
RNA-POLYMERASE-II; STRUCTURAL GENOMICS; ANGSTROM RESOLUTION; ENTROPY; PROTEOMICS; SERVER; TRANSCRIPTION; MUTATIONS; DISCOVERY; DISORDER;
D O I
10.1038/nbt.1514
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Crystallization is the most serious bottleneck in high-throughput protein-structure determination by diffraction methods. We have used data mining of the large-scale experimental results of the Northeast Structural Genomics Consortium and experimental folding studies to characterize the biophysical properties that control protein crystallization. This analysis leads to the conclusion that crystallization propensity depends primarily on the prevalence of well-ordered surface epitopes capable of mediating interprotein interactions and is not strongly influenced by overall thermodynamic stability. We identify specific sequence features that correlate with crystallization propensity and that can be used to estimate the crystallization probability of a given construct. Analyses of entire predicted proteomes demonstrate substantial differences in the amino acid-sequence properties of human versus eubacterial proteins, which likely reflect differences in biophysical properties, including crystallization propensity. Our thermodynamic measurements do not generally support previous claims regarding correlations between sequence properties and protein stability.
引用
收藏
页码:51 / 57
页数:7
相关论文
共 50 条
  • [1] Large-Scale Protein Analysis of Experimental Retinal Artery Occlusion
    Vestergaard, Nanna
    Cehofski, Lasse Jorgensen
    Alsing, Alexander Norgard
    Kruse, Anders
    Nielsen, Jonas Ellegaard
    Schlosser, Anders
    Sorensen, Grith Lykke
    Honore, Bent
    Vorum, Henrik
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (09)
  • [2] Large-scale identification of membrane proteins with properties favorable for crystallization
    Kim, Jared
    Kagawa, Allison
    Kurasaki, Kellie
    Ataie, Niloufar
    Cho, Il Kyu
    Li, Qing X.
    Ng, Ho Leung
    PROTEIN SCIENCE, 2015, 24 (11) : 1756 - 1763
  • [3] Screening and selection methods for large-scale analysis of protein function
    Lin, HN
    Cornish, VW
    ANGEWANDTE CHEMIE-INTERNATIONAL EDITION, 2002, 41 (23) : 4403 - 4425
  • [4] Tag-Count Analysis of Large-Scale Proteomic Data
    Branson, Owen E.
    Freitas, Michael A.
    JOURNAL OF PROTEOME RESEARCH, 2016, 15 (12) : 4742 - 4746
  • [5] Snowflake Data Warehouse for Large-Scale and Diverse Biological Data Management and Analysis
    Koreeda, Tatsuya
    Honda, Hiroshi
    Onami, Jun-ichi
    GENES, 2025, 16 (01)
  • [6] Field-omics-understanding large-scale molecular data from field crops
    Alexandersson, Erik
    Jacobson, Dan
    Vivier, Melane A.
    Weckwerth, Wolfram
    Andreasson, Erik
    FRONTIERS IN PLANT SCIENCE, 2014, 5
  • [7] Large-Scale Quality Analysis of Published ChIP-seq Data
    Marinov, Georgi K.
    Kundaje, Anshul
    Park, Peter J.
    Wold, Barbara J.
    G3-GENES GENOMES GENETICS, 2014, 4 (02): : 209 - 223
  • [8] Variant calling and quality control of large-scale human genome sequencing data
    Jew, Brandon
    Sul, Jae Hoon
    EMERGING TOPICS IN LIFE SCIENCES, 2019, 3 (04) : 399 - 409
  • [9] Analysis of crystallization data in the Protein Data Bank
    Kirkwood, Jobic
    Hargreavcs, David
    O'Kecfe, Simon
    Wilson, Julie
    ACTA CRYSTALLOGRAPHICA SECTION F-STRUCTURAL BIOLOGY COMMUNICATIONS, 2015, 71 : 1228 - 1234
  • [10] Understanding large scale sequencing datasets through changes to protein folding
    Shorthouse, David
    Lister, Harris
    Freeman, Gemma S.
    Hall, Benjamin A.
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2024, 23 (05) : 517 - 524