Emergence of protein fold families through rational design

被引:167
|
作者
Ding, Feng [1 ]
Dokholyan, Nikolay V. [1 ]
机构
[1] Univ N Carolina, Dept Biochem & Biophys, Chapel Hill, NC USA
关键词
D O I
10.1371/journal.pcbi.0020085
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Diverse proteins with similar structures are grouped into families of homologs and analogs, if their sequence similarity is higher or lower, respectively, than 20%-30%. It was suggested that protein homologs and analogs originate from a common ancestor and diverge in their distinct evolutionary time scales, emerging as a consequence of the physical properties of the protein sequence space. Although a number of studies have determined key signatures of protein family organization, the sequence-structure factors that differentiate the two evolution-related protein families remain unknown. Here, we stipulate that subtle structural changes, which appear due to accumulating mutations in the homologous families, lead to distinct packing of the protein core and, thus, novel compositions of core residues. The latter process leads to the formation of distinct families of homologs. We propose that such differentiation results in the formation of analogous families. To test our postulate, we developed a molecular modeling and design toolkit, Medusa, to computationally design protein sequences that correspond to the same fold family. We find that analogous proteins emerge when a backbone structure deviates only 1-2 angstrom root-mean-square deviation from the original structure. For close homologs, core residues are highly conserved. However, when the overall sequence similarity drops to; 25%-30%, the composition of core residues starts to diverge, thereby forming novel families of protein homologs. This direct observation of the formation of protein homologs within a specific fold family supports our hypothesis. The conservation of amino acids in designed sequences recapitulates that of the naturally occurring sequences, thereby validating our computational design methodology.
引用
收藏
页码:725 / 733
页数:9
相关论文
共 50 条
  • [1] Emergence of protein fold families through rational design (vol 2, pg 725, 2006)
    Ding, Feng
    Dokholyan, Nikolay V.
    PLOS COMPUTATIONAL BIOLOGY, 2006, 2 (10) : 1325 - 1325
  • [2] IDENTIFICATION AND CLASSIFICATION OF PROTEIN FOLD FAMILIES
    ORENGO, CA
    FLORES, TP
    TAYLOR, WR
    THORNTON, JM
    PROTEIN ENGINEERING, 1993, 6 (05): : 485 - 500
  • [3] Expanding the space of protein geometries by computational design of de novo fold families
    Pan, Xingjie
    Thompson, Michael C.
    Zhang, Yang
    Liu, Lin
    Fraser, James S.
    Kelly, Mark J. S.
    Kortemme, Tanja
    SCIENCE, 2020, 369 (6507) : 1132 - +
  • [4] Structural protein reorganization and fold emergence investigated through amino acid sequence permutations
    Giovanni Minervini
    Alessandro Masiero
    Emilio Potenza
    Silvio C. E. Tosatto
    Amino Acids, 2015, 47 : 147 - 152
  • [5] Structural protein reorganization and fold emergence investigated through amino acid sequence permutations
    Minervini, Giovanni
    Masiero, Alessandro
    Potenza, Emilio
    Tosatto, Silvio C. E.
    AMINO ACIDS, 2015, 47 (01) : 147 - 152
  • [6] PROTEIN DESIGN I like to fold it, fold it
    Deane, Caitlin
    NATURE CHEMICAL BIOLOGY, 2017, 13 (09) : 923 - 923
  • [7] Learning generative models for protein fold families
    Balakrishnan, Sivaraman
    Kamisetty, Hetunandan
    Carbonell, Jaime G.
    Lee, Su-In
    Langmead, Christopher James
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2011, 79 (04) : 1061 - 1078
  • [8] Improving CBCA synthase activity through rational protein design
    Thomas, Fabian
    Kayser, Oliver
    JOURNAL OF BIOTECHNOLOGY, 2023, 363 : 40 - 49
  • [9] Protein Design through Systematic Catalytic Loop Exchange in the (β/α)8 Fold
    Ochoa-Leyva, Adrian
    Soberon, Xavier
    Sanchez, Filiberto
    Arguello, Martha
    Montero-Moran, Gabriela
    Saab-Rincon, Gloria
    JOURNAL OF MOLECULAR BIOLOGY, 2009, 387 (04) : 949 - 964
  • [10] THE FSSP DATABASE OF STRUCTURALLY ALIGNED PROTEIN FOLD FAMILIES
    HOLM, L
    SANDER, C
    NUCLEIC ACIDS RESEARCH, 1994, 22 (17) : 3600 - 3609