De novo protein design.: II.: Plasticity in sequence space

被引:61
|
作者
Koehl, P [1 ]
Levitt, M [1 ]
机构
[1] Stanford Univ, Dept Biol Struct, Stanford, CA 94305 USA
关键词
protein design; random energy model; sequence space; Monte Carlo; fold recognition;
D O I
10.1006/jmbi.1999.3212
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
It is generally accepted that many different protein sequences have similar folded structures, and that there is a relatively high probability that a new sequence possesses a previously observed fold. An indirect consequence of this is that protein design should define the sequence space accessible to a given structure, rather than providing a single optimized sequence. We have recently developed a new approach for protein sequence design, which optimizes the complete sequence of a protein based on the knowledge of its backbone structure, its amino acid composition and a physical energy function including van der Waals interactions, electrostatics, and environment free energy. The specificity of the designed sequence for its template backbone is imposed by keeping the amino acid composition fixed. Here, we show that our procedure converges in sequence space, albeit not to the native sequence of the protein. We observe that while polar residues are well conserved in our designed sequences, non-polar amino acids at the surface of a protein are often replaced by polar residues. The designed sequences provide a multiple alignment of sequences that all adopt the same three-dimensional fold. This alignment is used to derive a profile matrix for chicken triose phosphate isomerase, TIM. The matrix is found to recognize significantly the native sequence for TIM, as well as closely related sequences. Possible application of this approach to protein fold recognition is discussed. (C) 1999 Academic Press.
引用
收藏
页码:1183 / 1193
页数:11
相关论文
共 50 条
  • [21] CONCRETE-FACE ROCKFILL DAM: II. DESIGN.
    Cooke, J.Barry
    Sherard, James L.
    Journal of geotechnical engineering, 1987, ll3 (l0): : 3 - 1132
  • [22] Computational combinatorial protein design: Sequence search and statistical design.
    Yang, X
    Saven, JG
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2004, 228 : U523 - U523
  • [23] Expanding the space of protein geometries by computational design of de novo fold families
    Pan, Xingjie
    Thompson, Michael C.
    Zhang, Yang
    Liu, Lin
    Fraser, James S.
    Kelly, Mark J. S.
    Kortemme, Tanja
    SCIENCE, 2020, 369 (6507) : 1132 - +
  • [24] Exploring the dark matter of protein space through de novo peptide design
    Woolfson, Derek
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2016, 251
  • [25] Role for chemoinformatics in structure-based de novo ligand design.
    Johnson, AP
    Boda, K
    Lengyel, T
    Weaver, S
    Vigh, A
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2004, 228 : U363 - U363
  • [26] De novo protein design, a retrospective
    Korendovych, Ivan, V
    DeGrado, William F.
    QUARTERLY REVIEWS OF BIOPHYSICS, 2020, 53
  • [27] Core-directed protein design. II. Rescue of a multiply mutated and destabilized variant of ubiquitin
    Finucane, MD
    Woolfson, DN
    BIOCHEMISTRY, 1999, 38 (36) : 11613 - 11623
  • [28] Novel formulations for the sequence selection problem in de novo protein design with flexible templates
    Fung, H. K.
    Taylor, M. S.
    Floudas, C. A.
    OPTIMIZATION METHODS & SOFTWARE, 2007, 22 (01): : 51 - 71
  • [29] De novo protein sequence analysis of Macaca mulatta
    Nilesh S Tannu
    Scott E Hemby
    BMC Genomics, 8
  • [30] ANNIE: integrated de novo protein sequence annotation
    Ooi, Hong Sain
    Kwo, Chia Yee
    Wildpaner, Michael
    Sirota, Fernanda L.
    Eisenhaber, Birgit
    Maurer-Stroh, Sebastian
    Wong, Wing Cheong
    Schleiffer, Alexander
    Eisenhaber, Frank
    Schneider, Georg
    NUCLEIC ACIDS RESEARCH, 2009, 37 : W435 - W440