De novo protein design by inversion of the AlphaFold structure prediction network

被引:32
|
作者
Goverde, Casper A. [1 ,2 ]
Wolf, Benedict [1 ,2 ]
Khakzad, Hamed [1 ,2 ]
Rosset, Stephane [1 ]
Correia, Bruno E. [1 ,2 ]
机构
[1] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
[2] Swiss Inst Bioinformat SIB, Lausanne, Switzerland
基金
欧洲研究理事会; 瑞士国家科学基金会;
关键词
AlphaFold2; computational structural biology; De novo protein design; machine learning; structure prediction network inversion; COMPUTATIONAL DESIGN; FOLD;
D O I
10.1002/pro.4653
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
De novo protein design enhances our understanding of the principles that govern protein folding and interactions, and has the potential to revolutionize biotechnology through the engineering of novel protein functionalities. Despite recent progress in computational design strategies, de novo design of protein structures remains challenging, given the vast size of the sequence-structure space. AlphaFold2 (AF2), a state-of-the-art neural network architecture, achieved remarkable accuracy in predicting protein structures from amino acid sequences. This raises the question whether AF2 has learned the principles of protein folding sufficiently for de novo design. Here, we sought to answer this question by inverting the AF2 network, using the prediction weight set and a loss function to bias the generated sequences to adopt a target fold. Initial design trials resulted in de novo designs with an overrepresentation of hydrophobic residues on the protein surface compared to their natural protein family, requiring additional surface optimization. In silico validation of the designs showed protein structures with the correct fold, a hydrophilic surface and a densely packed hydrophobic core. In vitro validation showed that 7 out of 39 designs were folded and stable in solution with high melting temperatures. In summary, our design workflow solely based on AF2 does not seem to fully capture basic principles of de novo protein design, as observed in the protein surface's hydrophobic vs. hydrophilic patterning. However, with minimal post-design intervention, these pipelines generated viable sequences as assessed experimental characterization. Thus, such pipelines show the potential to contribute to solving outstanding challenges in de novo protein design.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Multimodal Machine Learning for Protein Variant Effect Prediction and De Novo Design
    Shen, Yang
    PROTEIN SCIENCE, 2023, 32 (12)
  • [32] De novo protein design, a retrospective
    Korendovych, Ivan, V
    DeGrado, William F.
    QUARTERLY REVIEWS OF BIOPHYSICS, 2020, 53
  • [33] Improving De novo Protein Structure Prediction using Contact Maps Information
    Santos, Karina B.
    Rocha, Gregorio K.
    Custodio, Fabio L.
    Barbosa, Helio J. C.
    Dardenne, Laurent E.
    2017 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2017, : 107 - 112
  • [34] SimFold energy function for de novo protein structure prediction: Consensus with Rosetta
    Fujitsuka, Y
    Chikenji, G
    Takada, S
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 62 (02) : 381 - 398
  • [35] Protein structure prediction: Combining de novo modeling with sparse experimental data
    Latek, Dorota
    Ekonomiuk, Dariusz
    Kolinski, Andrzej
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2007, 28 (10) : 1668 - 1676
  • [36] Protein structure prediction by combination of fold-recognition with de novo folding
    Bujnicki, JM
    Feder, M
    Gajda, MJ
    Kosinski, J
    Gront, D
    Kolinski, A
    FEBS JOURNAL, 2005, 272 : 104 - 104
  • [37] State-of-the-art web services for de novo protein structure prediction
    Abriata, Luciano A.
    Dal Peraro, Matteo
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (03)
  • [38] LRFragLib: an effective algorithm to identify fragments for de novo protein structure prediction
    Wang, Tong
    Yang, Yuedong
    Zhou, Yaoqi
    Gong, Haipeng
    BIOINFORMATICS, 2017, 33 (05) : 677 - 684
  • [39] De novo protein design targeting voltage-gated sodium channel 1.7 using RFdiffusion and AlphaFold2
    Harris, Brandon J.
    Sukhthankar, Amogh
    Yarov-Yarovoy, Vladimir
    BIOPHYSICAL JOURNAL, 2024, 123 (03) : 108A - 108A
  • [40] Sonification based de novo protein design using artificial intelligence, structure prediction, and analysis using molecular modeling
    Yu, Chi-Hua
    Buehler, Markus J.
    APL BIOENGINEERING, 2020, 4 (01):