De novo protein design by inversion of the AlphaFold structure prediction network

Times cited: 32
Authors
Goverde, Casper A. [1 ,2 ]
Wolf, Benedict [1 ,2 ]
Khakzad, Hamed [1 ,2 ]
Rosset, Stephane [1 ]
Correia, Bruno E. [1 ,2 ]
Affiliations
[1] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
[2] Swiss Inst Bioinformat SIB, Lausanne, Switzerland
Funding
European Research Council; Swiss National Science Foundation
Keywords
AlphaFold2; computational structural biology; De novo protein design; machine learning; structure prediction network inversion; COMPUTATIONAL DESIGN; FOLD;
DOI
10.1002/pro.4653
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010 ; 081704 ;
Abstract
De novo protein design enhances our understanding of the principles that govern protein folding and interactions, and has the potential to revolutionize biotechnology through the engineering of novel protein functionalities. Despite recent progress in computational design strategies, de novo design of protein structures remains challenging, given the vast size of the sequence-structure space. AlphaFold2 (AF2), a state-of-the-art neural network architecture, achieved remarkable accuracy in predicting protein structures from amino acid sequences. This raises the question of whether AF2 has learned the principles of protein folding sufficiently for de novo design. Here, we sought to answer this question by inverting the AF2 network, using the prediction weight set and a loss function to bias the generated sequences to adopt a target fold. Initial design trials resulted in de novo designs with an overrepresentation of hydrophobic residues on the protein surface compared to their natural protein family, requiring additional surface optimization. In silico validation of the designs showed protein structures with the correct fold, a hydrophilic surface and a densely packed hydrophobic core. In vitro validation showed that 7 out of 39 designs were folded and stable in solution with high melting temperatures. In summary, our design workflow, based solely on AF2, does not seem to fully capture basic principles of de novo protein design, as observed in the protein surface's hydrophobic vs. hydrophilic patterning. However, with minimal post-design intervention, these pipelines generated viable sequences, as assessed by experimental characterization. Thus, such pipelines show the potential to contribute to solving outstanding challenges in de novo protein design.
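The surface observation in the abstract (hydrophobic residues overrepresented on the surface of the initial designs) can be made concrete with a simple metric. The sketch below is illustrative only and is not the authors' analysis: the hydrophobic residue set, the 0.25 relative-accessibility cutoff, the function name `surface_hydrophobic_fraction`, and the example values are all assumptions, and the per-residue accessibility values are presumed to come from a separate structure-analysis tool.

```python
# Illustrative sketch, not the paper's method.
# Assumption: these one-letter codes count as "hydrophobic".
HYDROPHOBIC = set("AVILMFWC")


def surface_hydrophobic_fraction(sequence, rel_accessibility, cutoff=0.25):
    """Fraction of surface residues that are hydrophobic.

    sequence          -- one-letter amino acid string
    rel_accessibility -- per-residue relative solvent accessibility (0..1),
                         assumed to be precomputed elsewhere
    cutoff            -- assumed threshold above which a residue is "surface"
    """
    if len(sequence) != len(rel_accessibility):
        raise ValueError("sequence and accessibility lengths differ")
    # Keep only residues exposed enough to count as surface.
    surface = [aa for aa, rsa in zip(sequence, rel_accessibility) if rsa >= cutoff]
    if not surface:
        return 0.0
    return sum(aa in HYDROPHOBIC for aa in surface) / len(surface)


# Hypothetical toy input: 8 residues with made-up accessibility values.
toy_sequence = "MKVLAEEL"
toy_rsa = [0.10, 0.60, 0.05, 0.30, 0.02, 0.70, 0.50, 0.40]
toy_fraction = surface_hydrophobic_fraction(toy_sequence, toy_rsa)
```

Comparing this fraction between a design and members of its natural protein family would be one way to flag designs that need the kind of surface optimization the abstract mentions.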
Pages: 14