ESM-scan-A tool to guide amino acid substitutions

被引:0
作者
Totaro, Massimo G. [1 ]
Vide, Ursula [1 ]
Zausinger, Regina [1 ]
Winkler, Andreas [1 ,2 ]
Oberdorfer, Gustav [1 ,2 ]
机构
[1] Graz Univ Technol, Inst Biochem, Petersgasse 12-2, A-8010 Graz, Austria
[2] BioTechMed, Graz, Austria
基金
奥地利科学基金会; 欧洲研究理事会;
关键词
in-silico deep mutational scanning; protein design; protein engineering; protein function; structural biology; PROTEIN STABILITY; PREDICTION; POTENTIALS; MUTATIONS; AFFINITY; DATABASE; DESIGN;
D O I
10.1002/pro.5221
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Protein structure prediction and (re)design have gone through a revolution in the last 3 years. The tremendous progress in these fields has been almost exclusively driven by readily available machine learning algorithms applied to protein folding and sequence design problems. Despite these advancements, predicting site-specific mutational effects on protein stability and function remains an unsolved problem. This is a persistent challenge, mainly because the free energy of large systems is very difficult to compute with absolute accuracy and subtle changes to protein structures are hard to capture with computational models. Here, we describe the implementation and use of ESM-Scan, which uses the ESM zero-shot predictor to scan entire protein sequences for preferential amino acid changes, thus enabling in silico deep mutational scanning experiments. We benchmark ESM-Scan on its predictive capabilities for stability and functionality of sequence changes using three publicly available datasets and proceed by experimentally testing the tool's performance on a challenging test case of a blue-light-activated diguanylate cyclase from Methylotenera species (MsLadC), where it accurately predicted the importance of a highly conserved residue in a region involved in allosteric product inhibition. Our experimental results show that the ESM-zero shot model is capable of inferring the effects of a set of amino acid substitutions in their correlation between predicted fitness and experimental results. ESM-Scan is publicly available at .
引用
收藏
页数:12
相关论文
共 96 条
  • [1] Accurate structure prediction of biomolecular interactions with AlphaFold 3
    Abramson, Josh
    Adler, Jonas
    Dunger, Jack
    Evans, Richard
    Green, Tim
    Pritzel, Alexander
    Ronneberger, Olaf
    Willmore, Lindsay
    Ballard, Andrew J.
    Bambrick, Joshua
    Bodenstein, Sebastian W.
    Evans, David A.
    Hung, Chia-Chun
    O'Neill, Michael
    Reiman, David
    Tunyasuvunakool, Kathryn
    Wu, Zachary
    Zemgulyte, Akvile
    Arvaniti, Eirini
    Beattie, Charles
    Bertolli, Ottavia
    Bridgland, Alex
    Cherepanov, Alexey
    Congreve, Miles
    Cowen-Rivers, Alexander I.
    Cowie, Andrew
    Figurnov, Michael
    Fuchs, Fabian B.
    Gladman, Hannah
    Jain, Rishub
    Khan, Yousuf A.
    Low, Caroline M. R.
    Perlin, Kuba
    Potapenko, Anna
    Savy, Pascal
    Singh, Sukhdeep
    Stecula, Adrian
    Thillaisundaram, Ashok
    Tong, Catherine
    Yakneen, Sergei
    Zhong, Ellen D.
    Zielinski, Michal
    Zidek, Augustin
    Bapst, Victor
    Kohli, Pushmeet
    Jaderberg, Max
    Hassabis, Demis
    Jumper, John M.
    [J]. NATURE, 2024, 630 (8016) : 493 - 500
  • [2] Protein Function Analysis through Machine Learning
    Avery, Chris
    Patterson, John
    Grear, Tyler
    Frater, Theodore
    Jacobs, Donald J.
    [J]. BIOMOLECULES, 2022, 12 (09)
  • [3] Flex ddG: Rosetta Ensemble-Based Estimation of Changes in Protein-Protein Binding Affinity upon Mutation
    Barlow, Kyle A.
    Conchuir, Shane O.
    Thompson, Samuel
    Suresh, Pooja
    Lucas, James E.
    Heinonen, Markus
    Kortemme, Tanja
    [J]. JOURNAL OF PHYSICAL CHEMISTRY B, 2018, 122 (21) : 5389 - 5399
  • [4] Improved prediction of protein-protein binding sites using a support vector machines approach
    Bradford, JR
    Westhead, DR
    [J]. BIOINFORMATICS, 2005, 21 (08) : 1487 - 1494
  • [5] Brandes N, 2022, bioRxiv, DOI [10.1101/2022.08.25.505311, 10.1101/2022.08.25.505311, DOI 10.1101/2022.08.25.505311]
  • [6] ProteinBERT: a universal deep-learning model of protein sequence and function
    Brandes, Nadav
    Ofer, Dan
    Peleg, Yam
    Rappoport, Nadav
    Linial, Michal
    [J]. BIOINFORMATICS, 2022, 38 (08) : 2102 - 2110
  • [7] Computational Modeling of Protein Stability: Quantitative Analysis Reveals Solutions to Pervasive Problems
    Broom, Aron
    Trainor, Kyle
    Jacobi, Zachary
    Meiering, Elizabeth M.
    [J]. STRUCTURE, 2020, 28 (06) : 717 - +
  • [8] Computational tools help improve protein stability but with a solubility tradeoff
    Broom, Aron
    Jacobi, Zachary
    Trainor, Kyle
    Meiering, Elizabeth M.
    [J]. JOURNAL OF BIOLOGICAL CHEMISTRY, 2017, 292 (35) : 14349 - 14361
  • [9] Brown BP, 2023, bioRxiv, DOI [10.1101/2023.08.06.552168, 10.1101/2023.08.06.552168, DOI 10.1101/2023.08.06.552168]
  • [10] Improved prediction of protein-protein interactions using AlphaFold2
    Bryant, P.
    Pozzati, G.
    Elofsson, A.
    [J]. NATURE COMMUNICATIONS, 2022, 13 (01)