Machine Learning-Guided Protein Engineering

被引:60
|
作者
Kouba, Petr [1 ,3 ,4 ]
Kohout, Pavel [1 ,2 ]
Haddadi, Faraneh [1 ,2 ]
Bushuiev, Anton [3 ]
Samusevich, Raman [3 ,5 ]
Sedlar, Jiri [3 ]
Damborsky, Jiri [1 ,2 ]
Pluskal, Tomas [5 ]
Sivic, Josef [3 ]
Mazurenko, Stanislav [1 ,2 ]
机构
[1] Masaryk Univ, Loschmidt Labs, Dept Expt Biol, Brno 62500, Czech Republic
[2] St Annes Univ Hosp Brno, Int Clin Res Ctr, Brno 65691, Czech Republic
[3] Czech Tech Univ, Czech Inst Informat Robot & Cybernet, Prague 16000, Czech Republic
[4] Czech Tech Univ, Fac Elect Engn, Prague 16627, Czech Republic
[5] Czech Acad Sci, Inst Organ Chem & Biochem, Prague 16000, Czech Republic
基金
欧盟地平线“2020”;
关键词
activity; artificial intelligence; biocatalysis; deep learning; protein design; AMINO-ACID; HALOALKANE DEHALOGENASES; ACCURATE PREDICTION; SEQUENCE; DATABASE; DESIGN; MODELS; EVOLUTION; LANGUAGE; MUTATIONS;
D O I
10.1021/acscatal.3c02743
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Recent progress in engineering highly promising biocatalysts has increasingly involved machine learning methods. These methods leverage existing experimental and simulation data to aid in the discovery and annotation of promising enzymes, as well as in suggesting beneficial mutations for improving known targets. The field of machine learning for protein engineering is gathering steam, driven by recent success stories and notable progress in other areas. It already encompasses ambitious tasks such as understanding and predicting protein structure and function, catalytic efficiency, enantioselectivity, protein dynamics, stability, solubility, aggregation, and more. Nonetheless, the field is still evolving, with many challenges to overcome and questions to address. In this Perspective, we provide an overview of ongoing trends in this domain, highlight recent case studies, and examine the current limitations of machine learning-based methods. We emphasize the crucial importance of thorough experimental validation of emerging models before their use for rational protein design. We present our opinions on the fundamental problems and outline the potential directions for future research.
引用
收藏
页码:13863 / 13895
页数:33
相关论文
共 50 条
  • [1] Benchmarking protein structure predictors to assist machine learning-guided peptide discovery
    Aldas-Bulos, Victor Daniel
    Plisson, Fabien
    DIGITAL DISCOVERY, 2023, 2 (04): : 981 - 993
  • [2] Machine-learning-guided directed evolution for protein engineering
    Yang, Kevin K.
    Wu, Zachary
    Arnold, Frances H.
    NATURE METHODS, 2019, 16 (08) : 687 - 694
  • [3] Meta learning addresses noisy and under-labeled data in machine learning-guided antibody engineering
    Minot, Mason
    Reddy, Sai T.
    CELL SYSTEMS, 2024, 15 (01) : 4 - 18.e4
  • [4] Machine learning-guided synthesis of advanced inorganic materials
    Tang, Bijun
    Lu, Yuhao
    Zhou, Jiadong
    Chouhan, Tushar
    Wang, Han
    Golani, Prafful
    Xu, Manzhang
    Xu, Quan
    Guan, Cuntai
    Liu, Zheng
    MATERIALS TODAY, 2020, 41 : 72 - 80
  • [5] Machine Learning-Guided Systematic Search of DNA Sequences for Sorting Carbon Nanotubes
    Lin, Zhiwei
    Yang, Yoona
    Jagota, Anand
    Zheng, Ming
    ACS NANO, 2022, 16 (03) : 4705 - 4713
  • [6] Deep learning-guided video compression for machine vision tasks
    Kim, Aro
    Woo, Seung-taek
    Park, Minho
    Kim, Dong-hwi
    Lim, Hanshin
    Jung, Soon-heung
    Kwak, Sangwoon
    Park, Sang-hyo
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2024, 2024 (01)
  • [7] Machine learning-guided malate dehydrogenase engineering for improved production of L-malic acid in Aspergillus niger
    Zhang, Zihan
    Zheng, Yuanyuan
    Zhang, Chi
    Xu, Qing
    Xue, Feng
    MOLECULAR CATALYSIS, 2025, 578
  • [8] Machine learning-guided discovery and design of non-hemolytic peptides
    Plisson, Fabien
    Ramirez-Sanchez, Obed
    Martinez-Hernandez, Cristina
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [9] Machine learning-guided discovery of gas evolving electrode bubble inactivation
    Lake, Jack R.
    Rufer, Simon
    James, Jim
    Pruyne, Nathan
    Scourtas, Aristana
    Schwarting, Marcus
    Ambadkar, Aadit
    Foster, Ian
    Blaiszik, Ben
    Varanasi, Kripa K.
    NANOSCALE, 2025, 17 (03)
  • [10] AMPGAN v2: Machine Learning-Guided Design of Antimicrobial Peptides
    Van Oort, Colin M.
    Ferrell, Jonathon B.
    Remington, Jacob M.
    Wshah, Safwan
    Li, Jianing
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2021, 61 (05) : 2198 - 2207