Machine-learning-guided directed evolution for protein engineering

被引:621
|
作者
Yang, Kevin K. [1 ]
Wu, Zachary [1 ]
Arnold, Frances H. [1 ]
机构
[1] CALTECH, Div Chem & Chem Engn, Pasadena, CA 91125 USA
基金
美国国家科学基金会;
关键词
STABILITY CHANGES; SEQUENCE; MUTATIONS; PREDICTION; KERNEL; MODEL;
D O I
10.1038/s41592-019-0496-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein engineering through machine-learning-guided directed evolution enables the optimization of protein functions. Machine-learning approaches predict how sequence maps to function in a data-driven manner without requiring a detailed model of the underlying physics or biological pathways. Such methods accelerate directed evolution by learning from the properties of characterized variants and using that information to select sequences that are likely to exhibit improved properties. Here we introduce the steps required to build machine-learning sequence-function models and to use those models to guide engineering, making recommendations at each stage. This review covers basic concepts relevant to the use of machine learning for protein engineering, as well as the current literature and applications of this engineering paradigm. We illustrate the process with two case studies. Finally, we look to future opportunities for machine learning to enable the discovery of unknown protein functions and uncover the relationship between protein sequence and function.
引用
收藏
页码:687 / 694
页数:8
相关论文
共 50 条
  • [41] CMP Process Optimization Engineering by Machine Learning
    Yu, Hsiang-Meng
    Lin, Chih-Chen
    Hsu, Min-Hsuan
    Chen, Yen-Ting
    Chen, Kuang-Wei
    Luoh, Tuung
    Yang, Ling-Wuu
    Yang, Tahone
    Chen, Kuang-Chao
    IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2021, 34 (03) : 280 - 285
  • [42] Rational Protein Engineering Guided by Deep Mutational Scanning
    Shin, HyeonSeok
    Cho, Byung-Kwan
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2015, 16 (09): : 23094 - 23110
  • [43] Persistent spectral theory-guided protein engineering
    Qiu, Yuchi
    Wei, Guo-Wei
    NATURE COMPUTATIONAL SCIENCE, 2023, 3 (2): : 149 - 163
  • [44] Physics guided machine learning using simplified theories
    Pawar, Suraj
    San, Omer
    Aksoylu, Burak
    Rasheed, Adil
    Kvamsdal, Trond
    PHYSICS OF FLUIDS, 2021, 33 (01)
  • [45] Advances in generating functional diversity for directed protein evolution
    Shivange, Amol V.
    Marienhagen, Jan
    Mundhada, Hemanshu
    Schenk, Alexander
    Schwaneberg, Ulrich
    CURRENT OPINION IN CHEMICAL BIOLOGY, 2009, 13 (01) : 19 - 25
  • [46] Theory -guided machine learning for damage characterization of composites
    Zobeiry, Navid
    Reiner, Johannes
    Vaziri, Reza
    COMPOSITE STRUCTURES, 2020, 246
  • [47] Machine learning-guided co-optimization of fitness and diversity facilitates combinatorial library design in enzyme engineering
    Ding, Kerr
    Chin, Michael
    Zhao, Yunlong
    Huang, Wei
    Mai, Binh Khanh
    Wang, Huanan
    Liu, Peng
    Yang, Yang
    Luo, Yunan
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [48] Protein function in precision medicine: deep understanding with machine learning
    Rost, Burkhard
    Radivojac, Predrag
    Bromberg, Yana
    FEBS LETTERS, 2016, 590 (15) : 2327 - 2341
  • [49] Machine learning meets enzyme engineering: examples in the design of polyethylene terephthalate hydrolases
    Ali, Rohan
    Zhang, Yifei
    FRONTIERS OF CHEMICAL SCIENCE AND ENGINEERING, 2024, 18 (12)
  • [50] A visualized bibliometric analysis of mapping research trends of machine learning in engineering (MLE)
    Su, Miao
    Peng, Hui
    Li, Shaofan
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 186