Recent Advances in Machine Learning Variant Effect Prediction Tools for Protein Engineering

被引:16
|
作者
Horne, Jesse [1 ]
Shukla, Diwakar [1 ,2 ,3 ,4 ]
机构
[1] Univ Illinois, Dept Chem & Biomol Engn, Champaign, IL 61801 USA
[2] Univ Illinois, Dept Bioengn, Champaign, IL 61801 USA
[3] Univ Illinois, Canc Ctr Illinois, Dept Plant Biol, Champaign, IL 61801 USA
[4] Univ Illinois, Ctr Biophys & Quantitat Biol, Champaign, IL 61801 USA
基金
美国国家卫生研究院;
关键词
AMINO-ACID SUBSTITUTIONS; INSIGHTS; STABILITY; EVOLUTION; MODELS;
D O I
10.1021/acs.iecr.1c04943
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Proteins are Nature's molecular machinery and comprise diverse roles while consisting of chemically similar building blocks. In recent years, protein engineering and design have become important research areas, with many applications in the pharmaceutical, energy, and biocatalysis fields, among others- where the aim is to ultimately create a protein given desired structural and functional properties. It is often critical to model the relationship between a protein's sequence, folded structure, and biological function to assist in such protein engineering pursuits. However, significant challenges remain in concretely mapping an amino acid sequence to specific protein properties and biological activities. Mutations may enhance or diminish molecular protein function, and the epistatic interactions between mutations result in an inherently complex mapping between genetic modifications and protein function. Therefore, estimating the quantitative effects of mutations on protein function(s) remains a grand challenge of biology, bioinformatics, and many related fields and would rapidly accelerate protein engineering tasks when successful. Such estimation is often known as variant effect prediction (VEP). However, progress has been demonstrated in recent years with the development of machine learning (ML) methods in modeling the relationship between mutations and protein function. In this Review, recent advances in variant effect prediction (VEP) are discussed as tools for protein engineering, focusing on techniques incorporating gains from the broader ML community and challenges in estimating biomolecular functional differences. Primary developments highlighted include convolutional neural networks, graph neural networks, and natural language embeddings for protein sequences.
引用
收藏
页码:6235 / 6245
页数:11
相关论文
共 50 条
  • [1] Recent Advances in Machine Learning Based Prediction of RNA-Protein Interactions
    Sagar, Amit
    Xue, Bin
    PROTEIN AND PEPTIDE LETTERS, 2019, 26 (08): : 601 - 619
  • [2] Recent advances in machine learning applications in metabolic engineering
    Patra, Pradipta
    Disha, B. R.
    Kundu, Pritam
    Das, Manali
    Ghosh, Amit
    BIOTECHNOLOGY ADVANCES, 2023, 62
  • [3] Recent Advances in the Prediction of Protein Structural Classes: Feature Descriptors and Machine Learning Algorithms
    Zhu, Lin
    Davari, Mehdi D.
    Li, Wenjin
    CRYSTALS, 2021, 11 (04)
  • [4] Variant Effect Prediction in the Age of Machine Learning
    Bromberg, Yana
    Prabakaran, R.
    Kabir, Anowarul
    Shehu, Amarda
    COLD SPRING HARBOR PERSPECTIVES IN BIOLOGY, 2024, 16 (07):
  • [5] Multimodal Machine Learning for Protein Variant Effect Prediction and De Novo Design
    Shen, Yang
    PROTEIN SCIENCE, 2023, 32 (12)
  • [6] Recent advances in the integration of protein mechanics and machine learning
    Chen, Yen-Lin
    Chang, Shu-Wei
    EXTREME MECHANICS LETTERS, 2024, 72
  • [7] Advances in the Prediction of Protein Subcellular Locations with Machine Learning
    Zhang, Ting-He
    Zhang, Shao-Wu
    CURRENT BIOINFORMATICS, 2019, 14 (05) : 406 - 421
  • [8] Recent advances and application of machine learning in food flavor prediction and regulation
    Ji, Huizhuo
    Pu, Dandan
    Yan, Wenjing
    Zhang, Qingchuan
    Zuo, Min
    Zhang, Yuyu
    TRENDS IN FOOD SCIENCE & TECHNOLOGY, 2023, 138 : 738 - 751
  • [9] Advances in Protein Contact Map Prediction Based on Machine Learning
    Xie, Jiang
    Ding, Wang
    Chen, Luonan
    Guo, Qiang
    Zhang, Wu
    MEDICINAL CHEMISTRY, 2015, 11 (03) : 265 - 270
  • [10] Machine learning tools in production engineering
    Michael Rom
    Matthias Brockmann
    Michael Herty
    Elisa Iacomini
    The International Journal of Advanced Manufacturing Technology, 2022, 121 : 4793 - 4804