Improving protein solubility and activity by introducing small peptide tags designed with machine learning models

被引:41
|
作者
Han, Xi [1 ]
Ning, Wenbo [1 ]
Ma, Xiaoqiang [2 ]
Wang, Xiaonan [1 ]
Zhou, Kang [1 ,2 ]
机构
[1] Natl Univ Singapore, Dept Chem & Biomol Engn, Singapore 117585, Singapore
[2] Singapore MIT Alliance Res & Technol, Disrupt Sustainable Technol Agr Precis, Singapore 138602, Singapore
来源
METABOLIC ENGINEERING COMMUNICATIONS | 2020年 / 11卷
基金
新加坡国家研究基金会;
关键词
Protein solubility; Protein activity; Machine learning; Optimization; Peptide tags; ESCHERICHIA-COLI; EXPRESSION;
D O I
10.1016/j.mec.2020.e00138
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Improving catalytic ability of enzymes is critical to the success of many metabolic engineering projects, but the search space of possible protein mutants is too large to explore exhaustively through experiments. To some extent, highly soluble enzymes tend to exhibit high activity due to their better folding quality. Here, we demonstrate that an optimization algorithm based on a regression model can effectively design short peptide tags to improve solubility of a few model enzymes. Based on the protein sequence information, a support vector regression model we recently developed was used to evaluate protein solubility after small peptide tags were introduced to a target protein. The optimization algorithm guided the sequences of the tags to evolve towards variants that had higher solubility. The optimization results were validated successfully by measuring solubility and activity of the model enzyme with and without the identified tags. The solubility of one protein (tyrosine ammonia lyase) was more than doubled and its activity was improved by 250%. This strategy successfully increased solubility of another two enzymes (aldehyde dehydrogenase and 1-deoxy-D-xylulose-5-phosphate synthase) we tested. The presented optimization methodology thus provides a valuable tool for improving enzyme performance for metabolic engineering and other biotechnology projects.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Development and evaluation of data-driven designed tags (DDTs) for controlling protein solubility
    Hirose, Shuichi
    Kawamura, Yoshifumi
    Mori, Masatoshi
    Yokota, Kiyonobu
    Noguchi, Tamotsu
    Goshima, Naoki
    NEW BIOTECHNOLOGY, 2011, 28 (03) : 225 - 231
  • [2] Mutational analysis of protein solubility enhancement using short peptide tags
    Kato, Atsushi
    Maki, Kosuke
    Ebina, Teppei
    Kuwajima, Kunihiro
    Soda, Kunitsugu
    Kuroda, Yutaka
    BIOPOLYMERS, 2007, 85 (01) : 12 - 18
  • [3] Atomistic Descriptors for Machine Learning Models of Solubility Parameters for Small Molecules and Polymers
    Chi, Mingzhe
    Gargouri, Rihab
    Schrader, Tim
    Damak, Kamel
    Maalej, Ramzi
    Sierka, Marek
    POLYMERS, 2022, 14 (01)
  • [4] Develop machine learning-based regression predictive models for engineering protein solubility
    Han, Xi
    Wang, Xiaonan
    Zhou, Kang
    BIOINFORMATICS, 2019, 35 (22) : 4640 - 4646
  • [5] Accelerating the Screening of Small Peptide Ligands by Combining Peptide-Protein Docking and Machine Learning
    Codina, Josep-Ramon
    Mascini, Marcello
    Dikici, Emre
    Deo, Sapna K.
    Daunert, Sylvia
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (15)
  • [6] Improving Vectorization Heuristics in a Dynamic Compiler with Machine Learning Models
    Mosaner, Raphael
    Barany, Gergoe
    Leopoldseder, David
    Moessenboeck, Hanspeter
    PROCEEDINGS OF THE 14TH ACM SIGPLAN INTERNATIONAL WORKSHOP ON VIRTUAL MACHINES AND INTERMEDIATE LANGUAGES, VMIL 2022, 2022, : 36 - 47
  • [7] Intelligence computational analysis of letrozole solubility in supercritical solvent via machine learning models
    Alqarni, Mohammed
    Ashour, Amal Adnan
    Shafie, Alaa
    Alqarni, Ali
    Felemban, Mohammed Fareed
    Shukr, Bandar Saud
    Alzubaidi, Mohammed Abdullah
    Algahtani, Fahad Saeed
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [8] Development of machine learning-based solubility models for estimation of Hydrogen solubility in oil: Models assessment and validation
    Jin, Hulin
    Jin, Zhiran
    Kim, Yong-Guk
    Fan, Chunyang
    CASE STUDIES IN THERMAL ENGINEERING, 2023, 51
  • [9] Explainable Machine Learning for Improving Logistic Regression Models
    Yang, Yimin
    Wu, Min
    2021 IEEE 19TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2021,
  • [10] Evaluation of Machine Learning Models for Aqueous Solubility Prediction in Drug Discovery
    Xue, Nian
    Zhang, Yuzhu
    Liu, Sensen
    2024 7TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA, ICAIBD 2024, 2024, : 26 - 33