Improving protein solubility and activity by introducing small peptide tags designed with machine learning models

被引:41
|
作者
Han, Xi [1 ]
Ning, Wenbo [1 ]
Ma, Xiaoqiang [2 ]
Wang, Xiaonan [1 ]
Zhou, Kang [1 ,2 ]
机构
[1] Natl Univ Singapore, Dept Chem & Biomol Engn, Singapore 117585, Singapore
[2] Singapore MIT Alliance Res & Technol, Disrupt Sustainable Technol Agr Precis, Singapore 138602, Singapore
来源
METABOLIC ENGINEERING COMMUNICATIONS | 2020年 / 11卷
基金
新加坡国家研究基金会;
关键词
Protein solubility; Protein activity; Machine learning; Optimization; Peptide tags; ESCHERICHIA-COLI; EXPRESSION;
D O I
10.1016/j.mec.2020.e00138
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Improving catalytic ability of enzymes is critical to the success of many metabolic engineering projects, but the search space of possible protein mutants is too large to explore exhaustively through experiments. To some extent, highly soluble enzymes tend to exhibit high activity due to their better folding quality. Here, we demonstrate that an optimization algorithm based on a regression model can effectively design short peptide tags to improve solubility of a few model enzymes. Based on the protein sequence information, a support vector regression model we recently developed was used to evaluate protein solubility after small peptide tags were introduced to a target protein. The optimization algorithm guided the sequences of the tags to evolve towards variants that had higher solubility. The optimization results were validated successfully by measuring solubility and activity of the model enzyme with and without the identified tags. The solubility of one protein (tyrosine ammonia lyase) was more than doubled and its activity was improved by 250%. This strategy successfully increased solubility of another two enzymes (aldehyde dehydrogenase and 1-deoxy-D-xylulose-5-phosphate synthase) we tested. The presented optimization methodology thus provides a valuable tool for improving enzyme performance for metabolic engineering and other biotechnology projects.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Serum Protein Fishing for Machine Learning-Boosted Diagnostic Classification of Small Nodules of Lung
    Wang, Mengjie
    Dai, Xin
    Yang, Xu
    Jin, Baichuan
    Xie, Yueli
    Xu, Chenlu
    Liu, Qiqi
    Wang, Lichao
    Ying, Lisha
    Lu, Weishan
    Chen, Qixun
    Fu, Ting
    Su, Dan
    Liu, Yuan
    Tan, Weihong
    ACS NANO, 2024, 18 (05) : 4038 - 4055
  • [42] Developing statistical and machine learning models for predicting CO 2 solubility in live crude oils
    Bhattacherjee, Rupom
    Botchway, Kodjo
    Pashin, Jack C.
    Chakraborty, Goutam
    Bikkina, Prem
    FUEL, 2024, 368
  • [43] Mapping membrane activity in undiscovered peptide sequence space using machine learning
    Lee, Ernest Y.
    Fulan, Benjamin M.
    Wong, Gerard C. L.
    Ferguson, Andrew L.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (48) : 13588 - 13593
  • [44] Machine learning and deep learning models for human activity recognition in security and surveillance: a review
    Waghchaware, Sheetal
    Joshi, Radhika
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (08) : 4405 - 4436
  • [45] Protein structure prediction (RMSD ≤ 5 Å) using machine learning models
    Pathak, Yadunath
    Rana, Prashant Singh
    Singh, P. K.
    Saraswat, Mukesh
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2016, 14 (01) : 71 - 85
  • [46] Protein Language Models and Machine Learning Facilitate the Identification of Antimicrobial Peptides
    Medina-Ortiz, David
    Contreras, Seba
    Fernandez, Diego
    Soto-Garcia, Nicole
    Moya, Ivan
    Cabas-Mora, Gabriel
    Olivera-Nappa, Alvaro
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (16)
  • [47] Probabilistic and Machine Learning Models for the Protein Scaffold Gap Filling Problem
    Badal, Kushal
    Qingge, Letu
    Liu, Xiaowen
    Zhu, Binhai
    BIOINFORMATICS RESEARCH AND APPLICATIONS, PT III, ISBRA 2024, 2024, 14956 : 28 - 39
  • [48] Introducing a Chemically Intuitive Core-Substituent Fingerprint Designed to Explore Structural Requirements for Effective Similarity Searching and Machine Learning
    Janela, Tiago
    Takeuchi, Kosuke
    Bajorath, Juergen
    MOLECULES, 2022, 27 (07):
  • [49] Voice activity detection based on statistical models and machine learning approaches
    Shin, Jong Won
    Chang, Joon-Hyuk
    Kim, Nam Soo
    COMPUTER SPEECH AND LANGUAGE, 2010, 24 (03): : 515 - 530
  • [50] Improving groundwater nitrate concentration prediction using local ensemble of machine learning models
    Mahboobi, Hojjatollah
    Shakiba, Alireza
    Mirbagheri, Babak
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2023, 345