Improving protein solubility and activity by introducing small peptide tags designed with machine learning models

被引:41
|
作者
Han, Xi [1 ]
Ning, Wenbo [1 ]
Ma, Xiaoqiang [2 ]
Wang, Xiaonan [1 ]
Zhou, Kang [1 ,2 ]
机构
[1] Natl Univ Singapore, Dept Chem & Biomol Engn, Singapore 117585, Singapore
[2] Singapore MIT Alliance Res & Technol, Disrupt Sustainable Technol Agr Precis, Singapore 138602, Singapore
来源
METABOLIC ENGINEERING COMMUNICATIONS | 2020年 / 11卷
基金
新加坡国家研究基金会;
关键词
Protein solubility; Protein activity; Machine learning; Optimization; Peptide tags; ESCHERICHIA-COLI; EXPRESSION;
D O I
10.1016/j.mec.2020.e00138
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Improving catalytic ability of enzymes is critical to the success of many metabolic engineering projects, but the search space of possible protein mutants is too large to explore exhaustively through experiments. To some extent, highly soluble enzymes tend to exhibit high activity due to their better folding quality. Here, we demonstrate that an optimization algorithm based on a regression model can effectively design short peptide tags to improve solubility of a few model enzymes. Based on the protein sequence information, a support vector regression model we recently developed was used to evaluate protein solubility after small peptide tags were introduced to a target protein. The optimization algorithm guided the sequences of the tags to evolve towards variants that had higher solubility. The optimization results were validated successfully by measuring solubility and activity of the model enzyme with and without the identified tags. The solubility of one protein (tyrosine ammonia lyase) was more than doubled and its activity was improved by 250%. This strategy successfully increased solubility of another two enzymes (aldehyde dehydrogenase and 1-deoxy-D-xylulose-5-phosphate synthase) we tested. The presented optimization methodology thus provides a valuable tool for improving enzyme performance for metabolic engineering and other biotechnology projects.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Multiple machine learning algorithms assisted QSPR models for aqueous solubility: Comprehensive assessment with CRITIC-TOPSIS
    Zhu, Tengyi
    Chen, Ying
    Tao, Cuicui
    SCIENCE OF THE TOTAL ENVIRONMENT, 2023, 857
  • [22] Improving empirical models and forecasts with saturation-based machine learning
    Martinez, Andrew B.
    Ericsson, Neil R.
    ANNALS OF OPERATIONS RESEARCH, 2025, 346 (01) : 447 - 487
  • [23] Prediction of small-molecule compound solubility in organic solvents by machine learning algorithms
    Zhuyifan Ye
    Defang Ouyang
    Journal of Cheminformatics, 13
  • [24] Prediction of small-molecule compound solubility in organic solvents by machine learning algorithms
    Ye, Zhuyifan
    Ouyang, Defang
    JOURNAL OF CHEMINFORMATICS, 2021, 13 (01)
  • [25] Development of Advanced Machine Learning Models for Predicting CO2 Solubility in Brine
    Du, Xuejia
    Thakur, Ganesh C.
    ENERGIES, 2025, 18 (05)
  • [26] Machine Learning Methods for Predicting HLA-Peptide Binding Activity
    Luo, Heng
    Ye, Hao
    Ng, Hui Wen
    Shi, Leming
    Tong, Weida
    Mendrick, Donna L.
    Hong, Huixiao
    BIOINFORMATICS AND BIOLOGY INSIGHTS, 2015, 9 : 21 - 29
  • [27] Machine Learning in Quantitative Protein-peptide Affinity Prediction: Implications for Therapeutic Peptide Design
    Li, Zhongyan
    Miao, Qingqing
    Yan, Fugang
    Meng, Yang
    Zhou, Pcng
    CURRENT DRUG METABOLISM, 2019, 20 (03) : 170 - 176
  • [28] Application of CO2 Supercritical Fluid to Optimize the Solubility of Oxaprozin: Development of Novel Machine Learning Predictive Models
    Alshahrani, Saad M.
    Al Saqr, Ahmed
    Alfadhel, Munerah M.
    Alshetaili, Abdullah S.
    Almutairy, Bjad K.
    Alsubaiyel, Amal M.
    Almari, Ali H.
    Alamoudi, Jawaher Abdullah
    Abourehab, Mohammed A. S.
    MOLECULES, 2022, 27 (18):
  • [29] Machine learning on protein-protein interaction prediction: models, challenges and trends
    Tang, Tao
    Zhang, Xiaocai
    Liu, Yuansheng
    Peng, Hui
    Zheng, Binshuang
    Yin, Yanlin
    Zeng, Xiangxiang
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (02)
  • [30] Human Activity Classification Using Basic Machine Learning Models
    Khanal, Bikram
    Rivas, Pablo
    Orduz, Javier
    2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 121 - 126