Improving protein solubility and activity by introducing small peptide tags designed with machine learning models

Cited by: 41
Authors
Han, Xi [1]
Ning, Wenbo [1]
Ma, Xiaoqiang [2]
Wang, Xiaonan [1]
Zhou, Kang [1,2]
Affiliations
[1] Natl Univ Singapore, Dept Chem & Biomol Engn, Singapore 117585, Singapore
[2] Singapore MIT Alliance Res & Technol, Disrupt Sustainable Technol Agr Precis, Singapore 138602, Singapore
Source
METABOLIC ENGINEERING COMMUNICATIONS | 2020 / Vol. 11
Funding
National Research Foundation, Singapore;
Keywords
Protein solubility; Protein activity; Machine learning; Optimization; Peptide tags; ESCHERICHIA-COLI; EXPRESSION;
DOI
10.1016/j.mec.2020.e00138
Chinese Library Classification (CLC) number
Q81 [Bioengineering (biotechnology)]; Q93 [Microbiology];
Subject classification codes
071005; 0836; 090102; 100705;
Abstract
Improving catalytic ability of enzymes is critical to the success of many metabolic engineering projects, but the search space of possible protein mutants is too large to explore exhaustively through experiments. To some extent, highly soluble enzymes tend to exhibit high activity due to their better folding quality. Here, we demonstrate that an optimization algorithm based on a regression model can effectively design short peptide tags to improve solubility of a few model enzymes. Based on the protein sequence information, a support vector regression model we recently developed was used to evaluate protein solubility after small peptide tags were introduced to a target protein. The optimization algorithm guided the sequences of the tags to evolve towards variants that had higher solubility. The optimization results were validated successfully by measuring solubility and activity of the model enzyme with and without the identified tags. The solubility of one protein (tyrosine ammonia lyase) was more than doubled and its activity was improved by 250%. This strategy successfully increased solubility of another two enzymes (aldehyde dehydrogenase and 1-deoxy-D-xylulose-5-phosphate synthase) we tested. The presented optimization methodology thus provides a valuable tool for improving enzyme performance for metabolic engineering and other biotechnology projects.
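The tag-design loop described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: `toy_solubility_score` is a hypothetical placeholder (fraction of charged residues) standing in for their support vector regression model, the simple accept-if-better mutation search stands in for their optimization algorithm, and the N-terminal tag placement, tag length, and example protein sequence are all assumptions made for illustration.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def toy_solubility_score(sequence):
    """Hypothetical surrogate: fraction of charged residues (D, E, K, R).

    This is only a stand-in for the trained regression model the abstract
    mentions; it is NOT a validated solubility predictor.
    """
    charged = sum(sequence.count(aa) for aa in "DEKR")
    return charged / len(sequence)


def mutate(tag, rng):
    """Return a copy of the tag with one randomly chosen position resampled."""
    pos = rng.randrange(len(tag))
    return tag[:pos] + rng.choice(AMINO_ACIDS) + tag[pos + 1:]


def optimize_tag(protein, tag_length=10, iterations=500, seed=0,
                 score=toy_solubility_score):
    """Search for a tag that raises the predicted score of tag + protein.

    Assumption: the tag is fused at the N-terminus (tag + protein).
    A mutation is kept only if it strictly improves the predicted score.
    """
    rng = random.Random(seed)
    best_tag = "".join(rng.choice(AMINO_ACIDS) for _ in range(tag_length))
    best_score = score(best_tag + protein)
    for _ in range(iterations):
        candidate = mutate(best_tag, rng)
        candidate_score = score(candidate + protein)
        if candidate_score > best_score:
            best_tag, best_score = candidate, candidate_score
    return best_tag, best_score


if __name__ == "__main__":
    example_protein = "MKTAYIAKQRLLGSPVWT"  # hypothetical sequence, not from the paper
    tag, predicted = optimize_tag(example_protein)
    print(tag, round(predicted, 3))
```

In practice the `score` argument would be replaced by a call to a trained regression model, and any tag the search proposes would still need experimental validation, as the abstract reports for the three enzymes tested.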
Pages: 9