MPEK: a multitask deep learning framework based on pretrained language models for enzymatic reaction kinetic parameters prediction

被引:6
|
作者
Wang, Jingjing [1 ]
Yang, Zhijiang [1 ]
Chen, Chang [1 ]
Yao, Ge [1 ]
Wan, Xiukun [1 ]
Bao, Shaoheng [1 ]
Ding, Junjie [1 ]
Wang, Liangliang [1 ]
Jiang, Hui [1 ]
机构
[1] State Key Lab NBC Protect Civilian, 37 South Cent St, Beijing 102205, Peoples R China
关键词
multitask deep learning; pretraining; enzymatic reaction; kcat prediction; Km prediction; PROTEIN; EVOLUTIONARY; BIOCATALYSIS; RESOURCE;
D O I
10.1093/bib/bbae387
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Enzymatic reaction kinetics are central in analyzing enzymatic reaction mechanisms and target-enzyme optimization, and thus in biomanufacturing and other industries. The enzyme turnover number (kcat) and Michaelis constant (Km), key kinetic parameters for measuring enzyme catalytic efficiency, are crucial for analyzing enzymatic reaction mechanisms and the directed evolution of target enzymes. Experimental determination of kcat and Km is costly in terms of time, labor, and cost. To consider the intrinsic connection between kcat and Km and further improve the prediction performance, we propose a universal pretrained multitask deep learning model, MPEK, to predict these parameters simultaneously while considering pH, temperature, and organismal information. Through testing on the same kcat and Km test datasets, MPEK demonstrated superior prediction performance over the previous models. Specifically, MPEK achieved the Pearson coefficient of 0.808 for predicting kcat, improving ca. 14.6% and 7.6% compared to the DLKcat and UniKP models, and it achieved the Pearson coefficient of 0.777 for predicting Km, improving ca. 34.9% and 53.3% compared to the Kroll_model and UniKP models. More importantly, MPEK was able to reveal enzyme promiscuity and was sensitive to slight changes in the mutant enzyme sequence. In addition, in three case studies, it was shown that MPEK has the potential for assisted enzyme mining and directed evolution. To facilitate in silico evaluation of enzyme catalytic efficiency, we have established a web server implementing this model, which can be accessed at http://mathtc.nscc-tj.cn/mpek.
引用
收藏
页数:11
相关论文
共 9 条
  • [1] CatPred: a comprehensive framework for deep learning in vitro enzyme kinetic parameters
    Boorla, Veda Sheersh
    Maranas, Costas D.
    NATURE COMMUNICATIONS, 2025, 16 (01)
  • [2] Language Models Based on Deep Learning: A Review
    Wang N.-Y.
    Ye Y.-X.
    Liu L.
    Feng L.-Z.
    Bao T.
    Peng T.
    Peng, Tao (tpeng@jlu.edu.cn), 1600, Chinese Academy of Sciences (32): : 1082 - 1115
  • [3] EITLEM-Kinetics: A deep-learning framework for kinetic parameter prediction of mutant enzymes
    Shen, Xiaowei
    Cui, Ziheng
    Long, Jianyu
    Zhang, Shiding
    Chen, Biqiang
    Tan, Tianwei
    CHEM CATALYSIS, 2024, 4 (09):
  • [4] xDeep-AcPEP: Deep Learning Method for Anticancer Peptide Activity Prediction Based on Convolutional Neural Network and Multitask Learning
    Chen, Jiarui
    Cheong, Hong Hin
    Siu, Shirley W., I
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2021, 61 (08) : 3789 - 3803
  • [5] LMPhosSite: A Deep Learning-Based Approach for General Protein Phosphorylation Site Prediction Using Embeddings from the Local Window Sequence and Pretrained Protein Language Model
    Pakhrin, Subash C.
    Pokharel, Suresh
    Pratyush, Pawel
    Chaudhari, Meenal
    Ismail, Hamid D.
    Dukka, B. K. C. B.
    JOURNAL OF PROTEOME RESEARCH, 2023, 22 (08) : 2548 - 2557
  • [6] An efficient hardware architecture based on an ensemble of deep learning models for COVID-19 prediction
    Sakthivel, R.
    Thaseen, I. Sumaiya
    Vanitha, M.
    Deepa, M.
    Angulakshmi, M.
    Mangayarkarasi, R.
    Mahendran, Anand
    Alnumay, Waleed
    Chatterjee, Puspita
    SUSTAINABLE CITIES AND SOCIETY, 2022, 80
  • [7] A brief survey of deep learning-based models for CircRNA-protein binding sites prediction
    Shen, Zhen
    Yuan, Lin
    Bao, Wenzheng
    Wang, Siguo
    Zhang, Qinhu
    Huang, De-Shuang
    NEUROCOMPUTING, 2025, 628
  • [8] Accurate RNA 3D structure prediction using a language model-based deep learning approach
    Shen, Tao
    Hu, Zhihang
    Sun, Siqi
    Liu, Di
    Wong, Felix
    Wang, Jiuming
    Chen, Jiayang
    Wang, Yixuan
    Hong, Liang
    Xiao, Jin
    Zheng, Liangzhen
    Krishnamoorthi, Tejas
    King, Irwin
    Wang, Sheng
    Yin, Peng
    Collins, James J.
    Li, Yu
    NATURE METHODS, 2024, : 2287 - 2298
  • [9] Towards Real-Time Sleep Stage Prediction and Online Calibration Based on Architecturally Switchable Deep Learning Models
    Zhu, Hangyu
    Wu, Yonglin
    Guo, Yao
    Fu, Cong
    Shu, Feng
    Yu, Huan
    Chen, Wei
    Chen, Chen
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (01) : 470 - 481