Deep learning for NAD/NADP cofactor prediction and engineering using transformer attention analysis in enzymes

Cited by: 0
Authors
Kim, Jaehyung [1]
Woo, Jihoon [1]
Park, Joon Young [1]
Kim, Kyung-Jin [2]
Kim, Donghyuk [1]
Affiliations
[1] Ulsan Natl Inst Sci & Technol UNIST, Sch Energy & Chem Engn, Ulsan 44919, South Korea
[2] Kyungpook Natl Univ, KNU Inst Microbiol, Sch Life Sci, BK21 FOUR KNU Creat Biores Grp, Daegu 41566, South Korea
Funding
National Research Foundation, Singapore;
Keywords
NAD(P) specificity; Cofactor switching; Deep learning; Explainable AI; Protein engineering; Synthetic biology; COENZYME SPECIFICITY; REDUCTASE; BINDING; CLASSIFICATION; DEHYDROGENASE; PREFERENCE; PHOSPHATE; SUBSTRATE; SEQUENCE;
DOI
10.1016/j.ymben.2024.11.007
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Subject classification codes
071005; 0836; 090102; 100705;
Abstract
Understanding and manipulating the cofactor preferences of NAD(P)-dependent oxidoreductases, the most widely distributed enzyme group in nature, is increasingly crucial in bioengineering. However, large-scale identification of cofactor preferences and the design of mutants that switch cofactor specificity remain complex tasks. Here, we introduce DISCODE (Deep learning-based Iterative pipeline to analyze Specificity of COfactors and to Design Enzyme), a novel transformer-based deep learning model that predicts NAD(P) cofactor preferences. For model training, a total of 7,132 NAD(P)-dependent enzyme sequences were collected. Leveraging full-length sequence information, DISCODE classifies the cofactor preferences of NAD(P)-dependent oxidoreductase protein sequences without structural or taxonomic limitation. The model achieved an accuracy of 97.4% and an F1 score of 97.3%. A notable feature of DISCODE is the interpretability of its transformer layers. Analysis of the model's attention layers identifies several residues with significantly higher attention weights. These residues align well with structurally important residues that closely interact with NAD(P), facilitating the identification of key residues that determine cofactor specificity. These key residues were highly consistent with verified cofactor-switching mutants. Integrated into an enzyme design pipeline, DISCODE coupled with attention analysis enables a fully automated approach to redesigning cofactor specificity.
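As a rough illustration of the attention-analysis idea described in the abstract, the sketch below flags sequence positions that receive unusually high attention. It is not the authors' implementation: the abstract does not state how "significantly higher" is determined, so the z-score criterion, the `z_threshold` value, the function names, and the toy attention matrix are all assumptions made for this example.

```python
from statistics import mean, pstdev


def attention_received(attn):
    """Average attention each position receives (column means of a square matrix)."""
    n = len(attn)
    return [sum(attn[i][j] for i in range(n)) / n for j in range(n)]


def high_attention_positions(attn, z_threshold=2.0):
    """Return 0-based positions whose received attention exceeds the mean
    by more than z_threshold standard deviations (assumed criterion)."""
    scores = attention_received(attn)
    mu = mean(scores)
    sigma = pstdev(scores)
    if sigma == 0:
        # All positions receive identical attention; nothing stands out.
        return []
    return [j for j, s in enumerate(scores) if (s - mu) / sigma > z_threshold]


# Hypothetical 6x6 attention matrix (each row sums to 1):
# every query position attends mostly to position 3.
toy = [[0.1, 0.1, 0.1, 0.5, 0.1, 0.1] for _ in range(6)]
print(high_attention_positions(toy))  # prints [3]
```

In a real setting, the matrix would come from a trained model's attention layer for one enzyme sequence, and flagged positions would be compared against residues known to contact NAD(P).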
Pages: 86-94
Page count: 9