Deep learning for NAD/NADP cofactor prediction and engineering using transformer attention analysis in enzymes

被引：0

作者：

Kim, Jaehyung ^{[1
]}

Woo, Jihoon ^{[1
]}

Park, Joon Young ^{[1
]}

Kim, Kyung-Jin ^{[2
]}

Kim, Donghyuk ^{[1
]}

机构：

[1] Ulsan Natl Inst Sci & Technol UNIST, Sch Energy & Chem Engn, Ulsan 44919, South Korea

[2] Kyungpook Natl Univ, KNU Inst Microbiol, Sch Life Sci, BK21 FOUR KNU Creat Biores Grp, Daegu 41566, South Korea

来源：

METABOLIC ENGINEERING | 2025年 / 87卷

基金：

新加坡国家研究基金会;

关键词：

NAD(P) specificity; Cofactor switching; Deep learning; Explainable AI; Protein engineering; Synthetic biology; COENZYME SPECIFICITY; REDUCTASE; BINDING; CLASSIFICATION; DEHYDROGENASE; PREFERENCE; PHOSPHATE; SUBSTRATE; SEQUENCE;

D O I：

10.1016/j.ymben.2024.11.007

中图分类号：

Q81 [生物工程学（生物技术）]; Q93 [微生物学];

学科分类号：

071005 ; 0836 ; 090102 ; 100705 ;

摘要：

Understanding and manipulating the cofactor preferences of NAD(P)-dependent oxidoreductases, the most widely distributed enzyme group in nature, is increasingly crucial in bioengineering. However, large-scale identification of the cofactor preferences and the design of mutants to switch cofactor specificity remain as complex tasks. Here, we introduce DISCODE (Deep learning-based Iterative pipeline to analyze Specificity of COfactors and to Design Enzyme), a novel transformer-based deep learning model to predict NAD(P) cofactor preferences. For model training, a total of 7,132 NAD(P)-dependent enzyme sequences were collected. Leveraging whole-length sequence information, DISCODE classifies the cofactor preferences of NAD(P)dependent oxidoreductase protein sequences without structural or taxonomic limitation. The model showed 97.4% and 97.3% of accuracy and F1 score, respectively. A notable feature of DISCODE is the interpretability of its transformer layers. Analysis of attention layers in the model enables identification of several residues that showed significantly higher attention weights. They were well aligned with structurally important residues that closely interact with NAD(P), facilitating the identification of key residues for determining cofactor specificities. These key residues showed high consistency with verified cofactor switching mutants. Integrated into an enzyme design pipeline, DISCODE coupled with attention analysis, enables a fully automated approach to redesign cofactor specificity.

引用

页码：86 / 94

页数：9

共 50 条

[41] Deep-ProBind: binding protein prediction with transformer-based deep learning model
Khan, Salman
Noor, Sumaiya
Awan, Hamid Hussain
Iqbal, Shehryar
Alqahtani, Salman A.
Dilshad, Naqqash
Ahmad, Nijad
BMC BIOINFORMATICS, 2025, 26 (01):
[42] GraphKM: machine and deep learning for KM prediction of wildtype and mutant enzymes
He, Xiao
Yan, Ming
BMC BIOINFORMATICS, 2024, 25 (01)
[43] GraphKM: machine and deep learning for KM prediction of wildtype and mutant enzymes
Xiao He
Ming Yan
BMC Bioinformatics, 25
[44] Comparative performance analysis of quantum machine learning with deep learning for diabetes prediction
Gupta, Himanshu
Varshney, Hirdesh
Sharma, Tarun Kumar
Pachauri, Nikhil
Verma, Om Prakash
COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (04) : 3073 - 3087
[45] Nonintrusive Load Monitoring (NILM) Using a Deep Learning Model with a Transformer-Based Attention Mechanism and Temporal Pooling
Azad, Mohammad Irani
Rajabi, Roozbeh
Estebsari, Abouzar
ELECTRONICS, 2024, 13 (02)
[46] Sea Surface Height Prediction With Deep Learning Based on Attention Mechanism
Liu, Jingjing
Jin, Baogang
Wang, Lei
Xu, Lingyu
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[47] Mobile traffic prediction with attention-based hybrid deep learning
Wang, Li
Che, Linxiao
Lam, Kwok-Yan
Liu, Wenqiang
Li, Feng
PHYSICAL COMMUNICATION, 2024, 66
[48] Feature Engineering for Mid-Price Prediction With Deep Learning
Ntakaris, Adamantios
Mirone, Giorgio
Kanniainen, Juho
Gabbouj, Moncef
Iosifidis, Alexandros
IEEE ACCESS, 2019, 7 : 82390 - 82412
[49] XDeMo: a novel deep learning framework for DNA motif mining using transformer models
Chaurasia, Rajashree
Ghose, Udayan
NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2024, 13 (01):
[50] DeepPGD: A Deep Learning Model for DNA Methylation Prediction Using Temporal Convolution, BiLSTM, and Attention Mechanism
Teragawa, Shoryu
Wang, Lei
Liu, Yi
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (15)

← 1 2 3 4 5 →