A deep learning model for prediction of lysine crotonylation sites by fusing multi-features based on multi-head self-attention mechanism

被引:0
作者
Liang, Yunyun [1 ]
Li, Minwei [1 ]
机构
[1] Xian Polytech Univ, Sch Sci, Xian 710048, Peoples R China
来源
SCIENTIFIC REPORTS | 2025年 / 15卷 / 01期
基金
中国国家自然科学基金;
关键词
Lysine crotonylation sites; Natural Language processing; Hand-crafted features; Multi-head self-attention mechanism; Convolutional neural network; Bidirectional gated recurrent unit; HISTONE CROTONYLATION; NETWORK;
D O I
10.1038/s41598-025-04058-5
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Lysine crotonylation (Kcr) is an important post-translational modification, which is present in both histone and non-histone proteins, and plays a key role in a variety of biological processes such as metabolism and cell differentiation. Therefore, rapid and accurate identification of this modification has become a key task to study its biological effects. In the past few years, some calculation methods have been developed, but there is room for improvement in prediction performance. In this paper, we propose an effective model named DeepMM-Kcr, which is based on multiple features and an innovative deep learning framework. Multiple features are extracted from natural language processing features and hand-crafted features, where natural language processing features include token embedding and positional embedding encoded by transformer, and hand-crafted features include one-hot, amino acid index and position-weighted amino acid composition, and encoded by bidirectional long short-term memory network. Then natural language processing features and hand-crafted features are fusing by multi-head self-attention mechanism. Finally, a deep learning framework is constructed based on convolutional neural network, bidirectional gated recurrent unit and multilayer perceptron for robust prediction of Kcr sites. On the independent test set, the accuracy of DeepMM-Kcr is highest among the existing models. The experimental results show that our model has very good performance in predicting Kcr sites. The source datasets and codes (in Python) are publicly available at https://github.com/yunyunliang88/DeepMM-Kcr.
引用
收藏
页数:12
相关论文
共 51 条
[1]   predML-Site: Predicting Multiple Lysine PTM Sites With Optimal Feature Representation and Data Imbalance Minimization [J].
Ahmed, Sabit ;
Rahman, Afrida ;
Hasan, Md. Al Mehedi ;
Rahman, Julia ;
Ben Islam, Md Khaled ;
Ahmad, Shamim .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) :3624-3634
[2]  
Bengio Y., 2014, P 2014 C EMP METH NA, P1724
[3]   NeuroPred-CLQ: incorporating deep temporal convolutional networks and multi-head attention mechanism to predict neuropeptides [J].
Chen, Shouzhi ;
Li, Qing ;
Zhao, Jianping ;
Bin, Yannan ;
Zheng, Chunhou .
BRIEFINGS IN BIOINFORMATICS, 2022, 23 (05)
[4]   nhKcr: a new bioinformatics tool for predicting crotonylation sites on human nonhistone proteins based on deep learning [J].
Chen, Yong-Zi ;
Wang, Zhuo-Zhi ;
Wang, Yanan ;
Ying, Guoguang ;
Chen, Zhen ;
Song, Jiangning .
BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
[5]   iKcr_CNN: A novel computational tool for imbalance classification of human nonhistone crotonylation sites based on convolutional neural networks with focal loss [J].
Dou, Lijun ;
Zhang, Zilong ;
Xu, Lei ;
Zou, Quan .
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 :3268-3279
[6]   FINDING STRUCTURE IN TIME [J].
ELMAN, JL .
COGNITIVE SCIENCE, 1990, 14 (02) :179-211
[7]   AFP-MFL: accurate identification of antifungal peptides using multi-view feature learning [J].
Fang, Yitian ;
Xu, Fan ;
Wei, Lesong ;
Jiang, Yi ;
Chen, Jie ;
Wei, Leyi ;
Wei, Dong-Qing .
BRIEFINGS IN BIOINFORMATICS, 2023, 24 (01)
[8]   Proteomic approaches beyond expression profiling and PTM analysis [J].
Fu, Jiaqi ;
Wu, Mei ;
Liu, Xiaoyun .
ANALYTICAL AND BIOANALYTICAL CHEMISTRY, 2018, 410 (17) :4051-4060
[9]   Structured Sparse Regularized TSK Fuzzy System for predicting therapeutic peptides [J].
Guo, Xiaoyi ;
Jiang, Yizhang ;
Zou, Quan .
BRIEFINGS IN BIOINFORMATICS, 2022, 23 (03)
[10]   ViBE: a hierarchical BERT model to identify eukaryotic viruses using metagenome sequencing data [J].
Gwak, Ho-Jin ;
Rho, Mina .
BRIEFINGS IN BIOINFORMATICS, 2022, 23 (04)