TMSC-m7G: A transformer architecture based on multi-sense-scaled embedding features and convolutional neural network to identify RNA N7-methylguanosine sites
被引:8
作者:
Zhang, Shengli
论文数: 0引用数: 0
h-index: 0
机构:
Xidian Univ, Sch Math & Stat, Xian 710071, Peoples R China
Key Lab Computat Sci & Applicat Hainan Prov, Haikou 571158, Peoples R ChinaXidian Univ, Sch Math & Stat, Xian 710071, Peoples R China
Zhang, Shengli
[1
,2
]
Xu, Yujie
论文数: 0引用数: 0
h-index: 0
机构:
Xidian Univ, Sch Math & Stat, Xian 710071, Peoples R ChinaXidian Univ, Sch Math & Stat, Xian 710071, Peoples R China
Xu, Yujie
[1
]
Liang, Yunyun
论文数: 0引用数: 0
h-index: 0
机构:
Xian Polytech Univ, Sch Sci, Xian 710048, Peoples R ChinaXidian Univ, Sch Math & Stat, Xian 710071, Peoples R China
Liang, Yunyun
[3
]
机构:
[1] Xidian Univ, Sch Math & Stat, Xian 710071, Peoples R China
[2] Key Lab Computat Sci & Applicat Hainan Prov, Haikou 571158, Peoples R China
[3] Xian Polytech Univ, Sch Sci, Xian 710048, Peoples R China
RNA N7-methylguanosine;
Natural language processing;
Word embedding;
Transformer;
Convolutional neural network;
CAP STRUCTURE;
CD-HIT;
MODEL;
IDENTIFICATION;
REVEALS;
PROTEIN;
ROLES;
CODE;
D O I:
10.1016/j.csbj.2023.11.052
中图分类号:
Q5 [生物化学];
Q7 [分子生物学];
学科分类号:
071010 ;
081704 ;
摘要:
RNA N7-methylguanosine (m7G) is a crucial chemical modification of RNA molecules, whose principal duty is to maintain RNA function and protein translation. Studying and predicting RNA N7-methylguanosine sites aid in comprehending the biological function of RNA and the development of new drug therapy regimens. In the present scenario, the efficacy of techniques, specifically deep learning and machine learning, stands out in the prediction of RNA N7-methylguanosine sites, leading to improved accuracy and identification efficiency. In this study, we propose a model leveraging the transformer framework that integrates natural language processing and deep learning to predict m7G sites, called TMSC-m7G. In TMSC-m7G, a combination of multi-sense-scaled token embedding and fixed-position embedding is used to replace traditional word embedding for the extraction of contextual information from sequences. Moreover, a convolutional layer is added in the encoder to make up for the shortage of local information acquisition in transformer. The model's robustness and generalization are validated through 10-fold cross-validation and an independent dataset test. Results demonstrate outstanding performance in comparison to the most advanced models available. Among them, the Accuracy of TMSC-m7G reaches 98.70% and 92.92% on the benchmark dataset and independent dataset, respectively. To facilitate the popularization and use of the model, we have developed an intuitive online prediction tool, which is easily accessible for free at http://39.105.212.81/.
机构:
Chengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R China
North China Univ Sci & Technol, Sch Life Sci, Ctr Genom & Computat Biol, Tangshan 063000, Peoples R ChinaChengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R China
Chen, Wei
;
Feng, Pengmian
论文数: 0引用数: 0
h-index: 0
机构:
Chengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R ChinaChengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R China
Feng, Pengmian
;
Song, Xiaoming
论文数: 0引用数: 0
h-index: 0
机构:
North China Univ Sci & Technol, Sch Life Sci, Ctr Genom & Computat Biol, Tangshan 063000, Peoples R ChinaChengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R China
Song, Xiaoming
;
Lv, Hao
论文数: 0引用数: 0
h-index: 0
机构:
Univ Elect Sci & Technol China, Ctr Informat Biol, Sch Life Sci & Technol, Key Lab Neuroinformat,Minist Educ, Chengdu 610054, Sichuan, Peoples R ChinaChengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R China
Lv, Hao
;
Lin, Hao
论文数: 0引用数: 0
h-index: 0
机构:
Univ Elect Sci & Technol China, Ctr Informat Biol, Sch Life Sci & Technol, Key Lab Neuroinformat,Minist Educ, Chengdu 610054, Sichuan, Peoples R ChinaChengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R China
机构:
Tianjin Univ, Fac Intelligence & Comp, Sch Comp Sci & Technol, Tianjin, Peoples R ChinaTianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
Dai, Chichi
;
Feng, Pengmian
论文数: 0引用数: 0
h-index: 0
机构:
Chengdu Univ Tradit Chinese Med, Chengdu, Peoples R ChinaTianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
Feng, Pengmian
;
Cui, Lizhen
论文数: 0引用数: 0
h-index: 0
机构:
Shandong Univ, Sch Software, Jinan, Peoples R China
E Commerce Res Ctr, Jinan, Peoples R China
Res Ctr Software & Data Engn, Jinan, Peoples R ChinaTianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
Cui, Lizhen
;
Su, Ran
论文数: 0引用数: 0
h-index: 0
机构:
Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R ChinaTianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
Su, Ran
;
Chen, Wei
论文数: 0引用数: 0
h-index: 0
机构:
North China Univ Sci & Technol, Sch Life Sci, Qinhuangdao, Hebei, Peoples R ChinaTianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
Chen, Wei
;
Wei, Leyi
论文数: 0引用数: 0
h-index: 0
机构:
Shandong Univ, Sch Software, Jinan, Peoples R ChinaTianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
机构:
Chengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R China
North China Univ Sci & Technol, Sch Life Sci, Ctr Genom & Computat Biol, Tangshan 063000, Peoples R ChinaChengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R China
Chen, Wei
;
Feng, Pengmian
论文数: 0引用数: 0
h-index: 0
机构:
Chengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R ChinaChengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R China
Feng, Pengmian
;
Song, Xiaoming
论文数: 0引用数: 0
h-index: 0
机构:
North China Univ Sci & Technol, Sch Life Sci, Ctr Genom & Computat Biol, Tangshan 063000, Peoples R ChinaChengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R China
Song, Xiaoming
;
Lv, Hao
论文数: 0引用数: 0
h-index: 0
机构:
Univ Elect Sci & Technol China, Ctr Informat Biol, Sch Life Sci & Technol, Key Lab Neuroinformat,Minist Educ, Chengdu 610054, Sichuan, Peoples R ChinaChengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R China
Lv, Hao
;
Lin, Hao
论文数: 0引用数: 0
h-index: 0
机构:
Univ Elect Sci & Technol China, Ctr Informat Biol, Sch Life Sci & Technol, Key Lab Neuroinformat,Minist Educ, Chengdu 610054, Sichuan, Peoples R ChinaChengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R China
机构:
Tianjin Univ, Fac Intelligence & Comp, Sch Comp Sci & Technol, Tianjin, Peoples R ChinaTianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
Dai, Chichi
;
Feng, Pengmian
论文数: 0引用数: 0
h-index: 0
机构:
Chengdu Univ Tradit Chinese Med, Chengdu, Peoples R ChinaTianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
Feng, Pengmian
;
Cui, Lizhen
论文数: 0引用数: 0
h-index: 0
机构:
Shandong Univ, Sch Software, Jinan, Peoples R China
E Commerce Res Ctr, Jinan, Peoples R China
Res Ctr Software & Data Engn, Jinan, Peoples R ChinaTianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
Cui, Lizhen
;
Su, Ran
论文数: 0引用数: 0
h-index: 0
机构:
Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R ChinaTianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
Su, Ran
;
Chen, Wei
论文数: 0引用数: 0
h-index: 0
机构:
North China Univ Sci & Technol, Sch Life Sci, Qinhuangdao, Hebei, Peoples R ChinaTianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
Chen, Wei
;
Wei, Leyi
论文数: 0引用数: 0
h-index: 0
机构:
Shandong Univ, Sch Software, Jinan, Peoples R ChinaTianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China