MTTLm6A: A multi-task transfer learning approach for base-resolution mRNA m6A site prediction based on an improved transformer

被引:2
作者
Wang, Honglei [1 ,2 ]
Zeng, Wenliang [1 ]
Huang, Xiaoling [1 ]
Liu, Zhaoyang [1 ]
Sun, Yanjing [1 ]
Zhang, Lin [1 ]
机构
[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou, Peoples R China
[2] Xuzhou Coll Ind Technol, Sch Informat Engn, Xuzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
RNA modification site; multi-task learning; transfer learning; natural language processing; deep learning; NETWORK; LANGUAGE; MODEL;
D O I
10.3934/mbe.2024013
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
N6-methyladenosine (m6A) is a crucial RNA modification involved in various biological activities. Computational methods have been developed for the detection of m6A sites in Saccharomyces cerevisiae at base-resolution due to their cost-effectiveness and efficiency. However, the generalization of these methods has been hindered by limited base-resolution datasets. Additionally, RMBase contains a vast number of low-resolution m6A sites for Saccharomyces cerevisiae, and base -resolution sites are often inferred from these low-resolution results through post-calibration. We propose MTTLm6A, a multi-task transfer learning approach for base-resolution mRNA m6A site prediction based on an improved transformer. First, the RNA sequences are encoded by using one-hot encoding. Then, we construct a multi-task model that combines a convolutional neural network with a multi-head-attention deep framework. This model not only detects low-resolution m6A sites, it also assigns reasonable probabilities to the predicted sites. Finally, we employ transfer learning to predict base-resolution m6A sites based on the low-resolution m6A sites. Experimental results on Saccharomyces cerevisiae m6A and Homo sapiens m1A data demonstrate that MTTLm6A respectively achieved area under the receiver operating characteristic (AUROC) values of 77.13% and 92.9%, outperforming the state-of-the-art models. At the same time, it shows that the model has strong generalization ability. To enhance user convenience, we have made a user-friendly web server for MTTLm6A publicly available at http://47.242.23.141/MTTLm6A/index.php.
引用
收藏
页码:272 / 299
页数:28
相关论文
共 71 条
[1]   Nanopore-Based Detection of Viral RNA Modifications [J].
Abebe, Jonathan S. ;
Verstraten, Ruth ;
Depledge, Daniel P. .
MBIO, 2022, 13 (03)
[2]  
Ahmed F., 2011, J. Nat. Sci. Biol. Med., V2, P1414
[3]   Machine learning and deep learning methods that use omics data for metastasis prediction [J].
Albaradei, Somayah ;
Thafar, Maha ;
Alsaedi, Asim ;
Van Neste, Christophe ;
Gojobori, Takashi ;
Essack, Magbubah ;
Gao, Xin .
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 :5008-5018
[4]   Review of deep learning: concepts, CNN architectures, challenges, applications, future directions [J].
Alzubaidi, Laith ;
Zhang, Jinglan ;
Humaidi, Amjad J. ;
Al-Dujaili, Ayad ;
Duan, Ye ;
Al-Shamma, Omran ;
Santamaria, J. ;
Fadhel, Mohammed A. ;
Al-Amidie, Muthana ;
Farhan, Laith .
JOURNAL OF BIG DATA, 2021, 8 (01)
[5]  
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[6]   RM2Target: a comprehensive database for targets of writers, erasers and readers of RNA modifications [J].
Bao, Xiaoqiong ;
Zhang, Yin ;
Li, Huiqin ;
Teng, Yuyan ;
Ma, Lixia ;
Chen, Zhihang ;
Luo, Xiaotong ;
Zheng, Jian ;
Zhao, An ;
Ren, Jian ;
Zuo, Zhixiang .
NUCLEIC ACIDS RESEARCH, 2023, 51 (D1) :D269-D279
[7]   Feature extraction approaches for biological sequences: a comparative study of mathematical features [J].
Bonidia, Robson P. ;
Sampaio, Lucas D. H. ;
Domingues, Douglas S. ;
Paschoal, Alexandre R. ;
Lopes, Fabricio M. ;
de Carvalho, Andre C. P. L. F. ;
Sanches, Danilo S. .
BRIEFINGS IN BIOINFORMATICS, 2021, 22 (05)
[8]   The emerging role of RNA modifications in the regulation of mRNA stability [J].
Boo, Sung Ho ;
Kim, Yoon Ki .
EXPERIMENTAL AND MOLECULAR MEDICINE, 2020, 52 (03) :400-408
[9]   iRNA(m6A)-PseDNC: Identifying N6-methyladenosine sites using pseudo dinucleotide composition [J].
Chen, Wei ;
Ding, Hui ;
Zhou, Xu ;
Lin, Hao ;
Chou, Kuo-Chen .
ANALYTICAL BIOCHEMISTRY, 2018, 561 :59-65
[10]   MethyRNA: a web server for identification of N6-methyladenosine sites [J].
Chen, Wei ;
Tang, Hua ;
Lin, Hao .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2017, 35 (03) :683-687