DLm6Am: A Deep-Learning-Based Tool for Identifying N6,2′-O-Dimethyladenosine Sites in RNA Sequences

被引:19
作者
Luo, Zhengtao [1 ]
Su, Wei [2 ,3 ]
Lou, Liliang [1 ]
Qiu, Wangren [1 ]
Xiao, Xuan [1 ]
Xu, Zhaochun [1 ]
机构
[1] Jingdezhen Ceram Univ, Comp Dept, Jingdezhen 333403, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Life Sci & Technol, Chengdu 610054, Peoples R China
[3] Univ Elect Sci & Technol China, Ctr Informat Biol, Chengdu 610054, Peoples R China
关键词
N6,2 '-O-dimethyladenosine; m(6)Am site identification; deep learning; N6-METHYLADENOSINE SITES; WEB SERVER; IDENTIFICATION; METHYLATION; PREDICTION; PROTEIN; M6A;
D O I
10.3390/ijms231911026
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
N6,2'-O-dimethyladenosine (m(6)Am) is a post-transcriptional modification that may be associated with regulatory roles in the control of cellular functions. Therefore, it is crucial to accurately identify transcriptome-wide m(6)Am sites to understand underlying m(6)Am-dependent mRNA regulation mechanisms and biological functions. Here, we used three sequence-based feature-encoding schemes, including one-hot, nucleotide chemical property (NCP), and nucleotide density (ND), to represent RNA sequence samples. Additionally, we proposed an ensemble deep learning framework, named DLm6Am, to identify m(6)Am sites. DLm6Am consists of three similar base classifiers, each of which contains a multi-head attention module, an embedding module with two parallel deep learning sub-modules, a convolutional neural network (CNN) and a Bi-directional long short-term memory (BiLSTM), and a prediction module. To demonstrate the superior performance of our model's architecture, we compared multiple model frameworks with our method by analyzing the training data and independent testing data. Additionally, we compared our model with the existing state-of-the-art computational methods, m6AmPred and MultiRM. The accuracy (ACC) for the DLm6Am model was improved by 6.45% and 8.42% compared to that of m6AmPred and MultiRM on independent testing data, respectively, while the area under receiver operating characteristic curve (AUROC) for the DLm6Am model was increased by 4.28% and 5.75%, respectively. All the results indicate that DLm6Am achieved the best prediction performance in terms of ACC, Matthews correlation coefficient (MCC), AUROC, and the area under precision and recall curves (AUPR). To further assess the generalization performance of our proposed model, we implemented chromosome-level leave-out cross-validation, and found that the obtained AUROC values were greater than 0.83, indicating that our proposed method is robust and can accurately predict m(6)Am sites.
引用
收藏
页数:14
相关论文
共 49 条
  • [1] Cap-specific terminal N6-methylation of RNA by an RNA polymerase II-associated methyltransferase
    Akichika, Shinichiro
    Hirano, Seiichi
    Shichino, Yuichi
    Suzuki, Takeo
    Nishimasu, Hiroshi
    Ishitani, Ryuichiro
    Sugita, Ai
    Hirose, Yutaka
    Iwasaki, Shintaro
    Nureki, Osamu
    Suzuki, Tsutomu
    [J]. SCIENCE, 2019, 363 (6423) : 141 - 141
  • [2] DeepCpG: accurate prediction of single-cell DNA methylation states using deep learning
    Angermueller, Christof
    Lee, Heather J.
    Reik, Wolf
    Stegle, Oliver
    [J]. GENOME BIOLOGY, 2017, 18
  • [3] [Anonymous], DLM6AM DEEP LEARNING
  • [4] Dynamic regulation of N6,2′-O-dimethyladenosine (m6Am) in obesity
    Ben-Haim, Moshe Shay
    Pinto, Yishay
    Moshitch-Moshkovitz, Sharon
    Hershkovitz, Vera
    Kol, Nitzan
    Diamant-Levi, Tammy
    Beeri, Michal Schnaider
    Amariglio, Ninette
    Cohen, Haim Y.
    Rechavi, Gideon
    [J]. NATURE COMMUNICATIONS, 2021, 12 (01)
  • [5] MODOMICS: a database of RNA modification pathways. 2021 update
    Boccaletto, Pietro
    Stefaniak, Filip
    Ray, Angana
    Cappannini, Andrea
    Mukherjee, Sunandan
    Purta, Elzbieta
    Kurkowska, Malgorzata
    Shirvanizadeh, Niloofar
    Destefanis, Eliana
    Groza, Paula
    Avsar, Gulben
    Romitelli, Antonia
    Pir, Pinar
    Dassi, Erik
    Conticello, Silvestro G.
    Aguilo, Francesca
    Bujnicki, Janusz M.
    [J]. NUCLEIC ACIDS RESEARCH, 2022, 50 (D1) : D231 - D235
  • [6] Identification of the m6Am Methyltransferase PCIF1 Reveals the Location and Functions of m6Am in the Transcriptome
    Boulias, Konstantinos
    Toczydlowska-Socha, Diana
    Hawley, Ben R.
    Liberman, Noa
    Takashima, Ken
    Zaccara, Sara
    Guez, Theo
    Vasseur, Jean-Jacques
    Debart, Francoise
    Aravind, L.
    Jaffrey, Samie R.
    Greer, Eric Lieberman
    [J]. MOLECULAR CELL, 2019, 75 (03) : 631 - +
  • [7] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [8] WHISTLE: a high-accuracy map of the human N6-methyladenosine (m6A) epitranscriptome predicted using a machine learning approach
    Chen, Kunqi
    Wei, Zhen
    Zhang, Qing
    Wu, Xiangyu
    Rong, Rong
    Lu, Zhiliang
    Su, Jionglong
    de Magalhaes, Joao Pedro
    Rigden, Daniel J.
    Meng, Jia
    [J]. NUCLEIC ACIDS RESEARCH, 2019, 47 (07)
  • [9] MethyRNA: a web server for identification of N6-methyladenosine sites
    Chen, Wei
    Tang, Hua
    Lin, Hao
    [J]. JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2017, 35 (03) : 683 - 687
  • [10] iRNA-3typeA: Identifying Three Types of Modification at RNA's Adenosine Sites
    Chen, Wei
    Feng, Pengmian
    Yang, Hui
    Ding, Hui
    Lin, Hao
    Chou, Kuo-Chen
    [J]. MOLECULAR THERAPY-NUCLEIC ACIDS, 2018, 11 : 468 - 474