A convolution neural network-based computational model to identify the occurrence sites of various RNA modifications by fusing varied features

Cited by: 10
Authors
Tahir, Muhammad [1]
Hayat, Maqsood [1]
Chong, Kil To [2, 3]
Affiliations
[1] Abdul Wali Khan Univ Mardan, Dept Comp Sci, Mardan 23200, KP, Pakistan
[2] Jeonbuk Natl Univ, Dept Elect & Informat Engn, Jeonju 54896, South Korea
[3] Jeonbuk Natl Univ, Adv Elect & Informat Res Ctr, Jeonju 54896, South Korea
Funding
National Research Foundation of Singapore;
Keywords
Deep learning; RNA Modifications; k-Gram; Feature extraction; Convolution neural network; Data processing; SEQUENCE-BASED PREDICTOR; N-6-METHYLADENOSINE SITES; N6-METHYLADENOSINE SITES; METHYLATION; 5-METHYLCYTOSINE; PROTEINS;
DOI
10.1016/j.chemolab.2021.104233
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
RNA modification occurs in both prokaryotic and eukaryotic genomes and is considered one of the major RNA properties. RNA modifications are a main part of the regulatory landscape found in genes, involving several bioprocesses at the post-transcriptional level. Therefore, identifying the residues at which RNA modifications occur is essential for determining their molecular functions and the relevant mechanisms. Although wet-lab experiments for identifying RNA modification sites have produced satisfactory results, these experimental approaches are highly labor-intensive and costly. It is therefore indispensable to establish a novel and robust computational approach for the prediction of RNA modification sites. To address these issues, an intelligent computational predictor called "iRNA-Mod-CNN", based on deep learning, is developed to identify RNA modification sites. First, the biological sequences are encoded using the one-hot encoding method. Then the encoded feature vector is provided to the convolution neural network (CNN) model in order to discern the concealed information. Further, the k-Gram feature space is fused with the CNN feature space. The computational predictor "iRNA-Mod-CNN" showed significant improvement over existing methods, achieving accuracies of 99.56%, 92.39%, and 86.66% on the m1A, m6A, and m5C benchmark datasets, respectively.
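The two sequence encodings named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the `ACGU` alphabet, the choice of k = 2, the sample sequence, and fusion by plain concatenation are all assumptions made for illustration, and the flattened one-hot matrix merely stands in for the features a trained CNN would produce.

```python
from itertools import product

ALPHABET = "ACGU"  # assumed RNA alphabet


def one_hot(seq):
    """Encode each nucleotide as a 4-element indicator row (A, C, G, U)."""
    index = {base: i for i, base in enumerate(ALPHABET)}
    rows = []
    for base in seq.upper():
        row = [0] * len(ALPHABET)
        if base in index:  # unknown symbols such as N stay all-zero
            row[index[base]] = 1
        rows.append(row)
    return rows


def k_gram_frequencies(seq, k=2):
    """Normalised frequency of every length-k substring, in a fixed order."""
    seq = seq.upper()
    grams = ["".join(p) for p in product(ALPHABET, repeat=k)]
    counts = dict.fromkeys(grams, 0)
    for i in range(len(seq) - k + 1):
        gram = seq[i:i + k]
        if gram in counts:
            counts[gram] += 1
    total = max(len(seq) - k + 1, 1)  # avoid division by zero on short input
    return [counts[g] / total for g in grams]


def fuse(cnn_features, kgram_features):
    """Fuse two feature vectors by concatenation (an assumed fusion rule)."""
    return list(cnn_features) + list(kgram_features)


sample = "GGACU"  # hypothetical sequence, not taken from the benchmark data
encoded = one_hot(sample)
kgram = k_gram_frequencies(sample, k=2)
# Stand-in for learned CNN features: the flattened one-hot matrix.
flat = [value for row in encoded for value in row]
fused = fuse(flat, kgram)
```

In this sketch a sequence of length n yields an n x 4 one-hot matrix and a 4^k-element k-gram vector, so the fused vector for the sample has 5 x 4 + 16 entries.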
Pages: 6
Related papers
66 in total