A convolution neural network-based computational model to identify the occurrence sites of various RNA modifications by fusing varied features

被引:10
作者
Tahir, Muhammad [1 ]
Hayat, Maqsood [1 ]
Chong, Kil To [2 ,3 ]
机构
[1] Abdul Wali Khan Univ Mardan, Dept Comp Sci, Mardan 23200, KP, Pakistan
[2] Jeonbuk Natl Univ, Dept Elect & Informat Engn, Jeonju 54896, South Korea
[3] Jeonbuk Natl Univ, Adv Elect & Informat Res Ctr, Jeonju 54896, South Korea
基金
新加坡国家研究基金会;
关键词
Deep learning; RNA Modifications; k-Gram; Feature extraction; Convolution neural network; Data processing; SEQUENCE-BASED PREDICTOR; N-6-METHYLADENOSINE SITES; N6-METHYLADENOSINE SITES; METHYLATION; 5-METHYLCYTOSINE; PROTEINS;
D O I
10.1016/j.chemolab.2021.104233
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
RNA modification occurs in both prokaryotic and eukaryotic genomes, which is considered one of the major RNA properties. RNA modifications are the main portions of the regulatory landscape found in genes, which contain several bioprocesses at the post-transcriptional level. Therefore, the identification of RNA modifications residue information is essential for determining their molecular functions and their relevant mechanisms. Although the wet lab experimental works for identification of RNA modification sites have produced satisfactory results, these experimental-based approaches are highly labor-intensive and precious. So, it is indispensable to establish a novel and robust computational approach for the prediction of RNA modification sites. To solve these issues, an intelligent computational predictor called ?iRNA-Mod-CNN?, using deep learning hypotheses is developed to identify RNA modification sites. First, the biological sequences are encoded by implementing the one-hot encoding method. Then encoded feature vector is provided to the convolution neural network (CNN) model in order to discern the conceal information. Further, k-Gram feature space is amalgamated with CNN feature space. The computational predictor ?iRNA-Mod-CNN? showed significant improvement over the existing methods, producing 99.56%, 92.39%, and 86.66% of accuracies on m1A, m6A, and m5C benchmark datasets, respectively.
引用
收藏
页数:6
相关论文
共 66 条
[51]   DeepMRMP: A new predictor for multiple types of RNA modification sites using deep learning [J].
Sun, Pingping ;
Chen, Yongbing ;
Liu, Bo ;
Gao, Yanxin ;
Han, Ye ;
He, Fei ;
Ji, Jinchao .
MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2019, 16 (06) :6231-6241
[52]   kDeepBind: Prediction of RNA-Proteins binding sites using convolution neural network and k-gram features [J].
Tahir, Muhammad ;
Tayara, Hilal ;
Hayat, Maqsood ;
Chong, Kil To .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2021, 208
[53]   Prediction of N6-methyladenosine sites using convolution neural network model based on distributed feature representations [J].
Tahir, Muhammad ;
Hayat, Maqsood ;
Chong, Kil To .
NEURAL NETWORKS, 2020, 129 :385-391
[54]   iPseU-CNN: Identifying RNA Pseudouridine Sites Using Convolutional Neural Networks [J].
Tahir, Muhammad ;
Tayara, Hilal ;
Chong, Kul To .
MOLECULAR THERAPY NUCLEIC ACIDS, 2019, 16 :463-470
[55]   A Two-Layer Computational Model for Discrimination of Enhancer and Their Types Using Hybrid Features Pace of Pseudo K-Tuple Nucleotide Composition [J].
Tahir, Muhammad ;
Hayat, Maqsood ;
Khan, Sher Afzal .
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2018, 43 (12) :6719-6727
[56]   Machine learning based identification of protein-protein interactions using derived features of physiochemical properties and evolutionary profiles [J].
Tahir, Muhammad ;
Hayat, Maqsood .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2017, 78 :61-71
[57]   Sequence based predictor for discrimination of enhancer and their types by applying general form of Chou's trinucleotide composition [J].
Tahir, Muhammad ;
Hayat, Maqsood ;
Kabir, Muhammad .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2017, 146 :69-75
[58]   iNuc-STNC: a sequence-based predictor for identification of nucleosome positioning in genomes by extending the concept of SAAC and Chou's PseAAC [J].
Tahir, Muhammad ;
Hayat, Maqsood .
MOLECULAR BIOSYSTEMS, 2016, 12 (08) :2587-2593
[59]   iSS-CNN: Identifying splicing sites using convolution neural network [J].
Tayara, Hilal ;
Tahir, Muhammad ;
Chong, Kil To .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2019, 188 :63-69
[60]   iDrug-Target: predicting the interactions between drug compounds and target proteins in cellular networking via benchmark dataset optimization approach [J].
Xiao, Xuan ;
Min, Jian-Liang ;
Lin, Wei-Zhong ;
Liu, Zi ;
Cheng, Xiang ;
Chou, Kuo-Chen .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2015, 33 (10) :2221-2233