4mCCNN: Identification of N4-Methylcytosine Sites in Prokaryotes Using Convolutional Neural Network

Cited by: 55
Authors
Khanal, Jhabindra [1 ]
Nazari, Iman [1 ]
Tayara, Hilal [1 ]
Chong, Kil To [2 ]
Affiliations
[1] Chonbuk Natl Univ, Dept Elect & Informat Engn, Jeonju 54896, South Korea
[2] Chonbuk Natl Univ, Adv Elect & Informat Res Ctr, Jeonju 54896, South Korea
Funding
National Research Foundation, Singapore;
Keywords
Convolutional neural network; DNA methylation; DNA N4-methylcytosine (4mC); sequence analysis; SEQUENCE-BASED PREDICTOR; DNA METHYLATION; REPAIR; GENES; BASE; RNA;
DOI
10.1109/ACCESS.2019.2943169
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
The epigenetic modification DNA N4-methylcytosine (4mC) plays an important role in DNA expression, repair, and replication. It also plays a crucial role in restriction-modification systems. Accurate prediction of 4mC sites in DNA is needed to understand their functional behavior, which in turn supports both drug discovery and biomedical research. Therefore, an accurate computational model is required. In this work, we present an efficient one-dimensional convolutional neural network (CNN) model, called 4mCCNN, for identifying 4mC sites in Caenorhabditis elegans, Drosophila melanogaster, Arabidopsis thaliana, Escherichia coli, Geoalkalibacter subterraneus, and Geobacter pickeringii. Existing methods identify 4mC sites with machine learning algorithms that rely on handcrafted features, whereas the proposed model extracts the features of 4mC sites from the DNA sequence automatically using the CNN. The proposed model was evaluated on benchmark datasets and achieved generally better results in identifying 4mC sites than the state-of-the-art predictors. The 4mCCNN model is available as a web server at https://home.jbnu.ac.kr/NSCL/4mCCNN.htm
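The abstract does not give the layer configuration of 4mCCNN, so the following is only an illustrative sketch of the general idea it describes: a one-dimensional convolution that scans a DNA sequence and derives features without handcrafted descriptors. The one-hot encoding, the function names (`one_hot`, `conv1d_valid`, `sequence_features`), the ReLU and global max pooling steps, and the example kernel are all assumptions made for illustration, not the authors' implementation.

```python
import numpy as np

ALPHABET = "ACGT"  # assumed base ordering for the one-hot columns


def one_hot(seq):
    """Encode a DNA string as a (length, 4) matrix; unknown bases stay all-zero."""
    index = {base: i for i, base in enumerate(ALPHABET)}
    encoded = np.zeros((len(seq), len(ALPHABET)), dtype=np.float32)
    for pos, base in enumerate(seq.upper()):
        if base in index:
            encoded[pos, index[base]] = 1.0
    return encoded


def conv1d_valid(x, kernels):
    """Slide each kernel along the sequence axis without padding.

    x: (length, 4); kernels: (n_filters, width, 4).
    Returns an array of shape (length - width + 1, n_filters).
    """
    n_filters, width, _ = kernels.shape
    steps = x.shape[0] - width + 1
    out = np.empty((steps, n_filters), dtype=np.float32)
    for t in range(steps):
        window = x[t:t + width]
        # Sum of element-wise products between every kernel and the window.
        out[t] = np.tensordot(kernels, window, axes=([1, 2], [0, 1]))
    return out


def sequence_features(seq, kernels):
    """ReLU followed by global max pooling: one feature per filter."""
    activations = np.maximum(conv1d_valid(one_hot(seq), kernels), 0.0)
    return activations.max(axis=0)


if __name__ == "__main__":
    # Hypothetical kernel that responds to the dinucleotide "CG".
    cg_kernel = one_hot("CG")[np.newaxis, :, :]
    print(sequence_features("ATCGGA", cg_kernel))
```

In a trained network the kernels would be learned from labeled 4mC and non-4mC sequences rather than fixed by hand, and the pooled features would feed a classifier layer.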
Pages: 145455-145461
Page count: 7