ENet-6mA: Identification of 6mA Modification Sites in Plant Genomes Using ElasticNet and Neural Networks

被引:15
作者
Abbas, Zeeshan [1 ,2 ]
Tayara, Hilal [3 ]
Chong, Kil To [1 ,4 ]
机构
[1] Jeonbuk Natl Univ, Dept Elect & Informat Engn, Jeonju 54896, South Korea
[2] Air Univ, Inst Av & Aeronaut IAA, Islamabad 44000, Pakistan
[3] Jeonbuk Natl Univ, Sch Int Engn & Sci, Jeonju 54896, South Korea
[4] Jeonbuk Natl Univ, Adv Elect & Informat Res Ctr, Jeonju 54896, South Korea
基金
新加坡国家研究基金会;
关键词
bioinformatics; DNA methylation; ElasticNet; epigenomics; epigenome engineering; neural networks; DNA N6-METHYLADENINE SITES; ALZHEIMERS-DISEASE; METHYLATION; N-6-ADENINE; MODEL;
D O I
10.3390/ijms23158314
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
N6-methyladenine (6mA) has been recognized as a key epigenetic alteration that affects a variety of biological activities. Precise prediction of 6mA modification sites is essential for understanding the logical consistency of biological activity. There are various experimental methods for identifying 6mA modification sites, but in silico prediction has emerged as a potential option due to the very high cost and labor-intensive nature of experimental procedures. Taking this into consideration, developing an efficient and accurate model for identifying N6-methyladenine is one of the top objectives in the field of bioinformatics. Therefore, we have created an in silico model for the classification of 6mA modifications in plant genomes. ENet-6mA uses three encoding methods, including one-hot, nucleotide chemical properties (NCP), and electron-ion interaction potential (EIIP), which are concatenated and fed as input to ElasticNet for feature reduction, and then the optimized features are given directly to the neural network to get classified. We used a benchmark dataset of rice for five-fold cross-validation testing and three other datasets from plant genomes for cross-species testing purposes. The results show that the model can predict the N6-methyladenine sites very well, even cross-species. Additionally, we separated the datasets into different ratios and calculated the performance using the area under the precision-recall curve (AUPRC), achieving 0.81, 0.79, and 0.50 with 1:10 (positive:negative) samples for F. vesca, R. chinensis, and A. thaliana, respectively.
引用
收藏
页数:9
相关论文
共 38 条
[1]   ZayyuNet - A Unified Deep Learning Model for the Identification of Epigenetic Modifications Using Raw Genomic Sequences [J].
Abbas, Zeeshan ;
Tayara, Hilal ;
Chong, Kil To .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (04) :2533-2544
[2]   TS-m6A-DL: Tissue-specific identification of N6-methyladenosine sites using a universal deep learning model [J].
Abbas, Zeeshan ;
Tayara, Hilal ;
Zou, Quan ;
Chong, Kil To .
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 :4619-4625
[3]   SpineNet-6mA: A Novel Deep Learning Tool for Predicting DNA N6-Methyladenine Sites in Genomes [J].
Abbas, Zeeshan ;
Tayara, Hilal ;
Chong, Kil To .
IEEE ACCESS, 2020, 8 :201450-201457
[4]   Identification of a DNA N6-Adenine Methyltransferase Complex and Its Impact on Chromatin Organization [J].
Beh, Leslie Y. ;
Debelouchina, Galia T. ;
Clay, Derek M. ;
Thompson, Robert E. ;
Lindblad, Kelsi A. ;
Hutton, Elizabeth R. ;
Brecht, John R. ;
Sebra, Robert P. ;
Muir, Tom W. ;
Landweber, Laura F. .
CELL, 2019, 177 (07) :1781-+
[5]   Epigenetic gene regulation in the bacterial world [J].
Casadesus, Josep ;
Low, David .
MICROBIOLOGY AND MOLECULAR BIOLOGY REVIEWS, 2006, 70 (03) :830-+
[6]   i6mA-Pred: identifying DNA N6 - methyladenine sites in the rice genome [J].
Chen, Wei ;
Lv, Hao ;
Nie, Fulei ;
Lin, Hao .
BIOINFORMATICS, 2019, 35 (16) :2796-2800
[7]   iDNA4mC: identifying DNA N4-methylcytosine sites based on nucleotide chemical properties [J].
Chen, Wei ;
Yang, Hui ;
Feng, Pengmian ;
Ding, Hui ;
Lin, Hao .
BIOINFORMATICS, 2017, 33 (22) :3518-3523
[8]   A comprehensive analysis of resting state fMRI measures to classify individual patients with Alzheimer's disease [J].
de Vos, Frank ;
Koini, Marisa ;
Schouten, Tijn M. ;
Seiler, Stephan ;
van der Grond, Jeroen ;
Lechner, Anita ;
Schmidt, Reinhold ;
de Rooij, Mark ;
Rombouts, Serge A. R. B. .
NEUROIMAGE, 2018, 167 :62-72
[9]   Combining multiple anatomical MRI measures improves Alzheimer's disease classification [J].
de Vos, Frank ;
Schouten, Tijn M. ;
Hafkemeijer, Anne ;
Dopper, Elise G. P. ;
van Swieten, John C. ;
de Rooij, Mark ;
van der Grond, Jeroen ;
Rombouts, Serge A. R. B. .
HUMAN BRAIN MAPPING, 2016, 37 (05) :1920-1929
[10]   Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity [J].
Edger, Patrick P. ;
VanBuren, Robert ;
Colle, Marivi ;
Poorten, Thomas J. ;
Wai, Ching Man ;
Niederhuth, Chad E. ;
Alger, Elizabeth I. ;
Ou, Shujun ;
Acharya, Charlotte B. ;
Wang, Jie ;
Callow, Pete ;
McKain, Michael R. ;
Shi, Jinghua ;
Collier, Chad ;
Xiong, Zhiyong ;
Mower, Jeffrey P. ;
Slovin, Janet P. ;
Hytonen, Timo ;
Jiang, Ning ;
Childs, Kevin L. ;
Knapp, Steven J. .
GIGASCIENCE, 2017, 7 (02)