iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice

被引:61
|
作者
Lv, Hao [1 ]
Dao, Fu-Ying [1 ]
Guan, Zheng-Xing [1 ]
Zhang, Dan [1 ]
Tan, Jiu-Xin [1 ]
Zhang, Yong [1 ]
Chen, Wei [2 ]
Lin, Hao [1 ]
机构
[1] Univ Elect Sci & Technol China, Ctr Informat Biol, Sch Life Sci & Technol, Minist Educ,Key Lab Neuroinformat, Chengdu, Sichuan, Peoples R China
[2] Chengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu, Sichuan, Peoples R China
关键词
N6-methyladenine; mono-nucleotide binary encoding; random forest; cross-validation; web-server; SEQUENCE-BASED PREDICTOR; DNA METHYLATION; UPDATED RESOURCE; PROTEIN; N-6-ADENINE; IDENTIFICATION; GENES; MODEL; V2.0;
D O I
10.3389/fgene.2019.00793
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
DNA N6-methyladenine (6mA) is a dominant DNA modification form and involved in many biological functions. The accurate genome-wide identification of 6mA sites may increase understanding of its biological functions. Experimental methods for 6mA detection in eukaryotes genome are laborious and expensive. Therefore, it is necessary to develop computational methods to identify 6mA sites on a genomic scale, especially for plant genomes. Based on this consideration, the study aims to develop a machine learning-based method of predicting 6mA sites in the rice genome. We initially used mononucleotide binary encoding to formulate positive and negative samples. Subsequently, the machine learning algorithm named Random Forest was utilized to perform the classification for identifying 6mA sites. Our proposed method could produce an area under the receiver operating characteristic curve of 0.964 with an overall accuracy of 0.917, as indicated by the fivefold cross-validation test. Furthermore, an independent dataset was established to assess the generalization ability of our method. Finally, an area under the receiver operating characteristic curve of 0.981 was obtained, suggesting that the proposed method had good performance of predicting 6mA sites in the rice genome. For the convenience of retrieving 6mA sites, on the basis of the computational method, we built a freely accessible web server named iDNA6mA-Rice at http://lin-group.cn/server/iDNA6mA-Rice.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] iDNA6mA-Rice-DL: A local web server for identifying DNA N6-methyladenine sites in rice genome by deep learning method
    He, Shiqian
    Kong, Liang
    Chen, Jing
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2021, 19 (05)
  • [2] SNNRice6mA: A Deep Learning Method for Predicting DNA N6-Methyladenine Sites in Rice Genome
    Yu, Haitao
    Dai, Zhiming
    FRONTIERS IN GENETICS, 2019, 10
  • [3] A machine learning approach to identify N6-methyladenine sites in the rice genome
    Kong, Linghua
    Zhang, Zhongwang
    Zhao, Xueda
    Journal of Biotech Research, 2023, 14 : 115 - 123
  • [4] iDNA6mA (5-step rule): Identification of DNA N6-methyladenine sites in the rice genome by intelligent computational model via Chou's 5-step rule
    Tahir, Muhammad
    Tayara, Hilal
    Chong, Kil To
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2019, 189 : 96 - 101
  • [5] GC6mA-Pred: A deep learning approach to identify DNA N6-methyladenine sites in the rice genome
    Cai, Jianhua
    Xiao, Guobao
    Su, Ran
    METHODS, 2022, 204 : 14 - 21
  • [6] i6mA-DNCP: Computational Identification of DNA N6-Methyladenine Sites in the Rice Genome Using Optimized Dinucleotide-Based Features
    Kong, Liang
    Zhang, Lichao
    GENES, 2019, 10 (10)
  • [7] 6mA-Finder: a novel online tool for predicting DNA N6-methyladenine sites in genomes
    Xu, Haodong
    Hu, Ruifeng
    Jia, Peilin
    Zhao, Zhongming
    BIOINFORMATICS, 2020, 36 (10) : 3257 - 3259
  • [8] SpineNet-6mA: A Novel Deep Learning Tool for Predicting DNA N6-Methyladenine Sites in Genomes
    Abbas, Zeeshan
    Tayara, Hilal
    Chong, Kil To
    IEEE ACCESS, 2020, 8 : 201450 - 201457
  • [9] i6mA-Pred: identifying DNA N6 - methyladenine sites in the rice genome
    Chen, Wei
    Lv, Hao
    Nie, Fulei
    Lin, Hao
    BIOINFORMATICS, 2019, 35 (16) : 2796 - 2800
  • [10] 6mA-Pred: identifying DNA N6-methyladenine sites based on deep learning
    Huang, Qianfei
    Zhou, Wenyang
    Guo, Fei
    Xu, Lei
    Zhang, Lichao
    PEERJ, 2021, 9