Detection of DNA N6-Methyladenine Modification through SMRT-seq Features and Machine Learning Model

被引:0
作者
Guo, Yichu [1 ]
Zhang, Yixuan [2 ]
Liu, Xiaoqing [3 ]
He, Pingan [4 ]
Zeng, Yuni [2 ]
Dai, Qi [1 ]
机构
[1] Zhejiang Sci Tech Univ, Coll Life Sci, Hangzhou 310018, Peoples R China
[2] Zhejiang Sci Tech Univ, Coll Comp Sci, Hangzhou 310018, Peoples R China
[3] Hangzhou Dianzi Univ, Coll Sci, Hangzhou 310018, Peoples R China
[4] Zhejiang Sci Tech Univ, Coll Sci, Hangzhou 310018, Peoples R China
基金
中国国家自然科学基金;
关键词
DNA; 6mA; SMRT-seq features; Machine learning; MeDIP-seq; Detection method; SINGLE-MOLECULE; METHYLATION; N-6-ADENINE; CLASSIFICATION; GENOME; SITES; WIDE;
D O I
10.2174/0115748936300671240523044154
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Introduction N6-methyldeoxyadenine (6mA) is the most prevalent DNA modification in both prokaryotes and eukaryotes. While single-molecule real-time sequencing (SMRT-seq) can detect 6mA events at the individual nucleotide level, its practical application is hindered by a high rate of false positives.Methods We propose a computational model for identifying DNA 6mA that incorporates comprehensive site features from SMRT-seq and employs machine learning classifiers.Results The results demonstrate that 99.54% and 96.55% of the identified DNA 6mA instances in C.reinhardtii correspond with motifs and peak regions identified by methylated DNA immunoprecipitation sequencing (MeDIP-seq), respectively. Compared to SMRT-seq, the proportion of predicted DNA 6mA instances within MeDIP-seq peak regions increases by 2% to 70% across the six bacterial strains.Conclusion Our proposed method effectively reduces the false-positive rate in DNA 6mA prediction.
引用
收藏
页码:392 / 401
页数:10
相关论文
共 28 条
[1]   High Utility Periodic Frequent Pattern Mining in Multiple Sequences [J].
Chen, Chien-Ming ;
Zhang, Zhenzhou ;
Wu, Jimmy Ming-Tai ;
Lakshmanna, Kuruva .
CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2023, 137 (01) :733-759
[2]   Real-Time DNA Sequencing from Single Polymerase Molecules [J].
Eid, John ;
Fehr, Adrian ;
Gray, Jeremy ;
Luong, Khai ;
Lyle, John ;
Otto, Geoff ;
Peluso, Paul ;
Rank, David ;
Baybayan, Primo ;
Bettman, Brad ;
Bibillo, Arkadiusz ;
Bjornson, Keith ;
Chaudhuri, Bidhan ;
Christians, Frederick ;
Cicero, Ronald ;
Clark, Sonya ;
Dalal, Ravindra ;
deWinter, Alex ;
Dixon, John ;
Foquet, Mathieu ;
Gaertner, Alfred ;
Hardenbol, Paul ;
Heiner, Cheryl ;
Hester, Kevin ;
Holden, David ;
Kearns, Gregory ;
Kong, Xiangxu ;
Kuse, Ronald ;
Lacroix, Yves ;
Lin, Steven ;
Lundquist, Paul ;
Ma, Congcong ;
Marks, Patrick ;
Maxham, Mark ;
Murphy, Devon ;
Park, Insil ;
Pham, Thang ;
Phillips, Michael ;
Roy, Joy ;
Sebra, Robert ;
Shen, Gene ;
Sorenson, Jon ;
Tomaney, Austin ;
Travers, Kevin ;
Trulson, Mark ;
Vieceli, John ;
Wegener, Jeffrey ;
Wu, Dawn ;
Yang, Alicia ;
Zaccarin, Denis .
SCIENCE, 2009, 323 (5910) :133-138
[3]   Genome-wide mapping of methylated adenine residues in pathogenic Escherichia coli using single-molecule real-time sequencing [J].
Fang, Gang ;
Munera, Diana ;
Friedman, David I. ;
Mandlik, Anjali ;
Chao, Michael C. ;
Banerjee, Onureena ;
Feng, Zhixing ;
Losic, Bojan ;
Mahajan, Milind C. ;
Jabado, Omar J. ;
Deikus, Gintaras ;
Clark, Tyson A. ;
Luong, Khai ;
Murray, Iain A. ;
Davis, Brigid M. ;
Keren-Paz, Alona ;
Chess, Andrew ;
Roberts, Richard J. ;
Korlach, Jonas ;
Turner, Steve W. ;
Kumar, Vipin ;
Waldor, Matthew K. ;
Schadt, Eric E. .
NATURE BIOTECHNOLOGY, 2012, 30 (12) :1232-+
[4]   Mammalian DNA N6-methyladenosine Challenges and new insights [J].
Feng, Xinran ;
He, Chuan .
MOLECULAR CELL, 2023, 83 (03) :343-351
[5]   Detecting DNA Modifications from SMRT Sequencing Data by Modeling Sequence Context Dependence of Polymerase Kinetic [J].
Feng, Zhixing ;
Fang, Gang ;
Korlach, Jonas ;
Clark, Tyson ;
Khai Luong ;
Zhang, Xuegong ;
Wong, Wing ;
Schadt, Eric .
PLOS COMPUTATIONAL BIOLOGY, 2013, 9 (03)
[6]  
Flusberg BA, 2010, NAT METHODS, V7, P461, DOI [10.1038/NMETH.1459, 10.1038/nmeth.1459]
[7]   N6-Methyldeoxyadenosine Marks Active Transcription Start Sites in Chlamydomonas [J].
Fu, Ye ;
Luo, Guan-Zheng ;
Chen, Kai ;
Deng, Xin ;
Yu, Miao ;
Han, Dali ;
Hao, Ziyang ;
Liu, Jianzhao ;
Lu, Xingyu ;
Dore, Louis C. ;
Weng, Xiaocheng ;
Ji, Quanjiang ;
Mets, Laurens ;
He, Chuan .
CELL, 2015, 161 (04) :879-892
[8]   DNA Methylation on N6-Adenine in C-elegans [J].
Greer, Eric Lieberman ;
Blanco, Mario Andres ;
Gu, Lei ;
Sendinc, Erdem ;
Liu, Jianzhao ;
Aristizabal-Corrales, David ;
Hsu, Chih-Hung ;
Aravind, L. ;
He, Chuan ;
Shi, Yang .
CELL, 2015, 161 (04) :868-878
[9]   Gene selection for cancer classification using support vector machines [J].
Guyon, I ;
Weston, J ;
Barnhill, S ;
Vapnik, V .
MACHINE LEARNING, 2002, 46 (1-3) :389-422
[10]   Tet Proteins Can Convert 5-Methylcytosine to 5-Formylcytosine and 5-Carboxylcytosine [J].
Ito, Shinsuke ;
Shen, Li ;
Dai, Qing ;
Wu, Susan C. ;
Collins, Leonard B. ;
Swenberg, James A. ;
He, Chuan ;
Zhang, Yi .
SCIENCE, 2011, 333 (6047) :1300-1303