Characterization and machine learning prediction of allele-specific DNA methylation

Cited by: 7
Authors
He, Jianlin [1]
Sun, Ming-an [2]
Wang, Zhong [3, 4]
Wang, Qianfei [1]
Li, Qing [3, 4]
Xie, Hehuang [1, 2, 5]
Affiliations
[1] Chinese Acad Sci, Beijing Inst Genom, Lab Genome Variat & Precis Biomed, Beijing 100101, Peoples R China
[2] Virginia Tech, Epigenom & Computat Biol Lab, Virginia Bioinformat Inst, Blacksburg, VA 24060 USA
[3] Sun Yat Sen Univ, Sch Pharmaceut Sci, Guangzhou 510080, Guangdong, Peoples R China
[4] Sun Yat Sen Univ, Ctr Cellular & Struct Biol, Guangzhou 510080, Guangdong, Peoples R China
[5] Virginia Tech, Dept Biol Sci, Blacksburg, VA 24060 USA
Funding
National Science Foundation (USA)
Keywords
Allele-specific DNA methylation; SNP; Epigenetic variation; Logistic regression classifier; GENE; ENHANCERS; SEQUENCE; CPGS;
DOI
10.1016/j.ygeno.2015.09.007
Chinese Library Classification number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification code
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
A large collection of single nucleotide polymorphisms (SNPs) has been identified in the human genome. Currently, the epigenetic influences of SNPs on their neighboring CpG sites remain elusive. A growing body of evidence suggests that locus-specific information, including genomic features and local epigenetic state, may play important roles in the epigenetic readout of SNPs. In this study, we made use of mouse methylomes with known SNPs to develop statistical models for the prediction of SNP-associated allele-specific DNA methylation (ASM). ASM has been classified into parent-of-origin-dependent ASM (P-ASM) and sequence-dependent ASM (S-ASM), which comprises scattered-S-ASM (sS-ASM) and clustered-S-ASM (cS-ASM). We found that P-ASM and cS-ASM CpG sites are both enriched in CpG-rich regions, promoters and exons, while sS-ASM CpG sites are enriched in simple repeats and regions with a high frequency of SNPs. Using Lasso-grouped Logistic Regression (LGLR), we selected 21 out of 282 genomic and methylation-related features that are powerful in distinguishing cS-ASM CpG sites and trained the classifiers with machine learning techniques. Based on 5-fold cross-validation, the logistic regression classifier was found to be the best for cS-ASM prediction, with an accuracy (ACC) of 0.77, an area under the ROC curve (AUC) of 0.84 and a Matthews correlation coefficient (MCC) of 0.54. Lastly, we applied the logistic regression classifier to a human brain methylome and predicted 608 genes associated with cS-ASM. Gene ontology term enrichment analysis indicated that these cS-ASM-associated genes are significantly enriched in the category coding for transcripts with alternative splicing forms. In summary, this study provided an analytical procedure for cS-ASM prediction and shed new light on the understanding of different types of ASM events. Published by Elsevier Inc.
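The evaluation described in the abstract (a sparse logistic regression classifier scored by 5-fold cross-validation with ACC, AUC and MCC) can be illustrated with a minimal sketch. This is not the authors' pipeline: scikit-learn has no grouped Lasso, so a plain L1-penalized logistic regression is swapped in, the data are synthetic (only the shape, 282 features with 21 informative, echoes the abstract), and the helper name `cross_validated_metrics` and the value `C=0.1` are hypothetical choices.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, matthews_corrcoef, roc_auc_score
from sklearn.model_selection import StratifiedKFold
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler


def cross_validated_metrics(X, y, n_splits=5, seed=0):
    """Return mean ACC, AUC and MCC over stratified k-fold cross-validation."""
    folds = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=seed)
    acc, auc, mcc = [], [], []
    for train_idx, test_idx in folds.split(X, y):
        # Assumption: plain L1 penalty as a stand-in for the grouped Lasso.
        model = make_pipeline(
            StandardScaler(),
            LogisticRegression(penalty="l1", solver="liblinear", C=0.1),
        )
        model.fit(X[train_idx], y[train_idx])
        prob = model.predict_proba(X[test_idx])[:, 1]  # P(class 1)
        pred = (prob >= 0.5).astype(int)               # hard labels at 0.5
        acc.append(accuracy_score(y[test_idx], pred))
        auc.append(roc_auc_score(y[test_idx], prob))
        mcc.append(matthews_corrcoef(y[test_idx], pred))
    return {
        "ACC": float(np.mean(acc)),
        "AUC": float(np.mean(auc)),
        "MCC": float(np.mean(mcc)),
    }


# Synthetic stand-in for CpG-site features; not real methylome data.
X, y = make_classification(
    n_samples=600, n_features=282, n_informative=21, random_state=0
)
metrics = cross_validated_metrics(X, y)
print(metrics)
```

Scaling is fitted inside each fold so that no information from the held-out fold leaks into training. The numbers printed for the synthetic data are not comparable to the values reported in the abstract.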
Pages: 331-339
Page count: 9
Related papers
50 in total
  • [1] Bumblebee Workers Show Differences in Allele-Specific DNA Methylation and Allele-Specific Expression
    Marshall, Hollie
    Jones, Alun R. C.
    Lonsdale, Zoe N.
    Mallon, Eamonn B.
    GENOME BIOLOGY AND EVOLUTION, 2020, 12 (08) : 1471 - 1481
  • [2] Allele-specific DNA methylation: beyond imprinting
    Tycko, Benjamin
    HUMAN MOLECULAR GENETICS, 2010, 19 : R210 - R220
  • [3] Allele-specific extension on microarray for DNA methylation analysis
    Wu, Zhixiang
    Luo, Junfeng
    Ge, Qinyu
    Zhang, Dingdong
    Wang, Yan
    Jia, Chao
    Lu, Zuhong
    ANALYTICA CHIMICA ACTA, 2007, 603 (02) : 199 - 204
  • [4] Genomic landscape of human allele-specific DNA methylation
    Fang, Fang
    Hodges, Emily
    Molaro, Antoine
    Dean, Matthew
    Hannon, Gregory J.
    Smith, Andrew D.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2012, 109 (19) : 7332 - 7337
  • [5] ANALYSIS OF ALLELE-SPECIFIC METHYLATION
    BEDFORD, MT
    VANHELDEN, PD
    TRENDS IN GENETICS, 1990, 6 (05) : 140 - 141
  • [6] Profiling total and allele-specific DNA methylation in human placentas
    Kerkel, K.
    Jiang, L.
    Yuan, E.
    Spadola, A.
    Morris, M.
    Haghighi, V.
    Tycko, B.
    PLACENTA, 2007, 28 (8-9) : A7 - A7
  • [7] PARamrfinder: detecting allele-specific DNA methylation on multicore clusters
    Fernandez-Fraga, Alejandro
    Gonzalez-Dominguez, Jorge
    Martin, Maria J.
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (10) : 14573 - 14599
  • [8] ALLELE-SPECIFIC DNA METHYLATION IN THE BRAIN: RELEVANCE TO PSYCHIATRIC GWAS
    Gagliano, Sarah
    Ptak, Carolyn
    Mak, Denise
    Shamsi, Mehrdad
    Gabriel, Oh
    Boutros, Paul
    Knight, Jo
    Petronis, Art
    EUROPEAN NEUROPSYCHOPHARMACOLOGY, 2017, 27 : S172 - S172
  • [9] Rapid and quantitative method of allele-specific DNA methylation analysis
    Wong, Hui-Lee
    Lyun, Hyang-Min
    Kwan, Jennifer M.
    Campan, Mihaela
    Ingles, Sue A.
    Laird, Peter W.
    Yang, Allen S.
    BIOTECHNIQUES, 2006, 41 (06) : 734 - 739
  • [10] Atlas of imprinted and allele-specific DNA methylation in the human body
    Rosenski, Jonathan
    Peretz, Ayelet
    Magenheim, Judith
    Loyfer, Netanel
    Shemer, Ruth
    Glaser, Benjamin
    Dor, Yuval
    Kaplan, Tommy
    NATURE COMMUNICATIONS, 2025, 16 (01)