Characterization and machine learning prediction of allele-specific DNA methylation

被引:7
|
作者
He, Jianlin [1 ]
Sun, Ming-an [2 ]
Wang, Zhong [3 ,4 ]
Wang, Qianfei [1 ]
Li, Qing [3 ,4 ]
Xie, Hehuang [1 ,2 ,5 ]
机构
[1] Chinese Acad Sci, Beijing Inst Genom, Lab Genome Variat & Precis Biomed, Beijing 100101, Peoples R China
[2] Virginia Tech, Epigenom & Computat Biol Lab, Virginia Bioinformat Inst, Blacksburg, VA 24060 USA
[3] Sun Yat Sen Univ, Sch Pharmaceut Sci, Guangzhou 510080, Guangdong, Peoples R China
[4] Sun Yat Sen Univ, Ctr Cellular & Struct Biol, Guangzhou 510080, Guangdong, Peoples R China
[5] Virginia Tech, Dept Biol Sci, Blacksburg, VA 24060 USA
基金
美国国家科学基金会;
关键词
Allele-specific DNA methylation; SNP; Epigenetic variation; Logistic regression classifier; GENE; ENHANCERS; SEQUENCE; CPGS;
D O I
10.1016/j.ygeno.2015.09.007
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
A large collection of Single Nucleotide Polymorphisms (SNPs) has been identified in the human genome. Currently, the epigenetic influences of SNPs on their neighboring CpG sites remain elusive. A growing body of evidence suggests that locus-specific information, including genomic features and local epigenetic state, may play important roles in the epigenetic readout of SNPs. In this study, we made use of mouse methylomes with known SNPs to develop statistical models for the prediction of SNP associated allele-specific DNA methylation (ASM). ASM has been classified into parent-of-origin dependent ASM (P-ASM) and sequence-dependent ASM (S-ASM), which comprises scattered-S-ASM (sS-ASM) and clustered-S-ASM (cS-ASM). We found that P-ASM and cS-ASM CpG sites are both enriched in CpG rich regions, promoters and exons, while sS-ASM CpG sites are enriched in simple repeat and regions with high frequent SNP occurrence. Using Lasso-grouped Logistic Regression (LGLR), we selected 21 out of 282 genomic and methylation related features that are powerful in distinguishing cS-ASM CpG sites and trained the classifiers with machine learning techniques. Based on 5-fold cross-validation, the logistic regression classifier was found to be the best for cS-ASM prediction with an ACC of 0.77, an AUC of 0.84 and an MCC of 0.54. Lastly, we applied the logistic regression classifier on human brain methylome and predicted 608 genes associated with cS-ASM. Gene ontology term enrichment analysis indicated that these cS-ASM associated genes are significantly enriched in the category coding for transcripts with alternative splicing forms. In summary, this study provided an analytical procedure for cS-ASM prediction and shed new light on the understanding of different types of ASM events. Published by Elsevier Inc.
引用
收藏
页码:331 / 339
页数:9
相关论文
共 50 条
  • [31] Detection of haplotype-dependent allele-specific DNA methylation in WGBS data
    Abante, J.
    Fang, Y.
    Feinberg, A. P.
    Goutsias, J.
    NATURE COMMUNICATIONS, 2020, 11 (01)
  • [32] Inter- and intra-individual variation in allele-specific DNA methylation
    Turan, Nahid
    Chalian, Raffi
    Katari, Sunita
    Asadpour, Halleh
    Coutifaris, Christos
    Sapienza, Carmen
    BIOLOGY OF REPRODUCTION, 2008, : 192 - 192
  • [33] ALLELE-SPECIFIC DNA IDENTITY PATTERNS
    WAINSCOAT, JS
    PILKINGTON, S
    PETO, TEA
    BELL, JI
    HIGGS, DR
    HUMAN GENETICS, 1987, 75 (04) : 384 - 387
  • [34] Investigating the potential of single-cell DNA methylation data to detect allele-specific methylation and imprinting
    Johnson, Nicholas D.
    Cutler, David J.
    Conneely, Karen N.
    AMERICAN JOURNAL OF HUMAN GENETICS, 2024, 111 (04) : 654 - 667
  • [35] Allele-specific DNA demethylation editing leads to stable upregulation of allele-specific gene expression
    Rajaram, Nivethika
    Benzler, Katharina
    Bashtrykov, Pavel
    Jeltsch, Albert
    ISCIENCE, 2024, 27 (10)
  • [36] MIRA-SNuPE, a quantitative, multiplex method for measuring allele-specific DNA methylation
    Lee, Dong-Hoon
    Tran, Diana A.
    Singh, Purnima
    Oates, Nathan
    Rivas, Guillermo E.
    Larson, Garrett P.
    Pfeifer, Gerd P.
    Szabo, Piroska E.
    EPIGENETICS, 2011, 6 (02) : 212 - 223
  • [37] Mapping Allele-Specific DNA Methylation: A New Tool for Maximizing Information from GWAS
    Tycko, Benjamin
    AMERICAN JOURNAL OF HUMAN GENETICS, 2010, 86 (02) : 109 - 112
  • [38] Breast cancer family history and allele-specific DNA methylation in the legacy girls study
    Wu, Hui-Chen
    Do, Catherine
    Andrulis, Irene L.
    John, Esther M.
    Daly, Mary B.
    Buys, Saundra S.
    Chung, Wendy K.
    Knight, Julia A.
    Bradbury, Angela R.
    Keegan, Theresa H. M.
    Schwartz, Lisa
    Krupska, Izabela
    Miller, Rachel L.
    Santella, Regina M.
    Tycko, Benjamin
    Terry, Mary Beth
    EPIGENETICS, 2018, 13 (03) : 240 - 250
  • [39] DAMEfinder: a method to detect differential allele-specific methylation
    Stephany Orjuela
    Dania Machlab
    Mirco Menigatti
    Giancarlo Marra
    Mark D. Robinson
    Epigenetics & Chromatin, 13
  • [40] A Resource for the Allele-Specific Analysis of DNA Methylation at Multiple Genomically Imprinted Loci in Mice
    Wasson, Jadiel A.
    Birol, Onur
    Katz, David J.
    G3-GENES GENOMES GENETICS, 2018, 8 (01): : 91 - 103