Characterization and machine learning prediction of allele-specific DNA methylation

被引:7
|
作者
He, Jianlin [1 ]
Sun, Ming-an [2 ]
Wang, Zhong [3 ,4 ]
Wang, Qianfei [1 ]
Li, Qing [3 ,4 ]
Xie, Hehuang [1 ,2 ,5 ]
机构
[1] Chinese Acad Sci, Beijing Inst Genom, Lab Genome Variat & Precis Biomed, Beijing 100101, Peoples R China
[2] Virginia Tech, Epigenom & Computat Biol Lab, Virginia Bioinformat Inst, Blacksburg, VA 24060 USA
[3] Sun Yat Sen Univ, Sch Pharmaceut Sci, Guangzhou 510080, Guangdong, Peoples R China
[4] Sun Yat Sen Univ, Ctr Cellular & Struct Biol, Guangzhou 510080, Guangdong, Peoples R China
[5] Virginia Tech, Dept Biol Sci, Blacksburg, VA 24060 USA
基金
美国国家科学基金会;
关键词
Allele-specific DNA methylation; SNP; Epigenetic variation; Logistic regression classifier; GENE; ENHANCERS; SEQUENCE; CPGS;
D O I
10.1016/j.ygeno.2015.09.007
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
A large collection of Single Nucleotide Polymorphisms (SNPs) has been identified in the human genome. Currently, the epigenetic influences of SNPs on their neighboring CpG sites remain elusive. A growing body of evidence suggests that locus-specific information, including genomic features and local epigenetic state, may play important roles in the epigenetic readout of SNPs. In this study, we made use of mouse methylomes with known SNPs to develop statistical models for the prediction of SNP associated allele-specific DNA methylation (ASM). ASM has been classified into parent-of-origin dependent ASM (P-ASM) and sequence-dependent ASM (S-ASM), which comprises scattered-S-ASM (sS-ASM) and clustered-S-ASM (cS-ASM). We found that P-ASM and cS-ASM CpG sites are both enriched in CpG rich regions, promoters and exons, while sS-ASM CpG sites are enriched in simple repeat and regions with high frequent SNP occurrence. Using Lasso-grouped Logistic Regression (LGLR), we selected 21 out of 282 genomic and methylation related features that are powerful in distinguishing cS-ASM CpG sites and trained the classifiers with machine learning techniques. Based on 5-fold cross-validation, the logistic regression classifier was found to be the best for cS-ASM prediction with an ACC of 0.77, an AUC of 0.84 and an MCC of 0.54. Lastly, we applied the logistic regression classifier on human brain methylome and predicted 608 genes associated with cS-ASM. Gene ontology term enrichment analysis indicated that these cS-ASM associated genes are significantly enriched in the category coding for transcripts with alternative splicing forms. In summary, this study provided an analytical procedure for cS-ASM prediction and shed new light on the understanding of different types of ASM events. Published by Elsevier Inc.
引用
收藏
页码:331 / 339
页数:9
相关论文
共 50 条
  • [41] DAMEfinder: a method to detect differential allele-specific methylation
    Orjuela, Stephany
    Machlab, Dania
    Menigatti, Mirco
    Marra, Giancarlo
    Robinson, Mark D.
    EPIGENETICS & CHROMATIN, 2020, 13 (01)
  • [42] Comparative Anatomy of Chromosomal Domains with Imprinted and Non-Imprinted Allele-Specific DNA Methylation
    Paliwal, Anupam
    Temkin, Alexis M.
    Kerkel, Kristi
    Yale, Alexander
    Yotova, Iveta
    Drost, Natalia
    Lax, Simon
    Nhan-Chang, Chia-Ling
    Powell, Charles
    Borczuk, Alain
    Aviv, Abraham
    Wapner, Ronald
    Chen, Xiaowei
    Nagy, Peter L.
    Schork, Nicholas
    Do, Catherine
    Torkamani, Ali
    Tycko, Benjamin
    PLOS GENETICS, 2013, 9 (08):
  • [43] Allele-specific DNA methylation in mouse strains is mainly determined by cis-acting sequences
    Schilling, Elmar
    El Chartouni, Carol
    Rehli, Michael
    GENOME RESEARCH, 2009, 19 (11) : 2028 - 2035
  • [44] DNA methylation maintains allele-specific KTR gene expression in human natural killer cells
    Chan, HW
    Kurago, ZB
    Stewart, CA
    Wilson, MJ
    Martin, MP
    Mace, BE
    Carrington, M
    Trowsdale, J
    Lutz, CT
    JOURNAL OF EXPERIMENTAL MEDICINE, 2003, 197 (02): : 245 - 255
  • [45] Crosstalk of Genetic Variants, Allele-Specific DNA Methylation, and Environmental Factors for Complex Disease Risk
    Wang, Huishan
    Lou, Dan
    Wang, Zhibin
    FRONTIERS IN GENETICS, 2019, 9
  • [46] ALLELE-SPECIFIC DNA METHYLATION OF DISEASE SUSCEPTIBILITY GENES IN THE JAPANESE PATIENTS WITH INFLAMMATORY BOWEL DISEASE
    Chiba, Hirofumi
    Kakuta, Yoichi
    Kinouchi, Yoshitaka
    Naito, Takeo
    Onodera, Motoyuki
    Moroi, Rintaro
    Kuroha, Masatake
    Kanazawa, Yoshitake
    Kimura, Tomoya
    Endo, Katsuya
    Negoro, Kenichi
    Koike, Tomoyuki
    Masamune, Atsushi
    Shimosegawa, Tooru
    GASTROENTEROLOGY, 2018, 154 (06) : S1020 - S1021
  • [47] ALLELE-SPECIFIC DNA METHYLATION AFFECTING THE STEAP3 GENE IN RENAL CELL CARCINOMAS
    Rigo, Mauricio Menagati
    Castano, Angelica
    Salas, Martha C.
    Tariq, Mariam
    Lerman, Joshua
    Hartley, Sophia
    Nagy, Fidi
    Nagy, Peter L.
    Raver, Michael
    Chow, Kar
    Yerram, Nitin
    Stifelman, Michael D.
    Tycko, Benjamin
    JOURNAL OF UROLOGY, 2024, 211 (05): : E1066 - E1066
  • [48] Establishment of paternal allele-specific DNA methylation at the imprinted mouse Gtl2 locus
    Nowak, Kamila
    Stein, Geneva
    Powell, Elizabeth
    He, Lu Mei
    Naik, Snehal
    Morris, Jane
    Marlow, Sara
    Davis, Tamara L.
    EPIGENETICS, 2011, 6 (08) : 1012 - 1020
  • [49] Allele-specific DNA methylation of disease susceptibility genes in Japanese patients with inflammatory bowel disease
    Chiba, Hirofumi
    Kakuta, Yoichi
    Kinouchi, Yoshitaka
    Kawai, Yosuke
    Watanabe, Kazuhiro
    Nagao, Munenori
    Naito, Takeo
    Onodera, Motoyuki
    Moroi, Rintaro
    Kuroha, Masatake
    Kanazawa, Yoshitake
    Kimura, Tomoya
    Shiga, Hisashi
    Endo, Katsuya
    Negoro, Kenichi
    Nagasaki, Masao
    Unno, Michiaki
    Shimosegawa, Tooru
    PLOS ONE, 2018, 13 (03):
  • [50] Allele-specific methylation of type 1 diabetes susceptibility genes
    Kindt, Alida S. D.
    Fuerst, Rainer W.
    Knoop, Jan
    Laimighofer, Michael
    Telieps, Tanja
    Hippich, Markus
    Woerheide, Maria A.
    Wahl, Simone
    Wilson, Rory
    Sedlmeier, Eva-Maria
    Hommel, Angela
    Todd, John A.
    Krumsiek, Jan
    Ziegler, Anette-G.
    Bonifacio, Ezio
    JOURNAL OF AUTOIMMUNITY, 2018, 89 : 63 - 74