Predicting CpG methylation levels by integrating Infinium HumanMethylation450 BeadChip array data

被引:23
作者
Fan, Shicai [1 ,2 ]
Huang, Kang [1 ]
Ai, Rizi [2 ]
Wang, Mengchi [2 ]
Wang, Wei [2 ]
机构
[1] Univ Elect Sci & Technol China, Sch Automat Engn, Chengdu 610054, Peoples R China
[2] Univ Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USA
基金
中国国家自然科学基金;
关键词
DNA methylation; CpG loci; Prediction; 450K array data; DNA; EPIGENOME; ISLANDS; SITES; MARKS;
D O I
10.1016/j.ygeno.2016.02.005
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The Infinium HumanMethylation450 BeadChip array, referred as 450K array hereinafter, has been widely adopted as an affordable technique to determine DNA methylation. Tens of thousands of data have been generated on diverse cell types and patient tissues, which have provided great insight into understanding the crucial roles of epigenetic modifications in many biological processes and diseases. The limitation of this technique is its coverage, which measures methylation levels of about 450,000 CpGs, accounting for about 1.6% of all CpGs in the human genome. In the present study we developed and compared computational models to significantly expand the coverage of Illumina 450K (similar to 11 folds). Using the whole genome bisulfite sequencing and Illumina 450K data in the human H1 embryonic stem cell, we showed that the predicted and measured methylation levels were well correlated. Our proposed model showed superior prediction accuracies compared to the existing methods on the same dataset. When applied to predict the DNA methylome on other cells, our proposed model achieved comparable performance in cross-validations, which indicates the generalizibility of the method. Our method would thus be invaluable to maximize the usage of the existing data. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:132 / 137
页数:6
相关论文
共 27 条
  • [1] Genomic Imprinting: A Mammalian Epigenetic Discovery Model
    Barlow, Denise P.
    [J]. ANNUAL REVIEW OF GENETICS, VOL 45, 2011, 45 : 379 - 403
  • [2] A decade of exploring the cancer epigenome - biological and translational implications
    Baylin, Stephen B.
    Jones, Peter A.
    [J]. NATURE REVIEWS CANCER, 2011, 11 (10) : 726 - 734
  • [3] Prediction of methylated CpGs in DNA sequences using a support vector machine
    Bhasin, M
    Zhang, H
    Reinherz, EL
    Reche, PA
    [J]. FEBS LETTERS, 2005, 579 (20) : 4302 - 4308
  • [4] DNA methylation patterns and epigenetic memory
    Bird, A
    [J]. GENES & DEVELOPMENT, 2002, 16 (01) : 6 - 21
  • [5] CpG island mapping by epigenome prediction
    Bock, Christoph
    Walter, Joern
    Paulsen, Martina
    Lengauer, Thomas
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2007, 3 (06) : 1055 - 1070
  • [6] CpG island methylation in human lymphocytes is highly correlated with DNA sequence, repeats, and predicted DNA structure
    Bock, Christoph
    Paulsen, Martina
    Tierling, Sascha
    Mikeska, Thomas
    Lengauer, Thomas
    Walter, Joern
    [J]. PLOS GENETICS, 2006, 2 (03): : 243 - 252
  • [7] Analysing and interpreting DNA methylation data
    Bock, Christoph
    [J]. NATURE REVIEWS GENETICS, 2012, 13 (10) : 705 - 719
  • [8] EpiGRAPH: user-friendly software for statistical analysis and prediction of (epi)genomic data
    Bock, Christoph
    Halachev, Konstantin
    Buech, Joachim
    Lengauer, Thomas
    [J]. GENOME BIOLOGY, 2009, 10 (02):
  • [9] Whole-genome DNA methylation profiling using MethylCap-seq
    Brinkman, Arie B.
    Simmer, Femke
    Ma, Kelong
    Kaan, Anita
    Zhu, Jingde
    Stunnenberg, Hendrik G.
    [J]. METHODS, 2010, 52 (03) : 232 - 236
  • [10] A Bayesian deconvolution strategy for immunoprecipitation-based DNA methylome analysis
    Down, Thomas A.
    Rakyan, Vardhman K.
    Turner, Daniel J.
    Flicek, Paul
    Li, Heng
    Kulesha, Eugene
    Graf, Stefan
    Johnson, Nathan
    Herrero, Javier
    Tomazou, Eleni M.
    Thorne, Natalie P.
    Backdahl, Liselotte
    Herberth, Marlis
    Howe, Kevin L.
    Jackson, David K.
    Miretti, Marcos M.
    Marioni, John C.
    Birney, Ewan
    Hubbard, Tim J. P.
    Durbin, Richard
    Tavare, Simon
    Beck, Stephan
    [J]. NATURE BIOTECHNOLOGY, 2008, 26 (07) : 779 - 785