Information recovery from low coverage whole-genome bisulfite sequencing

被引:28
|
作者
Libertini, Emanuele [1 ]
Heath, Simon C. [2 ]
Hamoudi, Rifat A. [3 ]
Gut, Marta [2 ]
Ziller, Michael J. [4 ,5 ,6 ]
Czyz, Agata [7 ]
Ruotti, Victor [7 ]
Stunnenberg, Hendrik G. [8 ]
Frontini, Mattia [9 ,10 ,11 ]
Ouwehand, Willem H. [9 ,10 ,12 ]
Meissner, Alexander [4 ,5 ,6 ]
Gut, Ivo G. [2 ]
Beck, Stephan [1 ]
机构
[1] UCL, Inst Canc, Med Genom, London WC1E 6BT, England
[2] CNAG, Parc Cient Barcelona, Barcelona 08028, Spain
[3] UCL, Div Surg & Intervent Sci, London W1W 7EJ, England
[4] MIT & Harvard, Broad Inst, Cambridge, MA 02142 USA
[5] Harvard Stem Cell Inst, Cambridge, MA 02138 USA
[6] Harvard Univ, Dept Stem Cell & Regenerat Biol, Cambridge, MA 02138 USA
[7] Illumina Inc, San Diego, CA 92121 USA
[8] Radboud Univ Nijmegen, Dept Mol Biol, NL-6525 GA Nijmegen, Netherlands
[9] Univ Cambridge, Dept Haematol, Cambridge CB2 0XY, England
[10] Natl Hlth Serv Blood & Transplant, Cambridge Biomedical Campus, Cambridge CB2 0XY, England
[11] Univ Cambridge, British Heart Fdn Ctr Excellence, Cambridge CB2 0QQ, England
[12] Wellcome Trust Sanger Inst, Wellcome Trust Genome Campus, Cambridge CB10 1SA, England
来源
NATURE COMMUNICATIONS | 2016年 / 7卷
基金
英国惠康基金;
关键词
EPIGENOME-WIDE ASSOCIATION; DNA METHYLATION; IDENTIFICATION; IMPUTATION; PACKAGE; REGIONS;
D O I
10.1038/ncomms11306
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The cost of whole-genome bisulfite sequencing (WGBS) remains a bottleneck for many studies and it is therefore imperative to extract as much information as possible from a given dataset. This is particularly important because even at the recommend 30X coverage for reference methylomes, up to 50% of high-resolution features such as differentially methylated positions (DMPs) cannot be called with current methods as determined by saturation analysis. To address this limitation, we have developed a tool that dynamically segments WGBS methylomes into blocks of comethylation (COMETs) from which lost information can be recovered in the form of differentially methylated COMETs (DMCs). Using this tool, we demonstrate recovery of similar to 30% of the lost DMP information content as DMCs even at very low (5X) coverage. This constitutes twice the amount that can be recovered using an existing method based on differentially methylated regions (DMRs). In addition, we explored the relationship between COMETs and haplotypes in lymphoblastoid cell lines of African and European origin. Using best fit analysis, we show COMETs to be correlated in a population-specific manner, suggesting that this type of dynamic segmentation may be useful for integrated (epi) genome-wide association studies in the future.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Whole-genome bisulfite sequencing with improved accuracy and cost
    Suzuki, Masako
    Liao, Will
    Wos, Frank
    Johnston, Andrew D.
    DeGrazia, Justin
    Ishii, Jennifer
    Bloom, Toby
    Zody, Michael C.
    Germer, Soren
    Greally, John M.
    GENOME RESEARCH, 2018, 28 (09) : 1364 - 1371
  • [2] Tagmentation-based whole-genome bisulfite sequencing
    Wang, Qi
    Gu, Lei
    Adey, Andrew
    Radlwimmer, Bernhard
    Wang, Wei
    Hovestadt, Volker
    Baehr, Marion
    Wolf, Stephan
    Shendure, Jay
    Eils, Roland
    Plass, Christoph
    Weichenhan, Dieter
    NATURE PROTOCOLS, 2013, 8 (10) : 2022 - 2032
  • [3] How to Design a Whole-Genome Bisulfite Sequencing Experiment
    Grehl, Claudius
    Kuhlmann, Markus
    Becker, Claude
    Glaser, Bruno
    Grosse, Ivo
    EPIGENOMES, 2018, 2 (04):
  • [4] Saturation analysis for whole-genome bisulfite sequencing data
    Libertini, Emanuele
    Heath, Simon C.
    Hamoudi, Rifat A.
    Gut, Marta
    Ziller, Michael J.
    Herrero, Javier
    Czyz, Agata
    Ruotti, Victor
    Stunnenberg, Hendrik G.
    Frontini, Mattia
    Ouwehand, Willem H.
    Meissner, Alexander
    Gut, Ivo G.
    Beck, Stephan
    NATURE BIOTECHNOLOGY, 2016, 34 (07) : 691 - 693
  • [5] Methodological aspects of whole-genome bisulfite sequencing analysis
    Adusumalli, Swarnaseetha
    Omar, Mohd Feroz Mohd
    Soong, Richie
    Benoukraf, Touati
    BRIEFINGS IN BIOINFORMATICS, 2015, 16 (03) : 369 - 379
  • [6] Comparison and quantitative verification of mapping algorithms for whole-genome bisulfite sequencing
    Kunde-Ramamoorthy, Govindarajan
    Coarfa, Cristian
    Laritsky, Eleonora
    Kessler, Noah J.
    Harris, R. Alan
    Xu, Mingchu
    Chen, Rui
    Shen, Lanlan
    Milosavljevic, Aleksandar
    Waterland, Robert A.
    NUCLEIC ACIDS RESEARCH, 2014, 42 (06) : e43
  • [7] An information-theoretic approach to the modeling and analysis of whole-genome bisulfite sequencing data
    Garrett Jenkinson
    Jordi Abante
    Andrew P. Feinberg
    John Goutsias
    BMC Bioinformatics, 19
  • [8] An information-theoretic approach to the modeling and analysis of whole-genome bisulfite sequencing data
    Jenkinson, Garrett
    Abante, Jordi
    Feinberg, Andrew P.
    Goutsias, John
    BMC BIOINFORMATICS, 2018, 19
  • [9] Ultra-low-input, tagmentation-based whole-genome bisulfite sequencing
    Adey, Andrew
    Shendure, Jay
    GENOME RESEARCH, 2012, 22 (06) : 1139 - 1143
  • [10] BoostMe accurately predicts DNA methylation values in whole-genome bisulfite sequencing of multiple human tissues
    Zou, Luli S.
    Erdos, Michael R.
    Taylor, D. Leland
    Chines, Peter S.
    Varshney, Arushi
    Parker, Stephen C. J.
    Collins, Francis S.
    Didion, John P.
    BMC GENOMICS, 2018, 19