Information recovery from low coverage whole-genome bisulfite sequencing

被引:28
|
作者
Libertini, Emanuele [1 ]
Heath, Simon C. [2 ]
Hamoudi, Rifat A. [3 ]
Gut, Marta [2 ]
Ziller, Michael J. [4 ,5 ,6 ]
Czyz, Agata [7 ]
Ruotti, Victor [7 ]
Stunnenberg, Hendrik G. [8 ]
Frontini, Mattia [9 ,10 ,11 ]
Ouwehand, Willem H. [9 ,10 ,12 ]
Meissner, Alexander [4 ,5 ,6 ]
Gut, Ivo G. [2 ]
Beck, Stephan [1 ]
机构
[1] UCL, Inst Canc, Med Genom, London WC1E 6BT, England
[2] CNAG, Parc Cient Barcelona, Barcelona 08028, Spain
[3] UCL, Div Surg & Intervent Sci, London W1W 7EJ, England
[4] MIT & Harvard, Broad Inst, Cambridge, MA 02142 USA
[5] Harvard Stem Cell Inst, Cambridge, MA 02138 USA
[6] Harvard Univ, Dept Stem Cell & Regenerat Biol, Cambridge, MA 02138 USA
[7] Illumina Inc, San Diego, CA 92121 USA
[8] Radboud Univ Nijmegen, Dept Mol Biol, NL-6525 GA Nijmegen, Netherlands
[9] Univ Cambridge, Dept Haematol, Cambridge CB2 0XY, England
[10] Natl Hlth Serv Blood & Transplant, Cambridge Biomedical Campus, Cambridge CB2 0XY, England
[11] Univ Cambridge, British Heart Fdn Ctr Excellence, Cambridge CB2 0QQ, England
[12] Wellcome Trust Sanger Inst, Wellcome Trust Genome Campus, Cambridge CB10 1SA, England
来源
NATURE COMMUNICATIONS | 2016年 / 7卷
基金
英国惠康基金;
关键词
EPIGENOME-WIDE ASSOCIATION; DNA METHYLATION; IDENTIFICATION; IMPUTATION; PACKAGE; REGIONS;
D O I
10.1038/ncomms11306
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The cost of whole-genome bisulfite sequencing (WGBS) remains a bottleneck for many studies and it is therefore imperative to extract as much information as possible from a given dataset. This is particularly important because even at the recommend 30X coverage for reference methylomes, up to 50% of high-resolution features such as differentially methylated positions (DMPs) cannot be called with current methods as determined by saturation analysis. To address this limitation, we have developed a tool that dynamically segments WGBS methylomes into blocks of comethylation (COMETs) from which lost information can be recovered in the form of differentially methylated COMETs (DMCs). Using this tool, we demonstrate recovery of similar to 30% of the lost DMP information content as DMCs even at very low (5X) coverage. This constitutes twice the amount that can be recovered using an existing method based on differentially methylated regions (DMRs). In addition, we explored the relationship between COMETs and haplotypes in lymphoblastoid cell lines of African and European origin. Using best fit analysis, we show COMETs to be correlated in a population-specific manner, suggesting that this type of dynamic segmentation may be useful for integrated (epi) genome-wide association studies in the future.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Whole-genome bisulfite sequencing in systemic sclerosis provides novel targets to understand disease pathogenesis
    Lu, Tianyuan
    Klein, Kathleen Oros
    Colmegna, Ines
    Lora, Maximilien
    Greenwood, Celia M. T.
    Hudson, Marie
    BMC MEDICAL GENOMICS, 2019, 12 (01)
  • [32] Ultra Low-Coverage Whole-Genome Sequencing as an Alternative to Genotyping Arrays in Genome-Wide Association Studies
    Chat, Vylyny
    Ferguson, Robert
    Morales, Leah
    Kirchhoff, Tomas
    FRONTIERS IN GENETICS, 2022, 12
  • [33] Beadchip technology to detect DNA methylation in mouse faithfully recapitulates whole-genome bisulfite sequencing
    Martin, Elizabeth M.
    Grimm, Sara A.
    Xu, Zongli
    Taylor, Jack A.
    Wade, Paul A.
    EPIGENOMICS, 2023, 15 (03) : 115 - 129
  • [34] Global analysis of DNA methylation in hepatocellular carcinoma via a whole-genome bisulfite sequencing approach
    Yan, Qian
    Tang, Ying
    He, Fan
    Xue, Jiao
    Zhou, Ruisheng
    Zhang, Xiaoying
    Luo, Huiyan
    Zhou, Daihan
    Wang, Xiongwen
    GENOMICS, 2021, 113 (05) : 3618 - 3634
  • [35] Genotyping by low-coverage whole-genome sequencing in intercross pedigrees from outbred founders: a cost-efficient approach
    Zan, Yanjun
    Payen, Thibaut
    Lillie, Mette
    Honaker, Christa F.
    Siegel, Paul B.
    Carlborg, Orjan
    GENETICS SELECTION EVOLUTION, 2019, 51 (01)
  • [36] Evaluation of tools for identifying large copy number variations from ultra-low-coverage whole-genome sequencing data
    Smolander, Johannes
    Khan, Sofia
    Singaravelu, Kalaimathy
    Kauko, Leni
    Lund, Riikka J.
    Laiho, Asta
    Elo, Laura L.
    BMC GENOMICS, 2021, 22 (01)
  • [37] A Bayesian Approach for Analysis of Whole-Genome Bisulfite Sequencing Data Identifies Disease-Associated Changes in DNA Methylation
    Rackham, Owen J. L.
    Langley, Sarah R.
    Oates, Thomas
    Vradi, Eleni
    Harmston, Nathan
    Srivastava, Prashant K.
    Behmoaras, Jacques
    Dellaportas, Petros
    Bottolo, Leonardo
    Petretto, Enrico
    GENETICS, 2017, 205 (04) : 1443 - 1458
  • [38] Population whole-genome bisulfite sequencing across two tissues highlights the environment as the principal source of human methylome variation
    Busche, Stephan
    Shao, Xiaojian
    Caron, Maxime
    Kwan, Tony
    Allum, Fiona
    Cheung, Warren A.
    Ge, Bing
    Westfall, Susan
    Simon, Marie-Michelle
    Barrett, Amy
    Bell, Jordana T.
    McCarthy, Mark I.
    Deloukas, Panos
    Blanchette, Mathieu
    Bourque, Guillaume
    Spector, Timothy D.
    Lathrop, Mark
    Pastinen, Tomi
    Grundberg, Elin
    GENOME BIOLOGY, 2015, 16
  • [39] Moment estimators of relatedness from low-depth whole-genome sequencing data
    Herzig, Anthony F.
    Ciullo, M.
    Consortium, FranceGenRef
    Leutenegger, A-L
    Perdry, H.
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [40] Whole-genome bisulfite sequencing identifies HDAC3-mediated DNA methylation in multiple myeloma
    Ogiya, Daisuke
    Ohguchi, Hiroto
    Liu, Jiye
    Kurata, Keiji
    Adamia, Sophia
    Hideshima, Teru
    Anderson, Kenneth C.
    CLINICAL LYMPHOMA MYELOMA & LEUKEMIA, 2019, 19 (10) : E72 - E72