De novo identification of differentially methylated regions in the human genome

被引:663
作者
Peters, Timothy J. [1 ]
Buckley, Michael J. [1 ]
Statham, Aaron L. [2 ]
Pidsley, Ruth [2 ]
Samaras, Katherine [3 ]
Lord, Reginald V. [4 ]
Clark, Susan J. [2 ,5 ]
Molloy, Peter L. [6 ]
机构
[1] CSIRO, Digital Prod Flagship, Riverside Life Sci Ctr, N Ryde, NSW 2113, Australia
[2] Garvan Inst Med Res, Epigenet Program, Sydney, NSW, Australia
[3] St Vincents Hosp, Darlinghurst, NSW 2010, Australia
[4] Univ Notre Dame, Sch Med, Darlinghurst, NSW 2010, Australia
[5] Univ New S Wales, Fac Med, St Vincents Clin Sch, Darlinghurst, NSW 2010, Australia
[6] CSIRO, Food & Nutr Flagship, Riverside Life Sci Ctr, Sydney, NSW, Australia
关键词
Differential DNA methylation; Kernel smoothing; Illumina; DNA METHYLATION; CANCER GENOME; R PACKAGE; ILLUMINA; ARRAY; REGRESSION; DISCOVERY; VALIDATION; TISSUES;
D O I
10.1186/1756-8935-8-6
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background: The identification and characterisation of differentially methylated regions (DMRs) between phenotypes in the human genome is of prime interest in epigenetics. We present a novel method, DMRcate, that fits replicated methylation measurements from the Illumina HM450K BeadChip (or 450K array) spatially across the genome using a Gaussian kernel. DMRcate identifies and ranks the most differentially methylated regions across the genome based on tunable kernel smoothing of the differential methylation (DM) signal. The method is agnostic to both genomic annotation and local change in the direction of the DM signal, removes the bias incurred from irregularly spaced methylation sites, and assigns significance to each DMR called via comparison to a null model. Results: We show that, for both simulated and real data, the predictive performance of DMRcate is superior to those of Bumphunter and Probe Lasso, and commensurate with that of comb-p. For the real data, we validate all array-derived DMRs from the candidate methods on a suite of DMRs derived from whole-genome bisulfite sequencing called from the same DNA samples, using two separate phenotype comparisons. Conclusions: The agglomeration of genomically localised individual methylation sites into discrete DMRs is currently best served by a combination of DM-signal smoothing and subsequent threshold specification. The findings also suggest the design of the 450K array shows preference for CpG sites that are more likely to be differentially methylated, but its overall coverage does not adequately reflect the depth and complexity of methylation signatures afforded by sequencing. For the convenience of the research community we have created a user-friendly R software package called DMRcate, downloadable from Bioconductor and compatible with existing preprocessing packages, which allows others to apply the same DMR-finding method on 450K array data.
引用
收藏
页数:16
相关论文
共 69 条
[61]   Local significant differences from nonparametric two-sample tests [J].
Tarn Duong .
JOURNAL OF NONPARAMETRIC STATISTICS, 2013, 25 (03) :635-645
[63]   Epigenetics and human obesity [J].
van Dijk, S. J. ;
Molloy, P. L. ;
Varinli, H. ;
Morrison, J. L. ;
Muhlhausler, B. S. .
INTERNATIONAL JOURNAL OF OBESITY, 2015, 39 (01) :85-97
[64]   Discovering high-resolution patterns of differential DNA methylation that correlate with gene expression changes [J].
VanderKraats, Nathan D. ;
Hiken, Jeffrey F. ;
Decker, Keith F. ;
Edwards, John R. .
NUCLEIC ACIDS RESEARCH, 2013, 41 (14) :6816-6827
[65]   IMA: an R package for high-throughput analysis of Illumina's 450K Infinium methylation data [J].
Wang, Dan ;
Yan, Li ;
Hu, Qiang ;
Sucheston, Lara E. ;
Higgins, Michael J. ;
Ambrosone, Christine B. ;
Johnson, Candace S. ;
Smiraglia, Dominic J. ;
Liu, Song .
BIOINFORMATICS, 2012, 28 (05) :729-730
[66]   COHCAP: an integrative genomic pipeline for single-nucleotide resolution DNA methylation analysis [J].
Warden, Charles D. ;
Lee, Heehyoung ;
Tompkins, Joshua D. ;
Li, Xiaojin ;
Wang, Charles ;
Riggs, Arthur D. ;
Yu, Hua ;
Jove, Richard ;
Yuan, Yate-Ching .
NUCLEIC ACIDS RESEARCH, 2013, 41 (11) :e117
[67]   Gene ontology analysis for RNA-seq: accounting for selection bias [J].
Young, Matthew D. ;
Wakefield, Matthew J. ;
Smyth, Gordon K. ;
Oshlack, Alicia .
GENOME BIOLOGY, 2010, 11 (02)
[68]   Functional DNA methylation differences between tissues, cell types, and across individuals discovered using the M&M algorithm [J].
Zhang, Bo ;
Zhou, Yan ;
Lin, Nan ;
Lowdon, Rebecca F. ;
Hong, Chibo ;
Nagarajan, Raman P. ;
Cheng, Jeffrey B. ;
Li, Daofeng ;
Stevens, Michael ;
Lee, Hyung Joo ;
Xing, Xiaoyun ;
Zhou, Jia ;
Sundaram, Vasavi ;
Elliott, GiNell ;
Gu, Junchen ;
Shi, Taoping ;
Gascard, Philippe ;
Sigaroudinia, Mahvash ;
Tisty, Thea D. ;
Kadlecek, Theresa ;
Weiss, Arthur ;
O'Geen, Henriette ;
Farnham, Peggy J. ;
Maire, Cecile L. ;
Ligon, Keith L. ;
Madden, Pamela A. F. ;
Tam, Angela ;
Moore, Richard ;
Hirst, Martin ;
Marra, Marco A. ;
Zhang, Baoxue ;
Costello, Joseph F. ;
Wang, Ting .
GENOME RESEARCH, 2013, 23 (09) :1522-1540
[69]   QDMR: a quantitative method for identification of differentially methylated regions by entropy [J].
Zhang, Yan ;
Liu, Hongbo ;
Lv, Jie ;
Xiao, Xue ;
Zhu, Jiang ;
Liu, Xiaojuan ;
Su, Jianzhong ;
Li, Xia ;
Wu, Qiong ;
Wang, Fang ;
Cui, Ying .
NUCLEIC ACIDS RESEARCH, 2011, 39 (09) :e58