Systematic evaluation and validation of reference and library selection methods for deconvolution of cord blood DNA methylation data

被引:121
作者
Gervin, Kristina [1 ,2 ]
Salas, Lucas A. [3 ]
Bakulski, Kelly M. [4 ]
van Zelm, Menno C. [5 ,6 ,7 ]
Koestler, Devin C. [8 ]
Wiencke, John K. [9 ]
Duijts, Liesbeth [10 ,11 ,12 ]
Moll, Henriette A. [13 ]
Kelsey, Karl T. [14 ,15 ]
Kobor, Michael S. [16 ,17 ]
Lyle, Robert [2 ,18 ]
Christensen, Brock C. [3 ,19 ,20 ,21 ]
Felix, Janine F. [10 ,13 ,22 ]
Jones, Meaghan J. [23 ,24 ]
机构
[1] Univ Oslo, Sch Pharm, Dept Pharm, Pharmacoepidemiol & Drug Safety Res Grp, Oslo, Norway
[2] Univ Oslo, Fac Math & Nat Sci, PharmaTox Strateg Res Initiat, Oslo, Norway
[3] Dartmouth Coll, Geisel Sch Med, Dept Epidemiol, 1 Med Ctr Dr, Lebanon, NH 03756 USA
[4] Univ Michigan, Sch Publ Hlth, Dept Epidemiol, Ann Arbor, MI 48109 USA
[5] Monash Univ, Cent Clin Sch, Dept Immunol & Pathol, Melbourne, Vic, Australia
[6] Alfred Hosp, Melbourne, Vic, Australia
[7] Univ Med Ctr Rotterdam, Dept Immunol, Erasmus MC, Rotterdam, Netherlands
[8] Univ Kansas, Med Ctr, Dept Biostat, Kansas City, KS 66103 USA
[9] Univ Calif San Francisco, Dept Neurol Surg, San Francisco, CA USA
[10] Univ Med Ctr Rotterdam, Erasmus MC, Generat R Study Grp, Rotterdam, Netherlands
[11] Univ Med Ctr Rotterdam, Erasmus MC Sophia Childrens Hosp, Dept Pediat, Div Resp Med & Allergol, Rotterdam, Netherlands
[12] Univ Med Ctr Rotterdam, Erasmus MC Sophia Childrens Hosp, Dept Pediat, Div Neonatol, Rotterdam, Netherlands
[13] Univ Med Ctr Rotterdam, Erasmus MC Sophia Childrens Hosp, Dept Pediat, Rotterdam, Netherlands
[14] Brown Univ, Dept Pathol & Lab Med, Providence, RI 02912 USA
[15] Brown Univ, Dept Epidemiol, Providence, RI 02912 USA
[16] Univ British Columbia, Dept Med Genet, Vancouver, BC, Canada
[17] BC Childrens Hosp Res Inst, Vancouver, BC, Canada
[18] Oslo Univ Hosp, Dept Med Genet, Oslo, Norway
[19] Dartmouth Coll, Dept Mol & Syst Biol, Geisel Sch Med, 1 Med Ctr Dr, Lebanon, NH 03756 USA
[20] Dartmouth Coll, Community & Family Med, Geisel Sch Med, 1 Med Ctr Dr, Lebanon, NH 03756 USA
[21] Dartmouth Coll, Dept Community & Family Med, Geisel Sch Med, 1 Med Ctr Dr, Lebanon, NH 03756 USA
[22] Univ Med Ctr Rotterdam, Dept Epidemiol, Erasmus MC, Rotterdam, Netherlands
[23] Univ Manitoba, Dept Biochem & Med Genet, Winnipeg, MB, Canada
[24] Childrens Hosp Res Inst Manitoba, Winnipeg, MB, Canada
基金
欧盟地平线“2020”; 美国国家卫生研究院;
关键词
DNAm; Cell type heterogeneity; Umbilical cord blood; Reference dataset; Deconvolution; IDOL; minfi; pickCompProbes; GENERATION R; T-CELL;
D O I
10.1186/s13148-019-0717-y
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
BackgroundUmbilical cord blood (UCB) is commonly used in epigenome-wide association studies of prenatal exposures. Accounting for cell type composition is critical in such studies as it reduces confounding due to the cell specificity of DNA methylation (DNAm). In the absence of cell sorting information, statistical methods can be applied to deconvolve heterogeneous cell mixtures. Among these methods, reference-based approaches leverage age-appropriate cell-specific DNAm profiles to estimate cellular composition. In UCB, four reference datasets comprising DNAm signatures profiled in purified cell populations have been published using the Illumina 450K and EPIC arrays. These datasets are biologically and technically different, and currently, there is no consensus on how to best apply them. Here, we systematically evaluate and compare these datasets and provide recommendations for reference-based UCB deconvolution.ResultsWe first evaluated the four reference datasets to ascertain both the purity of the samples and the potential cell cross-contamination. We filtered samples and combined datasets to obtain a joint UCB reference. We selected deconvolution libraries using two different approaches: automatic selection using the top differentially methylated probes from the function pickCompProbes in minfi and a standardized library selected using the IDOL (Identifying Optimal Libraries) iterative algorithm. We compared the performance of each reference separately and in combination, using the two approaches for reference library selection, and validated the results in an independent cohort (Generation R Study, n = 191) with matched Fluorescence-Activated Cell Sorting measured cell counts. Strict filtering and combination of the references significantly improved the accuracy and efficiency of cell type estimates. Ultimately, the IDOL library outperformed the library from the automatic selection method implemented in pickCompProbes.ConclusionThese results have important implications for epigenetic studies in UCB as implementing this method will optimally reduce confounding due to cellular heterogeneity. This work provides guidelines for future reference-based UCB deconvolution and establishes a framework for combining reference datasets in other tissues.
引用
收藏
页数:15
相关论文
共 34 条
[1]   Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays [J].
Aryee, Martin J. ;
Jaffe, Andrew E. ;
Corrada-Bravo, Hector ;
Ladd-Acosta, Christine ;
Feinberg, Andrew P. ;
Hansen, Kasper D. ;
Irizarry, Rafael A. .
BIOINFORMATICS, 2014, 30 (10) :1363-1369
[2]  
Bakulski K. M, 2016, EPIGENETICS
[3]   Epigenetic Research in Neuropsychiatric Disorders: the “Tissue Issue” [J].
Bakulski K.M. ;
Halladay A. ;
Hu V.W. ;
Mill J. ;
Fallin M.D. .
Current Behavioral Neuroscience Reports, 2016, 3 (3) :264-274
[4]   Validation of a DNA methylation reference panel for the estimation of nucleated cells types in cord blood [J].
Cardenas, Andres ;
Allard, Catherine ;
Doyon, Myriam ;
Houseman, E. Andres ;
Bakulski, Kelly M. ;
Perron, Patrice ;
Bouchard, Luigi ;
Hivert, Marie-France .
EPIGENETICS, 2016, 11 (11) :773-779
[5]   Root mean square error (RMSE) or mean absolute error (MAE)? - Arguments against avoiding RMSE in the literature [J].
Chai, T. ;
Draxler, R. R. .
GEOSCIENTIFIC MODEL DEVELOPMENT, 2014, 7 (03) :1247-1250
[6]   Cord blood hematopoietic cells from preterm infants display altered DNA methylation patterns [J].
de Goede, Olivia M. ;
Lavoie, Pascal M. ;
Robinson, Wendy P. .
CLINICAL EPIGENETICS, 2017, 9
[7]   Nucleated red blood cells impact DNA methylation and expression analyses of cord blood hematopoietic cells [J].
de Goede, Olivia M. ;
Razzaghian, Hamid R. ;
Price, E. Magda ;
Jones, Meaghan J. ;
Kobor, Michael S. ;
Robinson, Wendy P. ;
Lavoie, Pascal M. .
CLINICAL EPIGENETICS, 2015, 7
[8]  
Felix J.F., 2017, INT J EPIDEMIOL
[9]   Long-term prenatal exposure to paracetamol is associated with DNA methylation differences in children diagnosed with ADHD [J].
Gervin, Kristina ;
Nordeng, Hedvig ;
Ystrom, Eivind ;
Reichborn-Kjennerud, Ted ;
Lyle, Robert .
CLINICAL EPIGENETICS, 2017, 9
[10]   Cell type specific DNA methylation in cord blood: A 450K-reference data set and cell count-based validation of estimated cell type composition [J].
Gervin, Kristina ;
Page, Christian Magnus ;
Aass, Hans Christian D. ;
Jansen, Michelle A. ;
Fjeldstad, Heidi Elisabeth ;
Andreassen, Bettina Kulle ;
Duijts, Liesbeth ;
van Meurs, Joyce B. ;
van Zelm, Menno C. ;
Jaddoe, Vincent W. ;
Nordeng, Hedvig ;
Knudsen, Gunn Peggy ;
Magnus, Per ;
Nystad, Wenche ;
Staff, Anne Cathrine ;
Felix, Janine F. ;
Lyle, Robert .
EPIGENETICS, 2016, 11 (09) :690-698