A novel principal component based method for identifying differentially methylated regions in Illumina Infinium MethylationEPIC BeadChip data

被引:2
作者
Zheng, Yuanchao [1 ,2 ]
Lunetta, Kathryn L. [2 ]
Liu, Chunyu [2 ]
Smith, Alicia K. [3 ,4 ]
Sherva, Richard [1 ,5 ]
Miller, Mark W. [1 ,6 ]
Logue, Mark W. [1 ,2 ,5 ,6 ,7 ]
机构
[1] Natl Ctr PTSD, VA Boston Healthcare Syst, Boston, MA USA
[2] Boston Univ, Sch Publ Hlth, Dept Biostat, Boston, MA USA
[3] Emory Univ, Dept Gynecol & Obstet, Atlanta, GA USA
[4] Emory Univ, Sch Med, Dept Psychiat & Behav Sci, Atlanta, GA USA
[5] Boston Univ, Sch Med, Dept Psychiat, Boston, MA USA
[6] Boston Univ, Sch Med, Biomed Genet, Boston, MA USA
[7] Natl Ctr PTSD, VA Boston Healthcare Syst, Behav Sci Div, Boston, MA 02130 USA
关键词
Differentially methylated region; false positive rate; principal components; DNA METHYLATION; TOBACCO SMOKING;
D O I
10.1080/15592294.2023.2207959
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Differentially methylated regions (DMRs) are genomic regions with methylation patterns across multiple CpG sites that are associated with a phenotype. In this study, we proposed a Principal Component (PC) based DMR analysis method for use with data generated using the Illumina Infinium MethylationEPIC BeadChip (EPIC) array. We obtained methylation residuals by regressing the M-values of CpGs within a region on covariates, extracted PCs of the residuals, and then combined association information across PCs to obtain regional significance. Simulation-based genome-wide false positive (GFP) rates and true positive rates were estimated under a variety of conditions before determining the final version of our method, which we have named DMRPC. Then, DMRPC and another DMR method, coMethDMR, were used to perform epigenome-wide analyses of several phenotypes known to have multiple associated methylation loci (age, sex, and smoking) in a discovery and a replication cohort. Among regions that were analysed by both methods, DMRPC identified 50% more genome-wide significant age-associated DMRs than coMethDMR. The replication rate for the loci that were identified by only DMRPC was higher than the rate for those that were identified by only coMethDMR (90% for DMRPC vs. 76% for coMethDMR). Furthermore, DMRPC identified replicable associations in regions of moderate between-CpG correlation which are typically not analysed by coMethDMR. For the analyses of sex and smoking, the advantage of DMRPC was less clear. In conclusion, DMRPC is a new powerful DMR discovery tool that retains power in genomic regions with moderate correlation across CpGs.
引用
收藏
页数:19
相关论文
共 41 条
  • [1] Nucleotide distance influences co-methylation between nearby CpG sites
    Affinito, Ornella
    Palumbo, Domenico
    Fierro, Annalisa
    Cuomo, Mariella
    De Riso, Giulia
    Monticelli, Antonella
    Miele, Gennaro
    Chiariotti, Lorenzo
    Cocozza, Sergio
    [J]. GENOMICS, 2020, 112 (01) : 144 - 150
  • [2] Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays
    Aryee, Martin J.
    Jaffe, Andrew E.
    Corrada-Bravo, Hector
    Ladd-Acosta, Christine
    Feinberg, Andrew P.
    Hansen, Kasper D.
    Irizarry, Rafael A.
    [J]. BIOINFORMATICS, 2014, 30 (10) : 1363 - 1369
  • [3] Prediction by supervised principal components
    Bair, E
    Hastie, T
    Paul, D
    Tibshirani, R
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (473) : 119 - 137
  • [4] Semi-supervised methods to predict patient survival from gene expression data
    Bair, E
    Tibshirani, R
    [J]. PLOS BIOLOGY, 2004, 2 (04) : 511 - 522
  • [5] DNA methylation patterns associate with genetic and gene expression variation in HapMap cell lines
    Bell, Jordana T.
    Pai, Athma A.
    Pickrell, Joseph K.
    Gaffney, Daniel J.
    Pique-Regi, Roger
    Degner, Jacob F.
    Gilad, Yoav
    Pritchard, Jonathan K.
    [J]. GENOME BIOLOGY, 2011, 12 (01)
  • [6] Controlling the false discovery rate in behavior genetics research
    Benjamini, Y
    Drai, D
    Elmer, G
    Kafkafi, N
    Golani, I
    [J]. BEHAVIOURAL BRAIN RESEARCH, 2001, 125 (1-2) : 279 - 284
  • [7] Novel DNA methylation signatures of tobacco smoking with trans-ethnic effects
    Christiansen, C.
    Castillo-Fernandez, J. E.
    Domingo-Relloso, A.
    Zhao, W.
    El-Sayed Moustafa, J. S.
    Tsai, P. -C.
    Maddock, J.
    Haack, K.
    Cole, S. A.
    Kardia, S. L. R.
    Molokhia, M.
    Suderman, M.
    Power, C.
    Relton, C.
    Wong, A.
    Kuh, D.
    Goodman, A.
    Small, K. S.
    Smith, J. A.
    Tellez-Plaza, M.
    Navas-Acien, A.
    Ploubidis, G. B.
    Hardy, R.
    Bell, J. T.
    [J]. CLINICAL EPIGENETICS, 2021, 13 (01)
  • [8] Dewey M, 2022, metap:meta-analysis of significance values
  • [9] Comparison of Beta-value and M-value methods for quantifying methylation levels by microarray analysis
    Du, Pan
    Zhang, Xiao
    Huang, Chiang-Ching
    Jafari, Nadereh
    Kibbe, Warren A.
    Hou, Lifang
    Lin, Simon M.
    [J]. BMC BIOINFORMATICS, 2010, 11
  • [10] Concordant and discordant DNA methylation signatures of aging in human blood and brain
    Farre, Pau
    Jones, Meaghan J.
    Meaney, Michael J.
    Emberly, Eldon
    Turecki, Gustavo
    Kobor, Michael S.
    [J]. EPIGENETICS & CHROMATIN, 2015, 8