Comparison of different cell type correction methods for genome-scale epigenetics studies

Cited by: 55
Authors
Kaushal, Akhilesh [1]
Zhang, Hongmei [1]
Karmaus, Wilfried J. J. [1]
Ray, Meredith [1]
Torres, Mylin A. [2, 3]
Smith, Alicia K. [2, 4]
Wang, Shu-Li [5]
Affiliations
[1] Univ Memphis, Div Epidemiol Biostat & Environm Hlth, Memphis, TN 38152 USA
[2] Emory Univ, Winship Canc Inst, 1365 Clifton Rd NE, Atlanta, GA 30322 USA
[3] Emory Univ, Dept Radiat Oncol, Sch Med, 1365 Clifton Rd NE, Atlanta, GA 30322 USA
[4] Emory Univ, Dept Psychiat & Behav Sci, Sch Med, 101 Woodruff Circle,Suite 4000, Atlanta, GA 30322 USA
[5] Natl Hlth Res Inst, Natl Inst Environm Hlth Sci, Miaoli, Taiwan
Source
BMC BIOINFORMATICS | 2017 / Vol. 18
Keywords
Cell-type composition; CpG sites; Genome-scale DNA methylation; Surrogate variables; DNA METHYLATION; ARSENIC EXPOSURE; GENE-EXPRESSION; HETEROGENEITY; BIOCONDUCTOR; GENDER; BREAST; BLOOD; AGE;
DOI
10.1186/s12859-017-1611-2
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Whole blood is frequently used in genome-wide association studies of DNA methylation patterns in relation to environmental exposures or clinical outcomes. These associations can be confounded by cellular heterogeneity. Algorithms have been developed to measure or adjust for this heterogeneity, and some have been compared in the literature. However, with new methods available, it is unknown whether the findings will be consistent and, if not, which method(s) perform better.
Methods: We compared eight cell-type correction methods: the method in the minfi R package, the method by Houseman et al., the removing unwanted variation (RUV) approach, the methods in the FaST-LMM-EWASher, ReFACTor, RefFreeEWAS, and RefFreeCellMix R programs, and one approach utilizing surrogate variables (SVA). We first evaluated the association of DNA methylation at each CpG across the whole genome with prenatal arsenic exposure levels and with cancer status, adjusted for the cell-type information estimated by each method. We then compared the CpGs showing statistical significance under the different approaches. For the methods implemented in minfi and proposed by Houseman et al., we used homogeneous data for which the composition of some blood cells was available and compared the known compositions with the estimated ones. Finally, for methods that do not explicitly estimate cell compositions, we evaluated their performance using simulated DNA methylation data with a set of latent variables representing "cell types".
Results: Results from the SVA-based method overall showed the highest agreement with all other methods except FaST-LMM-EWASher. Using homogeneous data, minfi provided better estimates of cell types than the originally proposed method by Houseman et al. Further simulation studies on the reference-free methods revealed that SVA provided good sensitivities and specificities, that RefFreeCellMix in general produced high sensitivities but tended to have low specificities when confounding was present, and that FaST-LMM-EWASher gave the lowest sensitivity but the highest specificity.
Conclusions: Results from real data and simulations indicate that SVA is recommended when the focus is on identifying informative CpGs. When appropriate reference data are available, the method implemented in the minfi package is recommended. However, if no such reference data are available, or if the focus is not on estimating cell proportions, the SVA method is suggested.
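The abstract ranks the reference-free methods by sensitivity and specificity in simulations but does not spell out how these are computed. The sketch below uses the standard definitions, assuming each simulated CpG is known to be truly associated or not and that a method returns the set of CpGs it calls significant. The function name, argument names, and CpG identifiers are hypothetical and are not taken from the paper.

```python
def sensitivity_specificity(all_cpgs, truly_associated, called_significant):
    """Standard sensitivity/specificity for one method on simulated data.

    all_cpgs:            every CpG tested in the simulation
    truly_associated:    CpGs simulated to be associated with the exposure
    called_significant:  CpGs the method reports as significant
    (All names here are illustrative assumptions, not the paper's code.)
    """
    universe = set(all_cpgs)
    truth = set(truly_associated) & universe
    called = set(called_significant) & universe

    tp = len(truth & called)             # true CpGs that were detected
    fn = len(truth - called)             # true CpGs that were missed
    fp = len(called - truth)             # null CpGs wrongly detected
    tn = len(universe - truth - called)  # null CpGs correctly left out

    sens = tp / (tp + fn) if (tp + fn) else float("nan")
    spec = tn / (tn + fp) if (tn + fp) else float("nan")
    return sens, spec


# Toy example: 10 CpGs, 4 truly associated, 4 called (3 correct, 1 false).
cpgs = [f"cg{i}" for i in range(1, 11)]
truth = ["cg1", "cg2", "cg3", "cg4"]
called = ["cg1", "cg2", "cg3", "cg7"]
sens, spec = sensitivity_specificity(cpgs, truth, called)
print(sens, spec)
```

Under these definitions, a conservative method that calls very few CpGs would show low sensitivity with high specificity, which is the pattern the abstract reports for FaST-LMM-EWASher.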
Pages: 12