Accounting for Population Stratification in DNA Methylation Studies

被引:179
作者
Barfield, Richard T. [1 ]
Almli, Lynn M. [2 ]
Kilaru, Varun [2 ]
Smith, Alicia K. [2 ]
Mercer, Kristina B. [2 ]
Duncan, Richard [3 ]
Klengel, Torsten [4 ]
Mehta, Divya [4 ]
Binder, Elisabeth B. [2 ,4 ]
Epstein, Michael P. [3 ]
Ressler, Kerry J. [2 ]
Conneely, Karen N. [3 ]
机构
[1] Harvard Univ, Dept Biostat, Boston, MA 02115 USA
[2] Emory Univ, Sch Med, Dept Psychiat & Behav Sci, Atlanta, GA 30322 USA
[3] Emory Univ, Sch Med, Dept Human Genet, Atlanta, GA 30322 USA
[4] Max Planck Inst Psychiat, D-80804 Munich, Germany
基金
美国国家卫生研究院;
关键词
association studies; principal components; population stratification; DNA methylation; GENE-EXPRESSION; GENOMIC CONTROL; ASSOCIATION; SMOKING; DISCOVERY; TISSUES; CANCER; CELLS; SCALE;
D O I
10.1002/gepi.21789
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
DNA methylation is an important epigenetic mechanism that has been linked to complex diseases and is of great interest to researchers as a potential link between genome, environment, and disease. As the scale of DNA methylation association studies approaches that of genome-wide association studies, issues such as population stratification will need to be addressed. It is well-documented that failure to adjust for population stratification can lead to false positives in genetic association studies, but population stratification is often unaccounted for in DNA methylation studies. Here, we propose several approaches to correct for population stratification using principal components (PCs) from different subsets of genome-wide methylation data. We first illustrate the potential for confounding due to population stratification by demonstrating widespread associations between DNA methylation and race in 388 individuals (365 African American and 23 Caucasian). We subsequently evaluate the performance of our PC-based approaches and other methods in adjusting for confounding due to population stratification. Our simulations show that (1) all of the methods considered are effective at removing inflation due to population stratification, and (2) maximum power can be obtained with single-nucleotide polymorphism (SNP)-based PCs, followed by methylation-based PCs, which outperform both surrogate variable analysis and genomic control. Among our different approaches to computing methylation-based PCs, we find that PCs based on CpG sites chosen for their potential to proxy nearby SNPs can provide a powerful and computationally efficient approach to adjust for population stratification in DNA methylation studies when genome-wide SNP data are unavailable.
引用
收藏
页码:231 / 241
页数:11
相关论文
共 54 条
[1]   Racial Differences in Gene-Specific DNA Methylation Levels are Present at Birth [J].
Adkins, Ronald M. ;
Krushkal, Julia ;
Tylavsky, Frances A. ;
Thomas, Fridtjof .
BIRTH DEFECTS RESEARCH PART A-CLINICAL AND MOLECULAR TERATOLOGY, 2011, 91 (08) :728-736
[2]   Age-associated DNA methylation in pediatric populations [J].
Alisch, Reid S. ;
Barwick, Benjamin G. ;
Chopra, Pankaj ;
Myrick, Leila K. ;
Satten, Glen A. ;
Conneely, Karen N. ;
Warren, Stephen T. .
GENOME RESEARCH, 2012, 22 (04) :623-632
[3]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[4]   Integrating common and rare genetic variation in diverse human populations [J].
Altshuler, David M. ;
Gibbs, Richard A. ;
Peltonen, Leena ;
Dermitzakis, Emmanouil ;
Schaffner, Stephen F. ;
Yu, Fuli ;
Bonnen, Penelope E. ;
de Bakker, Paul I. W. ;
Deloukas, Panos ;
Gabriel, Stacey B. ;
Gwilliam, Rhian ;
Hunt, Sarah ;
Inouye, Michael ;
Jia, Xiaoming ;
Palotie, Aarno ;
Parkin, Melissa ;
Whittaker, Pamela ;
Chang, Kyle ;
Hawes, Alicia ;
Lewis, Lora R. ;
Ren, Yanru ;
Wheeler, David ;
Muzny, Donna Marie ;
Barnes, Chris ;
Darvishi, Katayoon ;
Hurles, Matthew ;
Korn, Joshua M. ;
Kristiansson, Kati ;
Lee, Charles ;
McCarroll, Steven A. ;
Nemesh, James ;
Keinan, Alon ;
Montgomery, Stephen B. ;
Pollack, Samuela ;
Price, Alkes L. ;
Soranzo, Nicole ;
Gonzaga-Jauregui, Claudia ;
Anttila, Verneri ;
Brodeur, Wendy ;
Daly, Mark J. ;
Leslie, Stephen ;
McVean, Gil ;
Moutsianas, Loukas ;
Nguyen, Huy ;
Zhang, Qingrun ;
Ghori, Mohammed J. R. ;
McGinnis, Ralph ;
McLaren, William ;
Takeuchi, Fumihiko ;
Grossman, Sharon R. .
NATURE, 2010, 467 (7311) :52-58
[5]   The power of genomic control [J].
Bacanu, SA ;
Devlin, B ;
Roeder, K .
AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 66 (06) :1933-1944
[6]   CpGassoc: an R function for analysis of DNA methylation microarray data [J].
Barfield, Richard T. ;
Kilaru, Varun ;
Smith, Alicia K. ;
Conneely, Karen N. .
BIOINFORMATICS, 2012, 28 (09) :1280-1281
[7]   DNA methylation patterns associate with genetic and gene expression variation in HapMap cell lines [J].
Bell, Jordana T. ;
Pai, Athma A. ;
Pickrell, Joseph K. ;
Gaffney, Daniel J. ;
Pique-Regi, Roger ;
Degner, Jacob F. ;
Gilad, Yoav ;
Pritchard, Jonathan K. .
GENOME BIOLOGY, 2011, 12 (01)
[8]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[9]   The Relationship of DNA Methylation with Age, Gender and Genotype in Twins and Healthy Controls [J].
Boks, Marco P. ;
Derks, Eske M. ;
Weisenberger, Daniel J. ;
Strengman, Erik ;
Janson, Esther ;
Sommer, Iris E. ;
Kahn, Rene S. ;
Ophoff, Roel A. .
PLOS ONE, 2009, 4 (08)
[10]   Tobacco-Smoking-Related Differential DNA Methylation: 27K Discovery and Replication [J].
Breitling, Lutz P. ;
Yang, Rongxi ;
Korn, Bernhard ;
Burwinkel, Barbara ;
Brenner, Hermann .
AMERICAN JOURNAL OF HUMAN GENETICS, 2011, 88 (04) :450-457