JOINT ANALYSIS OF SNP AND GENE EXPRESSION DATA IN GENETIC ASSOCIATION STUDIES OF COMPLEX DISEASES

Cited by: 75
Authors
Huang, Yen-Tsung [1]
VanderWeele, Tyler J. [2, 3]
Lin, Xihong [3]
Affiliations
[1] Brown Univ, Dept Epidemiol, Providence, RI 02912 USA
[2] Harvard Univ, Sch Publ Hlth, Dept Epidemiol, Boston, MA 02115 USA
[3] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
Keywords
Causal inference; data integration; mediation analysis; mixed models; score test; SNP set analysis; variance component test; GENOME-WIDE ASSOCIATION; MEDIATION; INFERENCE; PHENOTYPES; MODELS
DOI
10.1214/13-AOAS690
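Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Subject classification code
020208; 070103; 0714
Abstract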
Genetic association studies have been a popular approach for assessing the association between common Single Nucleotide Polymorphisms (SNPs) and complex diseases. However, other genomic data involved in the mechanism from SNPs to disease, for example, gene expressions, are usually neglected in these association studies. In this paper, we propose to exploit gene expression information to more powerfully test the association between SNPs and diseases by jointly modeling the relations among SNPs, gene expressions and diseases. We propose a variance component test for the total effect of SNPs and a gene expression on disease risk. We cast the test within the causal mediation analysis framework with the gene expression as a potential mediator. For eQTL SNPs, the use of gene expression information can enhance power to test for the total effect of a SNP-set, which is the combined direct and indirect effects of the SNPs mediated through the gene expression, on disease risk. We show that the test statistic under the null hypothesis follows a mixture of χ² distributions, which can be evaluated analytically or empirically using the resampling-based perturbation method. We construct tests for each of three disease models that are determined by SNPs only, SNPs and gene expression, or include also their interactions. As the true disease model is unknown in practice, we further propose an omnibus test to accommodate different underlying disease models. We evaluate the finite sample performance of the proposed methods using simulation studies, and show that our proposed test performs well and the omnibus test can almost reach the optimal power where the disease model is known and correctly specified. We apply our method to reanalyze the overall effect of the SNP-set and expression of the ORMDL3 gene on the risk of asthma.
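The abstract's central idea, a variance component score statistic whose null distribution is a mixture of χ² variables, can be illustrated with a generic sketch. The code below is not the authors' test: it assumes a continuous outcome and a linear kernel over the SNPs only (no gene expression or interaction terms, no logistic disease model), and it approximates the mixture by plain Monte Carlo sampling rather than the perturbation method described in the paper. The function name `variance_component_test` and its arguments are hypothetical.

```python
import numpy as np

def variance_component_test(y, X, G, n_draws=100_000, seed=0):
    """Illustrative SNP-set score test for a continuous outcome.

    y : (n,) outcome, X : (n, p) covariates including an intercept,
    G : (n, m) SNP genotypes. Returns (Q, Monte Carlo p-value).
    """
    rng = np.random.default_rng(seed)
    n = y.shape[0]

    # Fit the null model y = X a + e (no SNP effect) by least squares.
    alpha, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ alpha
    sigma2 = resid @ resid / (n - X.shape[1])

    # Score statistic Q = r' G G' r / sigma^2 (linear kernel over SNPs).
    score = G.T @ resid
    q_obs = score @ score / sigma2

    # Under the null, Q is approximately sum_j lambda_j * chi2_1, where
    # lambda_j are the eigenvalues of G' P G and P projects out X.
    PG = G - X @ np.linalg.lstsq(X, G, rcond=None)[0]
    lam = np.linalg.eigvalsh(PG.T @ PG)
    lam = lam[lam > 1e-10]

    # Approximate the tail probability of the chi-square mixture by simulation.
    draws = rng.chisquare(1, size=(n_draws, lam.size)) @ lam
    return float(q_obs), float(np.mean(draws >= q_obs))
```

With simulated genotypes that truly affect the outcome, the returned p-value should be very small; with no SNP effect it should look roughly uniform across repeated data sets. A Monte Carlo p-value cannot resolve probabilities far below `1 / n_draws`, so an analytic approximation to the mixture would be preferable at genome-wide significance thresholds.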
Pages: 352-376
Page count: 25