A multistep approach to single nucleotide polymorphism–set analysis: an evaluation of power and type I error of gene-based tests of association after pathway-based association tests

被引:0
作者
Alessandra Valcarcel
Kelsey Grinde
Kaitlyn Cook
Alden Green
Nathan Tintle
机构
[1] University of Connecticut,Department of Statistics
[2] University of Washington,Department of Biostatistics
[3] Carleton College,Department of Mathematics and Statistics
[4] Harvard University,Department of Statistics
[5] Statistics and Computer Science,Department of Mathematics
[6] Dordt College,undefined
关键词
Mean Arterial Pressure; Causal Variant; Blood Pressure Medication; Pathway Test; Sequence Kernel Association Test;
D O I
10.1186/s12919-016-0055-4
中图分类号
学科分类号
摘要
The aggregation of functionally associated variants given a priori biological information can aid in the discovery of rare variants associated with complex diseases. Many methods exist that aggregate rare variants into a set and compute a single p value summarizing association between the set of rare variants and a phenotype of interest. These methods are often called gene-based, rare variant tests of association because the variants in the set are often all contained within the same gene. A reasonable extension of these approaches involves aggregating variants across an even larger set of variants (eg, all variants contained in genes within a pathway). Testing sets of variants such as pathways for association with a disease phenotype reduces multiple testing penalties, may increase power, and allows for straightforward biological interpretation. However, a significant variant-set association test does not indicate precisely which variants contained within that set are causal. Because pathways often contain many variants, it may be helpful to follow-up significant pathway tests by conducting gene-based tests on each gene in that pathway to narrow in on the region of causal variants. In this paper, we propose such a multistep approach for variant-set analysis that can also account for covariates and complex pedigree structure. We demonstrate this approach on simulated phenotypes from Genetic Analysis Workshop 19. We find generally better power for the multistep approach when compared to a more conventional, single-step approach that simply runs gene-based tests of association on each gene across the genome. Further work is necessary to evaluate the multistep approach on different data sets with different characteristics.
引用
收藏
相关论文
共 68 条
[1]  
Liu K(2013)A geometric framework for evaluating rare variant tests of association Genet Epidemiol 37 345-357
[2]  
Fast S(2012)Optimal unified approach for rare-variant association testing with application to small-sample case–control whole-exome sequencing studies Am J Hum Genet 91 224-237
[3]  
Zawistowski M(2011)Rare-variant association testing for sequencing data with the sequence kernel association test Am J Hum Genet 89 82-93
[4]  
Tintle NL(2008)Methods for detecting associations with rare variants for common diseases : application to analysis of sequence data Am J Hum Genet 83 311-321
[5]  
Lee S(2014)Integrated pathway-based approach identifies association between genomic regions at CTCF and CACNB2 and schizophrenia PLoS Genet 10 S105-99
[6]  
Emond MJ(2014)Application of family-based tests of association for rare variants to pathways BMC Proc 8 93-1045
[7]  
Bamshad MJ(2004)A global test for groups of genes: testing association with a clinical outcome Bioinformatics 20 9-2935
[8]  
Barnes KC(2011)Gene set analysis and network analysis for genome-wide association studies Cold Spring Harb Protoc 2011 1028-210
[9]  
Rieder MJ(2012)Family-based association studies for next-generation sequencing Am J Hum Genet 90 667-undefined
[10]  
Nickerson DA(2012)Weighted pedigree-based statistics for testing the association of rare variants BMC Genomics 13 S2-undefined