Applying meta-analysis to genotype-tissue expression data from multiple tissues to identify eQTLs and increase the number of eGenes

被引:17
作者
Duong, Dat [1 ]
Gai, Lisa [1 ]
Snir, Sagi [2 ,3 ]
Kang, Eun Yong [1 ]
Han, Buhm [4 ,5 ]
Sul, Jae Hoon [6 ]
Eskin, Eleazar [1 ,7 ]
机构
[1] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
[2] Univ Haifa, Inst Evolut, IL-3498838 Haifa, Israel
[3] Univ Haifa, Dept Evolutionary & Environm Biol, IL-3498838 Haifa, Israel
[4] Univ Ulsan, Coll Med, Dept Convergence Med, Seoul 05505, South Korea
[5] Asan Inst Life Sci, Asan Med Ctr, Seoul 05505, South Korea
[6] Univ Calif Los Angeles, Dept Psychiat & Behav Sci, Los Angeles, CA 90095 USA
[7] Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
基金
美国国家科学基金会; 新加坡国家研究基金会; 美国国家卫生研究院;
关键词
GENE-EXPRESSION; ASSOCIATION; DISEASE; POWER; RISK;
D O I
10.1093/bioinformatics/btx227
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: There is recent interest in using gene expression data to contextualize findings from traditional genome-wide association studies (GWAS). Conditioned on a tissue, expression quantitative trait loci (eQTLs) are genetic variants associated with gene expression, and eGenes are genes whose expression levels are associated with genetic variants. eQTLs and eGenes provide great supporting evidence for GWAS hits and important insights into the regulatory pathways involved in many diseases. When a significant variant or a candidate gene identified by GWAS is also an eQTL or eGene, there is strong evidence to further study this variant or gene. Multi-tissue gene expression datasets like the Gene Tissue Expression (GTEx) data are used to find eQTLs and eGenes. Unfortunately, these datasets often have small sample sizes in some tissues. For this reason, there have been many meta-analysis methods designed to combine gene expression data across many tissues to increase power for finding eQTLs and eGenes. However, these existing techniques are not scalable to datasets containing many tissues, like the GTEx data. Furthermore, these methods ignore a biological insight that the same variant may be associated with the same gene across similar tissues. Results: We introduce a meta-analysis model that addresses these problems in existing methods. We focus on the problem of finding eGenes in gene expression data from many tissues, and show that our model is better than other types of meta-analyses.
引用
收藏
页码:I67 / I74
页数:8
相关论文
共 26 条
[1]  
Abraham B., 2006, Introduction To Regression Modeling
[2]   Exploiting expression patterns across multiple tissues to map expression quantitative trait loci [J].
Acharya, Chaitanya R. ;
McCarthy, Janice M. ;
Owzar, Kouros ;
Allen, Andrew S. .
BMC BIOINFORMATICS, 2016, 17
[3]   Brains, genes and power [J].
Albert, Frank W. .
NATURE NEUROSCIENCE, 2016, 19 (11) :1428-1430
[4]   The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans [J].
Ardlie, Kristin G. ;
DeLuca, David S. ;
Segre, Ayellet V. ;
Sullivan, Timothy J. ;
Young, Taylor R. ;
Gelfand, Ellen T. ;
Trowbridge, Casandra A. ;
Maller, Julian B. ;
Tukiainen, Taru ;
Lek, Monkol ;
Ward, Lucas D. ;
Kheradpour, Pouya ;
Iriarte, Benjamin ;
Meng, Yan ;
Palmer, Cameron D. ;
Esko, Tonu ;
Winckler, Wendy ;
Hirschhorn, Joel N. ;
Kellis, Manolis ;
MacArthur, Daniel G. ;
Getz, Gad ;
Shabalin, Andrey A. ;
Li, Gen ;
Zhou, Yi-Hui ;
Nobel, Andrew B. ;
Rusyn, Ivan ;
Wright, Fred A. ;
Lappalainen, Tuuli ;
Ferreira, Pedro G. ;
Ongen, Halit ;
Rivas, Manuel A. ;
Battle, Alexis ;
Mostafavi, Sara ;
Monlong, Jean ;
Sammeth, Michael ;
Mele, Marta ;
Reverter, Ferran ;
Goldmann, Jakob M. ;
Koller, Daphne ;
Guigo, Roderic ;
McCarthy, Mark I. ;
Dermitzakis, Emmanouil T. ;
Gamazon, Eric R. ;
Im, Hae Kyung ;
Konkashbaev, Anuar ;
Nicolae, Dan L. ;
Cox, Nancy J. ;
Flutre, Timothee ;
Wen, Xiaoquan ;
Stephens, Matthew .
SCIENCE, 2015, 348 (6235) :648-660
[5]  
Dabney A, 2010, qvalue: Q-value estimation for false discovery rate control, V1
[6]   Incorporating prior information into association studies [J].
Darnell, Gregory ;
Duong, Dat ;
Han, Buhm ;
Eskin, Eleazar .
BIOINFORMATICS, 2012, 28 (12) :I147-I153
[7]   Genomic control for association studies [J].
Devlin, B ;
Roeder, K .
BIOMETRICS, 1999, 55 (04) :997-1004
[8]   Using genomic annotations increases statistical power to detect eGenes [J].
Duong, Dat ;
Zou, Jennifer ;
Hormozdiari, Farhad ;
Sul, Jae Hoon ;
Ernst, Jason ;
Han, Buhm ;
Eskin, Eleazar .
BIOINFORMATICS, 2016, 32 (12) :156-163
[9]   Discovering Genes Involved in Disease and the Mystery of Missing Heritability [J].
Eskin, Eleazar .
COMMUNICATIONS OF THE ACM, 2015, 58 (10) :80-87
[10]   A Statistical Framework for Joint eQTL Analysis in Multiple Tissues [J].
Flutre, Timothee ;
Wen, Xiaoquan ;
Pritchard, Jonathan ;
Stephens, Matthew .
PLOS GENETICS, 2013, 9 (05)