Integrating predicted transcriptome from multiple tissues improves association detection

被引:184
作者
Barbeira, Alvaro N. [1 ]
Pividori, Milton D. [1 ]
Zheng, Jiamao [1 ]
Wheeler, Heather E. [2 ,3 ]
Nicolae, Dan L. [1 ,4 ,5 ]
Im, Hae Kyung [1 ,5 ]
机构
[1] Univ Chicago, Sect Genet Med, Chicago, IL 60637 USA
[2] Loyola Univ, Dept Biol, Chicago, IL 60626 USA
[3] Loyola Univ, Dept Comp Sci, Chicago, IL 60611 USA
[4] Univ Chicago, Dept Stat, Chicago, IL 60637 USA
[5] Univ Chicago, Dept Human Genet, Chicago, IL 60637 USA
来源
PLOS GENETICS | 2019年 / 15卷 / 01期
基金
美国国家卫生研究院; 英国惠康基金;
关键词
ENDOTHELIAL GROWTH-FACTOR; EXPRESSION; VARIANTS; SUSCEPTIBILITY; ADAM15; GENES; RISK; LOCI;
D O I
10.1371/journal.pgen.1007889
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Integration of genome-wide association studies (GWAS) and expression quantitative trait loci (eQTL) studies is needed to improve our understanding of the biological mechanisms underlying GWAS hits, and our ability to identify therapeutic targets. Gene-level association methods such as PrediXcan can prioritize candidate targets. However, limited eQTL sample sizes and absence of relevant developmental and disease context restrict our ability to detect associations. Here we propose an efficient statistical method (MultiXcan) that leverages the substantial sharing of eQTLs across tissues and contexts to improve our ability to identify potential target genes. MultiXcan integrates evidence across multiple panels using multivariate regression, which naturally takes into account the correlation structure. We apply our method to simulated and real traits from the UK Biobank and show that, in realistic settings, we can detect a larger set of significantly associated genes than using each panel separately. To improve applicability, we developed a summary result-based extension called S-MultiXcan, which we show yields highly concordant results with the individual level version when LD is well matched. Our multivariate model-based approach allowed us to use the individual level results as a gold standard to calibrate and develop a robust implementation of the summary-based extension. Results from our analysis as well as software and necessary resources to apply our method are publicly available. Author summary We develop a new method, MultiXcan, to test the mediating role of gene expression variation on complex traits, integrating information available across multiple tissue studies. We show this approach has higher power than traditional single-tissue methods. We extend this method to use only summary-statistics from public GWAS. We apply these methods to 222 complex traits available in the UK Biobank cohort, and 109 complex traits from public GWAS and discuss the findings.
引用
收藏
页数:20
相关论文
共 38 条
[1]   A global reference for human genetic variation [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Wang, Jun ;
Wilson, Richard K. ;
Boerwinkle, Eric ;
Doddapaneni, Harsha ;
Han, Yi ;
Korchina, Viktoriya ;
Kovar, Christie ;
Lee, Sandra ;
Muzny, Donna ;
Reid, Jeffrey G. ;
Zhu, Yiming ;
Chang, Yuqi ;
Feng, Qiang ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Lan, Tianming ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Liu, Shengmao ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Tang, Meifang ;
Wang, Bo .
NATURE, 2015, 526 (7571) :68-+
[2]  
[Anonymous], LOCAL GENETIC EFFECT
[3]   Altered expression of ADAMS (A Disintegrin And Metalloproteinase) in fibrillating human atria [J].
Arndt, M ;
Lendeckel, U ;
Röcken, C ;
Nepple, K ;
Wolke, C ;
Spiess, A ;
Huth, C ;
Ansorge, S ;
Klein, HU ;
Goette, A .
CIRCULATION, 2002, 105 (06) :720-725
[4]   Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics [J].
Barbeira, Alvaro N. ;
Dickinson, Scott P. ;
Bonazzola, Rodrigo ;
Zheng, Jiamao ;
Wheeler, Heather E. ;
Torres, Jason M. ;
Torstenson, Eric S. ;
Shah, Kaanan P. ;
Garcia, Tzintzuni ;
Edwards, Todd L. ;
Stahl, Eli A. ;
Huckins, Laura M. ;
Nicolae, Dan L. ;
Cox, Nancy J. ;
Im, Hae Kyung .
NATURE COMMUNICATIONS, 2018, 9
[5]   The UK Biobank resource with deep phenotyping and genomic data [J].
Bycroft, Clare ;
Freeman, Colin ;
Petkova, Desislava ;
Band, Gavin ;
Elliott, Lloyd T. ;
Sharp, Kevin ;
Motyer, Allan ;
Vukcevic, Damjan ;
Delaneau, Olivier ;
O'Connell, Jared ;
Cortes, Adrian ;
Welsh, Samantha ;
Young, Alan ;
Effingham, Mark ;
McVean, Gil ;
Leslie, Stephen ;
Allen, Naomi ;
Donnelly, Peter ;
Marchini, Jonathan .
NATURE, 2018, 562 (7726) :203-+
[6]   Testing The Role of Circadian Genes in Conferring Risk for Psychiatric Disorders [J].
Byrne, Enda M. ;
Heath, Andrew C. ;
Madden, Pamela A. F. ;
Pergadia, Michele L. ;
Hickie, Ian B. ;
Montgomery, Grant W. ;
Martin, Nicholas G. ;
Wray, Naomi R. .
AMERICAN JOURNAL OF MEDICAL GENETICS PART B-NEUROPSYCHIATRIC GENETICS, 2014, 165 (03) :254-260
[7]   Large-scale association analysis identifies new risk loci for coronary artery disease [J].
Deloukas, Panos ;
Kanoni, Stavroula ;
Willenborg, Christina ;
Farrall, Martin ;
Assimes, Themistocles L. ;
Thompson, John R. ;
Ingelsson, Erik ;
Saleheen, Danish ;
Erdmann, Jeanette ;
Goldstein, Benjamin A. ;
Stirrups, Kathleen ;
Koenig, Inke R. ;
Cazier, Jean-Baptiste ;
Johansson, Asa ;
Hall, Alistair S. ;
Lee, Jong-Young ;
Willer, Cristen J. ;
Chambers, John C. ;
Esko, Tonu ;
Folkersen, Lasse ;
Goel, Anuj ;
Grundberg, Elin ;
Havulinna, Aki S. ;
Ho, Weang K. ;
Hopewell, Jemma C. ;
Eriksson, Niclas ;
Kleber, Marcus E. ;
Kristiansson, Kati ;
Lundmark, Per ;
Lyytikainen, Leo-Pekka ;
Rafelt, Suzanne ;
Shungin, Dmitry ;
Strawbridge, Rona J. ;
Thorleifsson, Gudmar ;
Tikkanen, Emmi ;
Van Zuydam, Natalie ;
Voight, Benjamin F. ;
Waite, Lindsay L. ;
Zhang, Weihua ;
Ziegler, Andreas ;
Absher, Devin ;
Altshuler, David ;
Balmforth, Anthony J. ;
Barroso, Ines ;
Braund, Peter S. ;
Burgdorf, Christof ;
Claudi-Boehm, Simone ;
Cox, David ;
Dimitriou, Maria ;
Do, Ron .
NATURE GENETICS, 2013, 45 (01) :25-U52
[8]   Second-generation PLINK: rising to the challenge of larger and richer datasets [J].
Chang, Christopher C. ;
Chow, Carson C. ;
Tellier, Laurent C. A. M. ;
Vattikuti, Shashaank ;
Purcell, Shaun M. ;
Lee, James J. .
GIGASCIENCE, 2015, 4
[9]   Integration of Genomic and Genetic Approaches Implicates IREB2 as a COPD Susceptibility Gene [J].
DeMeo, Dawn L. ;
Mariani, Thomas ;
Bhattacharya, Soumyaroop ;
Srisuma, Sorachai ;
Lange, Christoph ;
Litonjua, Augusto ;
Bueno, Raphael ;
Pillai, Sreekumar G. ;
Lomas, David A. ;
Sparrow, David ;
Shapiro, Steven D. ;
Criner, Gerard J. ;
Kim, Hong P. ;
Chen, Zhihua ;
Choi, Augustine M. K. ;
Reilly, John ;
Silverman, Edwin K. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2009, 85 (04) :493-502
[10]   Regularization Paths for Generalized Linear Models via Coordinate Descent [J].
Friedman, Jerome ;
Hastie, Trevor ;
Tibshirani, Rob .
JOURNAL OF STATISTICAL SOFTWARE, 2010, 33 (01) :1-22