A multiple-phenotype imputation method for genetic studies

被引:73
作者
Dahl, Andrew [1 ]
Iotchkova, Valentina [2 ,3 ]
Baud, Amelie [3 ]
Johansson, Asa [4 ]
Gyllensten, Ulf [4 ]
Soranzo, Nicole [2 ]
Mott, Richard [1 ]
Kranis, Andreas [5 ,6 ]
Marchini, Jonathan [1 ,7 ]
机构
[1] Univ Oxford, Wellcome Trust Ctr Human Genet, Oxford, England
[2] Wellcome Trust Sanger Inst, Human Genet, Wellcome Trust Genome Campus, Hinxton, England
[3] European Bioinformat Inst EMBL EBI, Wellcome Trust Genome Campus, Hinxton, England
[4] Uppsala Univ, Sci Life Lab Uppsala, Dept Immunol Genet & Pathol, Uppsala, Sweden
[5] Aviagen Ltd, Newbridge, England
[6] Univ Edinburgh, Roslin Inst, Edinburgh EH8 9YL, Midlothian, Scotland
[7] Univ Oxford, Dept Stat, Oxford OX1 3TG, England
基金
欧洲研究理事会; 英国惠康基金;
关键词
GENOME-WIDE ASSOCIATION; TRAIT LINKAGE ANALYSIS; GENOTYPE IMPUTATION; COMPLEX TRAITS; POPULATION; PLEIOTROPY; HERITABILITY; AGGREGATION; ALGORITHMS; ACTIVATION;
D O I
10.1038/ng.3513
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Genetic association studies have yielded a wealth of biological discoveries. However, these studies have mostly analyzed one trait and one SNP at a time, thus failing to capture the underlying complexity of the data sets. Joint genotype-phenotype analyses of complex, high-dimensional data sets represent an important way to move beyond simple genome-wide association studies (GWAS) with great potential. The move to high-dimensional phenotypes will raise many new statistical problems. Here we address the central issue of missing phenotypes in studies with any level of relatedness between samples. We propose a multiple-phenotype mixed model and use a computationally efficient variational Bayesian algorithm to fit the model. On a variety of simulated and real data sets from a range of organisms and trait types, we show that our method outperforms existing state-of-the-art methods from the statistics and machine learning literature and can boost signals of association.
引用
收藏
页码:466 / +
页数:9
相关论文
共 60 条
[1]   Dissection of additive genetic variability for quantitative traits in chickens using SNP markers [J].
Abdollahi-Arpanahi, R. ;
Pakdel, A. ;
Nejati-Javaremi, A. ;
Shahrbabak, M. Moradi ;
Morota, G. ;
Valente, B. D. ;
Kranis, A. ;
Rosa, G. J. M. ;
Gianola, D. .
JOURNAL OF ANIMAL BREEDING AND GENETICS, 2014, 131 (03) :183-193
[2]   Association analysis in a variance components framework [J].
Abecasis, GR ;
Cardon, LR ;
Cookson, WOC ;
Sham, PC ;
Cherny, SS .
GENETIC EPIDEMIOLOGY, 2001, 21 :S341-S346
[3]   TRANSPOSABLE REGULARIZED COVARIANCE MODELS WITH AN APPLICATION TO MISSING DATA IMPUTATION [J].
Allen, Genevera I. ;
Tibshirani, Robert .
ANNALS OF APPLIED STATISTICS, 2010, 4 (02) :764-790
[4]  
Almasy L, 1997, GENET EPIDEMIOL, V14, P953, DOI 10.1002/(SICI)1098-2272(1997)14:6<953::AID-GEPI65>3.0.CO
[5]  
2-K
[6]   Multipoint quantitative-trait linkage analysis in general pedigrees [J].
Almasy, L ;
Blangero, J .
AMERICAN JOURNAL OF HUMAN GENETICS, 1998, 62 (05) :1198-1211
[7]  
[Anonymous], 1987, Statistical analysis with missing data
[8]   The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans [J].
Ardlie, Kristin G. ;
DeLuca, David S. ;
Segre, Ayellet V. ;
Sullivan, Timothy J. ;
Young, Taylor R. ;
Gelfand, Ellen T. ;
Trowbridge, Casandra A. ;
Maller, Julian B. ;
Tukiainen, Taru ;
Lek, Monkol ;
Ward, Lucas D. ;
Kheradpour, Pouya ;
Iriarte, Benjamin ;
Meng, Yan ;
Palmer, Cameron D. ;
Esko, Tonu ;
Winckler, Wendy ;
Hirschhorn, Joel N. ;
Kellis, Manolis ;
MacArthur, Daniel G. ;
Getz, Gad ;
Shabalin, Andrey A. ;
Li, Gen ;
Zhou, Yi-Hui ;
Nobel, Andrew B. ;
Rusyn, Ivan ;
Wright, Fred A. ;
Lappalainen, Tuuli ;
Ferreira, Pedro G. ;
Ongen, Halit ;
Rivas, Manuel A. ;
Battle, Alexis ;
Mostafavi, Sara ;
Monlong, Jean ;
Sammeth, Michael ;
Mele, Marta ;
Reverter, Ferran ;
Goldmann, Jakob M. ;
Koller, Daphne ;
Guigo, Roderic ;
McCarthy, Mark I. ;
Dermitzakis, Emmanouil T. ;
Gamazon, Eric R. ;
Im, Hae Kyung ;
Konkashbaev, Anuar ;
Nicolae, Dan L. ;
Cox, Nancy J. ;
Flutre, Timothee ;
Wen, Xiaoquan ;
Stephens, Matthew .
SCIENCE, 2015, 348 (6235) :648-660
[9]   Combined sequence-based and genetic mapping analysis of complex traits in outbred rats [J].
Baud, Amelie ;
Hermsen, Roel ;
Guryev, Victor ;
Stridh, Pernilla ;
Graham, Delyth ;
McBride, Martin W. ;
Foroud, Tatiana ;
Calderari, Sophie ;
Diez, Margarita ;
Ockinger, Johan ;
Beyeen, Amennai D. ;
Gillett, Alan ;
Abdelmagid, Nada ;
Guerreiro-Cacais, Andre Ortlieb ;
Jagodic, Maja ;
Tuncel, Jonatan ;
Norin, Ulrika ;
Beattie, Elisabeth ;
Ngan Huynh ;
Miller, William H. ;
Koller, Daniel L. ;
Alam, Imranul ;
Falak, Samreen ;
Osborne-Pellegrin, Mary ;
Martinez-Membrives, Esther ;
Canete, Toni ;
Blazquez, Gloria ;
Vicens-Costa, Elia ;
Mont-Cardona, Carme ;
Diaz-Moran, Sira ;
Tobena, Adolf ;
Hummel, Oliver ;
Zelenika, Diana ;
Saar, Kathrin ;
Patone, Giannino ;
Bauerfeind, Anja ;
Bihoreau, Marie-Therese ;
Heinig, Matthias ;
Lee, Young-Ae ;
Rintisch, Carola ;
Schulz, Herbert ;
Wheeler, David A. ;
Worley, Kim C. ;
Muzny, Donna M. ;
Gibbs, Richard A. ;
Lathrop, Mark ;
Lansu, Nico ;
Toonen, Pim ;
Ruzius, Frans Paul ;
de Bruijn, Ewart .
NATURE GENETICS, 2013, 45 (07) :767-+
[10]   Finding the sources of missing heritability in a yeast cross [J].
Bloom, Joshua S. ;
Ehrenreich, Ian M. ;
Loo, Wesley T. ;
Thuy-Lan Vo Lite ;
Kruglyak, Leonid .
NATURE, 2013, 494 (7436) :234-237