Meta-imputation: An efficient method to combine genotype data after imputation with multiple reference panels

Cited by: 21
Authors
Yu, Ketian [1]
Das, Sayantan [1, 2]
LeFaive, Jonathon [1]
Kwong, Alan [1]
Pleiness, Jacob [1]
Forer, Lukas [3]
Schonherr, Sebastian [3]
Fuchsberger, Christian [1, 3, 4]
Smith, Albert Vernon [1]
Abecasis, Goncalo Rocha [1, 5]
Affiliations
[1] Univ Michigan, Dept Biostat, Ann Arbor, MI 48105 USA
[2] 23andMe, Sunnyvale, CA 94086 USA
[3] Med Univ Innsbruck, Inst Genet Epidemiol, Dept Genet & Pharmacol, A-6020 Innsbruck, Austria
[4] Eurac Res, Inst Biomed, I-39100 Bolzano, Italy
[5] Regeneron Pharmaceut Inc, 777 Old Saw Mill River Rd, Tarrytown, NY 10591 USA
Keywords
GENOME-WIDE ASSOCIATION; GENE-EXPRESSION; RARE; VARIANTS;
DOI
10.1016/j.ajhg.2022.04.002
Chinese Library Classification (CLC) number
Q3 [Genetics]
Discipline classification code
071007; 090102
Abstract
Genotype imputation is an integral tool in genome-wide association studies, in which it facilitates meta-analysis, increases power, and enables fine-mapping. With the increasing availability of whole-genome-sequence datasets, investigators have access to a multitude of reference-panel choices for genotype imputation. In principle, combining all sequenced whole genomes into a single large panel would provide the best imputation performance, but this is often cumbersome or impossible due to privacy restrictions. Here, we describe meta-imputation, a method that allows imputation results generated using different reference panels to be combined into a consensus imputed dataset. Our meta-imputation method requires small changes to the output of existing imputation tools to produce the necessary inputs, which are then combined using dynamically estimated weights that are tailored to each individual and genome segment. In the scenarios we examined, the method consistently outperforms imputation using a single reference panel and achieves accuracy comparable to imputation using a combined reference panel.
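The idea of combining per-panel results with weights specific to an individual and genome segment can be illustrated with a minimal sketch. This is not the authors' estimator: the abstract does not specify how the weights are estimated or what the required inputs are. The sketch assumes, hypothetically, that each panel supplies dosage estimates at a few sites in the segment where the true allele is known, and it uses a simple inverse-mean-squared-error heuristic. The function names `segment_weights` and `meta_dosage` are illustrative only.

```python
from typing import List, Sequence


def segment_weights(
    panel_dosages: Sequence[Sequence[float]],
    observed: Sequence[float],
    eps: float = 1e-6,
) -> List[float]:
    """Hypothetical heuristic (not the paper's method): weight each panel by
    the inverse mean squared error of its dosages against known alleles
    within one genome segment of one individual's haplotype."""
    if not panel_dosages or not observed:
        raise ValueError("need at least one panel and one observed site")
    scores = []
    for dosages in panel_dosages:
        if len(dosages) != len(observed):
            raise ValueError("each panel needs one dosage per observed site")
        mse = sum((d - o) ** 2 for d, o in zip(dosages, observed)) / len(observed)
        scores.append(1.0 / (mse + eps))  # eps avoids division by zero
    total = sum(scores)
    return [s / total for s in scores]  # normalize so weights sum to 1


def meta_dosage(dosages: Sequence[float], weights: Sequence[float]) -> float:
    """Consensus dosage at one variant: weighted average across panels."""
    if len(dosages) != len(weights):
        raise ValueError("one weight per panel is required")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must have a positive sum")
    return sum(w * d for w, d in zip(weights, dosages)) / total


# Two panels, two sites with known alleles (1 and 0) in the segment.
weights = segment_weights([[0.9, 0.1], [0.5, 0.5]], [1.0, 0.0])
# Combine the two panels' dosages at an untyped variant in the same segment.
consensus = meta_dosage([0.8, 0.2], weights)
```

In this toy input the first panel matches the known alleles more closely, so it receives the larger weight and the consensus dosage lies nearer to its estimate. Recomputing the weights for each segment and each individual is what makes the combination adaptive in the sense the abstract describes.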
Pages: 1007 / +
Page count: 10