Controlling for human population stratification in rare variant association studies

Cited by: 9
Authors
Bouaziz, Matthieu [1 ,2 ]
Mullaert, Jimmy [1 ,2 ,3 ,4 ]
Bigio, Benedetta [5 ]
Seeleuthner, Yoann [1 ,2 ]
Casanova, Jean-Laurent [1 ,2 ,5 ,6 ]
Alcais, Alexandre [1 ,2 ]
Abel, Laurent [1 ,2 ,5 ]
Cobat, Aurelie [1 ,2 ]
Affiliations
[1] INSERM, Lab Human Genet Infect Dis, Necker Branch, U1163, Paris, France
[2] Univ Paris, Imagine Inst, F-75015 Paris, France
[3] Univ Paris, INSERM, IAME, F-75018 Paris, France
[4] Hop Bichat Claude Bernard, AP HP, DEBRC, F-75018 Paris, France
[5] Rockefeller Univ, Rockefeller Branch, St Giles Lab Human Genet Infect Dis, New York, NY 10021 USA
[6] Howard Hughes Med Inst, New York, NY USA
Keywords
LINEAR MIXED MODELS; PRINCIPAL-COMPONENTS; COMMON DISEASES; GENOME
DOI
10.1038/s41598-021-98370-5
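Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences]
Discipline classification code
07; 0710; 09
Abstract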
Population stratification is a confounder of genetic association studies. In analyses of rare variants, corrections based on principal components (PCs) and linear mixed models (LMMs) yield conflicting conclusions. Studies evaluating these approaches have generally focused on limited types of structure and large sample sizes. We investigated the properties of several correction methods through a large simulation study using real exome data and several within- and between-continent stratification scenarios. We considered different sample sizes, including situations with as few as 50 cases, to account for the analysis of rare disorders. In large samples, accounting for stratification was more difficult with a continental structure than with a worldwide structure. With a sample of 50 cases, type I error inflation was observed with PCs for small numbers of controls (<= 100) and with LMMs for large numbers of controls (>= 1000). We also tested a novel local permutation method (LocPerm), which maintained a correct type I error in all situations. Power was equivalent for all approaches, indicating that the key issue is proper control of type I error. Finally, we found that the power of analyses including small numbers of cases can be increased by adding a large panel of external controls, provided that an appropriate stratification correction is used.
Pages: 14
Related papers
47 in total
[1]   A global reference for human genetic variation [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Wang, Jun ;
Wilson, Richard K. ;
Boerwinkle, Eric ;
Doddapaneni, Harsha ;
Han, Yi ;
Korchina, Viktoriya ;
Kovar, Christie ;
Lee, Sandra ;
Muzny, Donna ;
Reid, Jeffrey G. ;
Zhu, Yiming ;
Chang, Yuqi ;
Feng, Qiang ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Lan, Tianming ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Liu, Shengmao ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Tang, Meifang ;
Wang, Bo .
NATURE, 2015, 526 (7571) :68-+
[2]   Data quality control in genetic case-control association studies [J].
Anderson, Carl A. ;
Pettersson, Fredrik H. ;
Clarke, Geraldine M. ;
Cardon, Lon R. ;
Morris, Andrew P. ;
Zondervan, Krina T. .
NATURE PROTOCOLS, 2010, 5 (09) :1564-1573
[3]   Rare variant association studies: considerations, challenges and opportunities [J].
Auer, Paul L. ;
Lettre, Guillaume .
GENOME MEDICINE, 2015, 7
[4]   Rare and Low Frequency Variant Stratification in the UK Population: Description and Impact on Association Tests [J].
Babron, Marie-Claude ;
de Tayrac, Marie ;
Rutledge, Douglas N. ;
Zeggini, Eleftheria ;
Genin, Emmanuelle .
PLOS ONE, 2012, 7 (10)
[5]   Statistical analysis strategies for association studies involving rare variants [J].
Bansal, Vikas ;
Libiger, Ondrej ;
Torkamani, Ali ;
Schork, Nicholas J. .
NATURE REVIEWS GENETICS, 2010, 11 (11) :773-785
[6]   Population structure analysis using rare and common functional variants [J].
Baye, Tesfaye M. ;
He, Hua ;
Ding, Lili ;
Kurowski, Brad G. ;
Zhang, Xue ;
Martin, Lisa J. .
BMC Proceedings, 5 (Suppl 9)
[7]   Whole-genome sequencing is more powerful than whole-exome sequencing for detecting exome variants [J].
Belkadi, Aziz ;
Bolze, Alexandre ;
Itan, Yuval ;
Cobat, Aurelie ;
Vincent, Quentin B. ;
Antipenko, Alexander ;
Shang, Lei ;
Boisson, Bertrand ;
Casanova, Jean-Laurent ;
Abel, Laurent .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2015, 112 (17) :5473-5478
[8]   The conditional permutation test for independence while controlling for confounders [J].
Berrett, Thomas B. ;
Wang, Yi ;
Barber, Rina Foygel ;
Samworth, Richard J. .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2020, 82 (01) :175-197
[9]   Tuberculosis and impaired IL-23-dependent IFN-γ immunity in humans homozygous for a common TYK2 missense variant [J].
Boisson-Dupuis, Stephanie ;
Ramirez-Alejo, Noe ;
Li, Zhi ;
Patin, Etienne ;
Rao, Geetha ;
Kerner, Gaspard ;
Lim, Che Kang ;
Krementsov, Dimitry N. ;
Hernandez, Nicholas ;
Ma, Cindy S. ;
Zhang, Qian ;
Markle, Janet ;
Martinez-Barricarte, Ruben ;
Payne, Kathryn ;
Fisch, Robert ;
Deswarte, Caroline ;
Halpern, Joshua ;
Bouaziz, Matthieu ;
Mulwa, Jeanette ;
Sivanesan, Durga ;
Lazarov, Tomi ;
Naves, Rodrigo ;
Garcia, Patricia ;
Itan, Yuval ;
Boisson, Bertrand ;
Checchi, Alix ;
Jabot-Hanin, Fabienne ;
Cobat, Aurelie ;
Guennoun, Andrea ;
Jackson, Carolyn C. ;
Pekcan, Sevgi ;
Caliskaner, Zafer ;
Inostroza, Jaime ;
Costa-Carvalho, Beatriz Tavares ;
Tavares de Albuquerque, Jose Antonio ;
Garcia-Ortiz, Humberto ;
Orozco, Lorena ;
Ozcelik, Tayfun ;
Abid, Ahmed ;
Rhorfi, Ismail Abderahmani ;
Souhi, Hicham ;
Amrani, Hicham Naji ;
Zegmout, Adil ;
Geissmann, Frederic ;
Michnick, Stephen W. ;
Muller-Fleckenstein, Ingrid ;
Fleckenstein, Bernhard ;
Puel, Anne ;
Ciancanelli, Michael J. ;
Marr, Nico .
SCIENCE IMMUNOLOGY, 2018, 3 (30)
[10]   Panning for gold: 'model-X' knockoffs for high dimensional controlled variable selection [J].
Candes, Emmanuel ;
Fan, Yingying ;
Janson, Lucas ;
Lv, Jinchi .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2018, 80 (03) :551-577