Enhanced Localization of Genetic Samples through Linkage-Disequilibrium Correction

被引:20
作者
Baran, Yael [1 ]
Quintela, Ines [2 ]
Carracedo, Angel [2 ,3 ]
Pasaniuc, Bogdan [4 ,5 ]
Haperin, Eran [1 ,6 ,7 ]
机构
[1] Tel Aviv Univ, Blavatnik Sch Comp Sci, IL-69978 Tel Aviv, Israel
[2] Univ Santiago de Compostela, Ctr Nacl Genotipado, Inst Carlos III, Ctr Invest Biomed Red Enfermedades Raras,Grp Med, Santiago De Compostela 15782, Spain
[3] Hosp Clin Univ, Fdn Publ Galega Med Xen, Inst Invest Sanitaria Santiago de Compostela, Serv Galego Saude, Santiago De Compostela 15782, Spain
[4] Univ Calif Los Angeles, David Geffen Sch Med, Dept Pathol & Lab Med, Los Angeles, CA 90024 USA
[5] Univ Calif Los Angeles, Jonsson Comprehens Canc Ctr, Los Angeles, CA 90024 USA
[6] Tel Aviv Univ, Dept Mol Microbiol & Biotechnol, IL-69978 Tel Aviv, Israel
[7] Int Comp Sci Inst, Berkeley, CA 94704 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
WHOLE-GENOME ASSOCIATION; ADMIXED POPULATIONS; WIDE ASSOCIATION; ANCESTRY; INDIVIDUALS; INFERENCE; STRATIFICATION; RECOMBINATION; FREQUENCIES; ADMIXTURE;
D O I
10.1016/j.ajhg.2013.04.023
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Characterizing the spatial patterns of genetic diversity in human populations has a wide range of applications, from detecting genetic mutations associated with disease to inferring human history. Current approaches, including the widely used principal-component analysis, are not suited for the analysis of linked markers, and local and long-range linkage disequilibrium (LD) can dramatically reduce the accuracy of spatial localization when unaccounted for. To overcome this, we have introduced an approach that performs spatial localization of individuals on the basis of their genetic data and explicitly models LD among markers by using a multivariate normal distribution. By leveraging external reference panels, we derive closed-form solutions to the optimization procedure to achieve a computationally efficient method that can handle large data sets. We validate the method on empirical data from a large sample of European individuals from the POPRES data set, as well as on a large sample of individuals of Spanish ancestry. First, we show that by modeling LD, we achieve accuracy superior to that of existing methods. Importantly, whereas other methods show decreased performance when dense marker panels are used in the inference, our approach improves in accuracy as more markers become available. Second, we show that accurate localization of genetic data can be achieved with only a part of the genome, and this could potentially enable the spatial localization of admixed samples that have a fraction of their genome originating from a given continent. Finally, we demonstrate that our approach is resistant to distortions resulting from long-range LD regions; such distortions can dramatically bias the results when unaccounted for.
引用
收藏
页码:882 / 894
页数:13
相关论文
共 33 条
[1]   Fast model-based estimation of ancestry in unrelated individuals [J].
Alexander, David H. ;
Novembre, John ;
Lange, Kenneth .
GENOME RESEARCH, 2009, 19 (09) :1655-1664
[2]   Fast and accurate inference of local ancestry in Latino populations [J].
Baran, Yael ;
Pasaniuc, Bogdan ;
Sankararaman, Sriram ;
Torgerson, Dara G. ;
Gignoux, Christopher ;
Eng, Celeste ;
Rodriguez-Cintron, William ;
Chapela, Rocio ;
Ford, Jean G. ;
Avila, Pedro C. ;
Rodriguez-Santana, Jose ;
Burchard, Esteban Gonzalez ;
Halperin, Eran .
BIOINFORMATICS, 2012, 28 (10) :1359-1367
[3]   PCAdmix: Principal Components-Based Assignment of Ancestry along Each Chromosome in Individuals with Admixed Ancestry from Two or More Populations [J].
Brisbin, Abra ;
Bryc, Katarzyna ;
Byrnes, Jake ;
Zakharia, Fouad ;
Omberg, Larsson ;
Degenhardt, Jeremiah ;
Reynolds, Andrew ;
Ostrer, Harry ;
Mezey, Jason G. ;
Bustamante, Carlos D. .
HUMAN BIOLOGY, 2012, 84 (04) :343-364
[4]   Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering [J].
Browning, Sharon R. ;
Browning, Brian L. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (05) :1084-1097
[5]   Genome-wide patterns of population structure and admixture among Hispanic/Latino populations [J].
Bryc, Katarzyna ;
Velez, Christopher ;
Karafet, Tatiana ;
Moreno-Estrada, Andres ;
Reynolds, Andy ;
Auton, Adam ;
Hammer, Michael ;
Bustamante, Carlos D. ;
Ostrer, Harry .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 :8954-8961
[6]   Multiway Admixture Deconvolution Using Phased or Unphased Ancestral Panels [J].
Churchhouse, Claire ;
Marchini, Jonathan .
GENETIC EPIDEMIOLOGY, 2013, 37 (01) :1-12
[7]   Analysis of Population Structure: A Unifying Framework and Novel Methods Based on Sparse Factor Analysis [J].
Engelhardt, Barbara E. ;
Stephens, Matthew .
PLOS GENETICS, 2010, 6 (09)
[8]   Demographic history and rare allele sharing among human populations [J].
Gravel, Simon ;
Henn, Brenna M. ;
Gutenkunst, Ryan N. ;
Indap, Amit R. ;
Marth, Gabor T. ;
Clark, Andrew G. ;
Yu, Fuli ;
Gibbs, Richard A. ;
Bustamante, Carlos D. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2011, 108 (29) :11983-11988
[9]   The landscape of recombination in African Americans [J].
Hinch, Anjali G. ;
Tandon, Arti ;
Patterson, Nick ;
Song, Yunli ;
Rohland, Nadin ;
Palmer, Cameron D. ;
Chen, Gary K. ;
Wang, Kai ;
Buxbaum, Sarah G. ;
Akylbekova, Ermeg L. ;
Aldrich, Melinda C. ;
Ambrosone, Christine B. ;
Amos, Christopher ;
Bandera, Elisa V. ;
Berndt, Sonja I. ;
Bernstein, Leslie ;
Blot, William J. ;
Bock, Cathryn H. ;
Boerwinkle, Eric ;
Cai, Qiuyin ;
Caporaso, Neil ;
Casey, Graham ;
Cupples, L. Adrienne ;
Deming, Sandra L. ;
Diver, W. Ryan ;
Divers, Jasmin ;
Fornage, Myriam ;
Gillanders, Elizabeth M. ;
Glessner, Joseph ;
Harris, Curtis C. ;
Hu, Jennifer J. ;
Ingles, Sue A. ;
Isaacs, William ;
John, Esther M. ;
Kao, W. H. Linda ;
Keating, Brendan ;
Kittles, Rick A. ;
Kolonel, Laurence N. ;
Larkin, Emma ;
Le Marchand, Loic ;
McNeill, Lorna H. ;
Millikan, Robert C. ;
Murphy, Adam ;
Musani, Solomon ;
Neslund-Dudas, Christine ;
Nyante, Sarah ;
Papanicolaou, George J. ;
Press, Michael F. ;
Psaty, Bruce M. ;
Reiner, Alex P. .
NATURE, 2011, 476 (7359) :170-U67
[10]   Patterns of Ancestry, Signatures of Natural Selection, and Genetic Association with Stature in Western African Pygmies [J].
Jarvis, Joseph P. ;
Scheinfeldt, Laura B. ;
Soi, Sameer ;
Lambert, Charla ;
Omberg, Larsson ;
Ferwerda, Bart ;
Froment, Alain ;
Bodo, Jean-Marie ;
Beggs, William ;
Hoffman, Gabriel ;
Mezey, Jason ;
Tishkoff, Sarah A. .
PLOS GENETICS, 2012, 8 (04) :299-313