Ancestral Spectrum Analysis With Population-Specific Variants

被引:3
作者
Shi, Gang [1 ]
Kuang, Qingmin [1 ]
机构
[1] Xidian Univ, State Key Lab Integrated Serv Networks, Xian, Peoples R China
关键词
admixture; population-specific SNP; rare variants; best linear unbiased estimator; ancestry inference; GENOME-WIDE PATTERNS; STRATIFICATION; INFERENCE; ADMIXTURE;
D O I
10.3389/fgene.2021.724638
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
With the advance of sequencing technology, an increasing number of populations have been sequenced to study the histories of worldwide populations, including their divergence, admixtures, migration, and effective sizes. The variants detected in sequencing studies are largely rare and mostly population specific. Population-specific variants are often recent mutations and are informative for revealing substructures and admixtures in populations; however, computational methods and tools to analyze them are still lacking. In this work, we propose using reference populations and single nucleotide polymorphisms (SNPs) specific to the reference populations. Ancestral information, the best linear unbiased estimator (BLUE) of the ancestral proportion, is proposed, which can be used to infer ancestral proportions in recently admixed target populations and measure the extent to which reference populations serve as good proxies for the admixing sources. Based on the same panel of SNPs, the ancestral information is comparable across samples from different studies and is not affected by genetic outliers, related samples, or the sample sizes of the admixed target populations. In addition, ancestral spectrum is useful for detecting genetic outliers or exploring co-ancestry between study samples and the reference populations. The methods are implemented in a program, Ancestral Spectrum Analyzer (ASA), and are applied in analyzing high-coverage sequencing data from the 1000 Genomes Project and the Human Genome Diversity Project (HGDP). In the analyses of American populations from the 1000 Genomes Project, we demonstrate that recent admixtures can be dissected from ancient admixtures by comparing ancestral spectra with and without indigenous Americans being included in the reference populations.</p>
引用
收藏
页数:13
相关论文
共 31 条
[1]   Enhancements to the ADMIXTURE algorithm for individual ancestry estimation [J].
Alexander, David H. ;
Lange, Kenneth .
BMC BIOINFORMATICS, 2011, 12
[2]   Fast model-based estimation of ancestry in unrelated individuals [J].
Alexander, David H. ;
Novembre, John ;
Lange, Kenneth .
GENOME RESEARCH, 2009, 19 (09) :1655-1664
[3]  
[Anonymous], 2018, J OPEN SOURCE SOFTW, DOI DOI 10.21105/JOSS.00861
[4]   Insights into human genetic variation and population history from 929 diverse genomes [J].
Bergstrom, Anders ;
McCarthy, Shane A. ;
Hui, Ruoyun ;
Almarri, Mohamed A. ;
Ayub, Qasim ;
Danecek, Petr ;
Chen, Yuan ;
Felkel, Sabine ;
Hallast, Pille ;
Kamm, Jack ;
Blanche, Helene ;
Deleuze, Jean-Francois ;
Cann, Howard ;
Mallick, Swapan ;
Reich, David ;
Sandhu, Manjinder S. ;
Skoglund, Pontus ;
Scally, Aylwyn ;
Xue, Yali ;
Durbin, Richard ;
Tyler-Smith, Chris .
SCIENCE, 2020, 367 (6484) :1339-+
[5]  
Brooks LD, 2015, Nature, V526, P68, DOI DOI 10.1038/NATURE15393
[6]   Genome-wide patterns of population structure and admixture in West Africans and African Americans [J].
Bryc, Katarzyna ;
Auton, Adam ;
Nelson, Matthew R. ;
Oksenberg, Jorge R. ;
Hauser, Stephen L. ;
Williams, Scott ;
Froment, Alain ;
Bodo, Jean-Marie ;
Wambebe, Charles ;
Tishkoff, Sarah A. ;
Bustamante, Carlos D. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (02) :786-791
[7]  
Byrska-Bishop M., 2021, BIORXIV, DOI DOI 10.1101/2021.02.06.430068
[8]   Second-generation PLINK: rising to the challenge of larger and richer datasets [J].
Chang, Christopher C. ;
Chow, Carson C. ;
Tellier, Laurent C. A. M. ;
Vattikuti, Shashaank ;
Purcell, Shaun M. ;
Lee, James J. .
GIGASCIENCE, 2015, 4
[9]   UMAP reveals cryptic population structure and phenotype heterogeneity in large genomic cohorts [J].
Diaz-Papkovich, Alex ;
Anderson-Trocme, Luke ;
Ben-Eghan, Chief ;
Gravel, Simon .
PLOS GENETICS, 2019, 15 (11)
[10]  
Falush D, 2003, GENETICS, V164, P1567