Ancestral Spectrum Analysis With Population-Specific Variants

被引:3
作者
Shi, Gang [1 ]
Kuang, Qingmin [1 ]
机构
[1] Xidian Univ, State Key Lab Integrated Serv Networks, Xian, Peoples R China
关键词
admixture; population-specific SNP; rare variants; best linear unbiased estimator; ancestry inference; GENOME-WIDE PATTERNS; STRATIFICATION; INFERENCE; ADMIXTURE;
D O I
10.3389/fgene.2021.724638
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
With the advance of sequencing technology, an increasing number of populations have been sequenced to study the histories of worldwide populations, including their divergence, admixtures, migration, and effective sizes. The variants detected in sequencing studies are largely rare and mostly population specific. Population-specific variants are often recent mutations and are informative for revealing substructures and admixtures in populations; however, computational methods and tools to analyze them are still lacking. In this work, we propose using reference populations and single nucleotide polymorphisms (SNPs) specific to the reference populations. Ancestral information, the best linear unbiased estimator (BLUE) of the ancestral proportion, is proposed, which can be used to infer ancestral proportions in recently admixed target populations and measure the extent to which reference populations serve as good proxies for the admixing sources. Based on the same panel of SNPs, the ancestral information is comparable across samples from different studies and is not affected by genetic outliers, related samples, or the sample sizes of the admixed target populations. In addition, ancestral spectrum is useful for detecting genetic outliers or exploring co-ancestry between study samples and the reference populations. The methods are implemented in a program, Ancestral Spectrum Analyzer (ASA), and are applied in analyzing high-coverage sequencing data from the 1000 Genomes Project and the Human Genome Diversity Project (HGDP). In the analyses of American populations from the 1000 Genomes Project, we demonstrate that recent admixtures can be dissected from ancient admixtures by comparing ancestral spectra with and without indigenous Americans being included in the reference populations.</p>
引用
收藏
页数:13
相关论文
共 31 条
[11]   Fast Principal-Component Analysis Reveals Convergent Evolution of ADH1B in Europe and East Asia [J].
Galinsky, Kevin J. ;
Bhatia, Gaurav ;
Loh, Po-Ru ;
Georgiev, Stoyan ;
Mukherjee, Sayan ;
Patterson, Nick J. ;
Price, Alkes L. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2016, 98 (03) :456-472
[12]   A Genetic Atlas of Human Admixture History [J].
Hellenthal, Garrett ;
Busby, George B. J. ;
Band, Gavin ;
Wilson, James F. ;
Capelli, Cristian ;
Falush, Daniel ;
Myers, Simon .
SCIENCE, 2014, 343 (6172) :747-751
[13]   Inferring weak population structure with the assistance of sample group information [J].
Hubisz, Melissa J. ;
Falush, Daniel ;
Stephens, Matthew ;
Pritchard, Jonathan K. .
MOLECULAR ECOLOGY RESOURCES, 2009, 9 (05) :1322-1332
[14]  
Johnson R.A., 2007, Applied Multivariate Statistical Analysis, V6th
[15]   A tutorial on how not to over-interpret STRUCTURE and ADMIXTURE bar plots [J].
Lawson, Daniel J. ;
Van Dorp, Lucy ;
Falush, Daniel .
NATURE COMMUNICATIONS, 2018, 9
[16]   Worldwide human relationships inferred from genome-wide patterns of variation [J].
Li, Jun Z. ;
Absher, Devin M. ;
Tang, Hua ;
Southwick, Audrey M. ;
Casto, Amanda M. ;
Ramachandran, Sohini ;
Cann, Howard M. ;
Barsh, Gregory S. ;
Feldman, Marcus ;
Cavalli-Sforza, Luigi L. ;
Myers, Richard M. .
SCIENCE, 2008, 319 (5866) :1100-1104
[17]   Theoretical Formulation of Principal Components Analysis to Detect and Correct for Population Stratification [J].
Ma, Jianzhong ;
Amos, Christopher I. .
PLOS ONE, 2010, 5 (09) :1-14
[18]   The Simons Genome Diversity Project: 300 genomes from 142 diverse populations [J].
Mallick, Swapan ;
Li, Heng ;
Lipson, Mark ;
Mathieson, Iain ;
Gymrek, Melissa ;
Racimo, Fernando ;
Zhao, Mengyao ;
Chennagiri, Niru ;
Nordenfelt, Susanne ;
Tandon, Arti ;
Skoglund, Pontus ;
Lazaridis, Iosif ;
Sankararaman, Sriram ;
Fu, Qiaomei ;
Rohland, Nadin ;
Renaud, Gabriel ;
Erlich, Yaniv ;
Willems, Thomas ;
Gallo, Carla ;
Spence, Jeffrey P. ;
Song, Yun S. ;
Poletti, Giovanni ;
Balloux, Francois ;
van Driem, George ;
de Knijff, Peter ;
Romero, Irene Gallego ;
Jha, Aashish R. ;
Behar, Doron M. ;
Bravi, Claudio M. ;
Capelli, Cristian ;
Hervig, Tor ;
Moreno-Estrada, Andres ;
Posukh, Olga L. ;
Balanovska, Elena ;
Balanovsky, Oleg ;
Karachanak-Yankova, Sena ;
Sahakyan, Hovhannes ;
Toncheva, Draga ;
Yepiskoposyan, Levon ;
Tyler-Smith, Chris ;
Xue, Yali ;
Abdullah, M. Syafiq ;
Ruiz-Linares, Andres ;
Beall, Cynthia M. ;
Di Rienzo, Anna ;
Jeong, Choongwon ;
Starikovskaya, Elena B. ;
Metspalu, Ene ;
Parik, Juri ;
Villems, Richard .
NATURE, 2016, 538 (7624) :201-+
[19]   Genetic Consequences of the Transatlantic Slave Trade in the Americas [J].
Micheletti, Steven J. ;
Bryc, Kasia ;
Esselmann, Samantha G. Ancona ;
Freyman, William A. ;
Moreno, Meghan E. ;
Poznik, G. David ;
Shastri, Anjali J. ;
Beleza, Sandra ;
Mountain, Joanna L. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2020, 107 (02) :265-277
[20]   Tracing the peopling of the world through genomics [J].
Nielsen, Rasmus ;
Akey, Joshua M. ;
Jakobsson, Mattias ;
Pritchard, Jonathan K. ;
Tishkoff, Sarah ;
Willerslev, Eske .
NATURE, 2017, 541 (7637) :302-310