A Hidden Markov Model Approach for Simultaneously Estimating Local Ancestry and Admixture Time Using Next Generation Sequence Data in Samples of Arbitrary Ploidy

Cited by: 83
Authors
Corbett-Detig, Russell [1 ,2 ,3 ]
Nielsen, Rasmus [3 ,4 ]
Affiliations
[1] UC Santa Cruz, Genom Inst, Santa Cruz, CA 95064 USA
[2] UC Santa Cruz, Dept Biomol Engn, Santa Cruz, CA 95064 USA
[3] Univ Calif Berkeley, Dept Integrat Biol, Berkeley, CA 94720 USA
[4] Univ Copenhagen, Nat Hist Museum Denmark, Copenhagen, Denmark
Source
PLOS GENETICS | 2017 / Vol. 13 / Issue 01
Keywords
INCIPIENT SEXUAL ISOLATION; GENOME-WIDE PATTERNS; DROSOPHILA-MELANOGASTER; ALCOHOL-DEHYDROGENASE; POPULATIONS; INFERENCE; SPECIATION; EVOLUTION; AFRICAN; HISTORY;
DOI
10.1371/journal.pgen.1006529
Chinese Library Classification number
Q3 [Genetics];
Discipline classification code
071007 ; 090102 ;
Abstract
Admixture, the mixing of genomes from divergent populations, is increasingly appreciated as a central process in evolution. To characterize and quantify patterns of admixture across the genome, a number of methods have been developed for local ancestry inference. However, existing approaches have a number of shortcomings. First, all local ancestry inference methods require some prior assumption about the expected ancestry tract lengths. Second, existing methods generally require genotypes, which are not feasible to obtain for many next-generation sequencing projects. Third, many methods assume samples are diploid; however, a wide variety of sequencing applications will fail to meet this assumption. To address these issues, we introduce a novel hidden Markov model for estimating local ancestry that models the read pileup data rather than genotypes, is generalized to arbitrary ploidy, and can estimate the time since admixture during local ancestry inference. We demonstrate that our method can simultaneously estimate the time since admixture and local ancestry with good accuracy, and that it performs well on samples of high ploidy, i.e., 100 or more chromosomes. As this method is very general, we expect it will be useful for local ancestry inference in a wider variety of populations than has previously been possible. We then applied our method to pooled sequencing data derived from populations of Drosophila melanogaster on an ancestry cline on the east coast of North America. We find that local recombination rates are negatively correlated with the proportion of African ancestry, suggesting that selection against foreign ancestry is the least efficient in low recombination regions. Finally, we show that clinal outlier loci are enriched for genes associated with gene regulatory functions, consistent with a role of regulatory evolution in ecological adaptation of admixed D. melanogaster populations. Our results illustrate the potential of local ancestry inference for elucidating fundamental evolutionary processes.
Pages: 40