A Hidden Markov Model Approach for Simultaneously Estimating Local Ancestry and Admixture Time Using Next Generation Sequence Data in Samples of Arbitrary Ploidy

被引:83
作者
Corbett-Detig, Russell [1 ,2 ,3 ]
Nielsen, Rasmus [3 ,4 ]
机构
[1] UC Santa Cruz, Genom Inst, Santa Cruz, CA 95064 USA
[2] UC Santa Cruz, Dept Biomol Engn, Santa Cruz, CA 95064 USA
[3] Univ Calif Berkeley, Dept Integrat Biol, Berkeley, CA 94720 USA
[4] Univ Copenhagen, Nat Hist Museum Denmark, Copenhagen, Denmark
来源
PLOS GENETICS | 2017年 / 13卷 / 01期
关键词
INCIPIENT SEXUAL ISOLATION; GENOME-WIDE PATTERNS; DROSOPHILA-MELANOGASTER; ALCOHOL-DEHYDROGENASE; POPULATIONS; INFERENCE; SPECIATION; EVOLUTION; AFRICAN; HISTORY;
D O I
10.1371/journal.pgen.1006529
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Admixture-the mixing of genomes from divergent populations-is increasingly appreciated as a central process in evolution. To characterize and quantify patterns of admixture across the genome, a number of methods have been developed for local ancestry inference. However, existing approaches have a number of shortcomings. First, all local ancestry inference methods require some prior assumption about the expected ancestry tract lengths. Second, existing methods generally require genotypes, which is not feasible to obtain for many next-generation sequencing projects. Third, many methods assume samples are diploid, however a wide variety of sequencing applications will fail to meet this assumption. To address these issues, we introduce a novel hidden Markov model for estimating local ancestry that models the read pileup data, rather than genotypes, is generalized to arbitrary ploidy, and can estimate the time since admixture during local ancestry inference. We demonstrate that our method can simultaneously estimate the time since admixture and local ancestry with good accuracy, and that it performs well on samples of high ploidy-i.e. 100 or more chromosomes. As this method is very general, we expect it will be useful for local ancestry inference in a wider variety of populations than what previously has been possible. We then applied our method to pooled sequencing data derived from populations of Drosophila melanogaster on an ancestry cline on the east coast of North America. We find that regions of local recombination rates are negatively correlated with the proportion of African ancestry, suggesting that selection against foreign ancestry is the least efficient in low recombination regions. Finally we show that clinal outlier loci are enriched for genes associated with gene regulatory functions, consistent with a role of regulatory evolution in ecological adaptation of admixed D. melanogaster populations. Our results illustrate the potential of local ancestry inference for elucidating fundamental evolutionary processes.
引用
收藏
页数:40
相关论文
共 97 条
[1]   The genome sequence of Drosophila melanogaster [J].
Adams, MD ;
Celniker, SE ;
Holt, RA ;
Evans, CA ;
Gocayne, JD ;
Amanatides, PG ;
Scherer, SE ;
Li, PW ;
Hoskins, RA ;
Galle, RF ;
George, RA ;
Lewis, SE ;
Richards, S ;
Ashburner, M ;
Henderson, SN ;
Sutton, GG ;
Wortman, JR ;
Yandell, MD ;
Zhang, Q ;
Chen, LX ;
Brandon, RC ;
Rogers, YHC ;
Blazej, RG ;
Champe, M ;
Pfeiffer, BD ;
Wan, KH ;
Doyle, C ;
Baxter, EG ;
Helt, G ;
Nelson, CR ;
Miklos, GLG ;
Abril, JF ;
Agbayani, A ;
An, HJ ;
Andrews-Pfannkoch, C ;
Baldwin, D ;
Ballew, RM ;
Basu, A ;
Baxendale, J ;
Bayraktaroglu, L ;
Beasley, EM ;
Beeson, KY ;
Benos, PV ;
Berman, BP ;
Bhandari, D ;
Bolshakov, S ;
Borkova, D ;
Botchan, MR ;
Bouck, J ;
Brokstein, P .
SCIENCE, 2000, 287 (5461) :2185-2195
[2]  
[Anonymous], BIOINFORMATICS
[3]  
[Anonymous], MOL BIOL EVOL
[4]  
[Anonymous], NATURE
[5]   Chromosomal inversion polymorphism in Afrotropical populations of Drosophila melanogaster [J].
Aulard, S ;
David, JR ;
Lemeunier, F .
GENETICS RESEARCH, 2002, 79 (01) :49-63
[6]   The distribution of surviving blocks of an ancestral genome [J].
Baird, SJE ;
Barton, NH ;
Etheridge, AM .
THEORETICAL POPULATION BIOLOGY, 2003, 64 (04) :451-471
[7]   Fast and accurate inference of local ancestry in Latino populations [J].
Baran, Yael ;
Pasaniuc, Bogdan ;
Sankararaman, Sriram ;
Torgerson, Dara G. ;
Gignoux, Christopher ;
Eng, Celeste ;
Rodriguez-Cintron, William ;
Chapela, Rocio ;
Ford, Jean G. ;
Avila, Pedro C. ;
Rodriguez-Santana, Jose ;
Burchard, Esteban Gonzalez ;
Halperin, Eran .
BIOINFORMATICS, 2012, 28 (10) :1359-1367
[8]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[9]   Secondary contact and local adaptation contribute to genome-wide patterns of clinal variation in Drosophila melanogaster [J].
Bergland, Alan O. ;
Tobler, Ray ;
Gonzalez, Josefa ;
Schmidt, Paul ;
Petrov, Dmitri .
MOLECULAR ECOLOGY, 2016, 25 (05) :1157-1174
[10]   Genomic Evidence of Rapid and Stable Adaptive Oscillations over Seasonal Time Scales in Drosophila [J].
Bergland, Alan O. ;
Behrman, Emily L. ;
O'Brien, Katherine R. ;
Schmidt, Paul S. ;
Petrov, Dmitri A. .
PLOS GENETICS, 2014, 10 (11)