Population divergence time estimation using individual lineage label switching

被引:3
作者
Beerli, Peter [1 ]
Ashki, Haleh [2 ]
Mashayekhi, Somayeh [3 ]
Palczewski, Michal [4 ]
机构
[1] Florida State Univ, Dept Sci Comp, Tallahassee, FL 32306 USA
[2] Fdn Med Inc, San Diego, CA 92121 USA
[3] Kennesaw State Univ, Dept Math, Marietta, GA 30060 USA
[4] Maplebear Inc, San Francisco, CA 94105 USA
来源
G3-GENES GENOMES GENETICS | 2022年 / 12卷 / 04期
基金
美国国家科学基金会;
关键词
coalescence; gene tree; species tree; Bayesian inference; divergence time; MAXIMUM-LIKELIHOOD-ESTIMATION; MIGRATION; INFERENCE; MODEL;
D O I
10.1093/g3journal/jkac040
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Divergence time estimation from multilocus genetic data has become common in population genetics and phylogenetics. We present a new Bayesian inference method that treats the divergence time as a random variable. The divergence time is calculated from an assembly of splitting events on individual lineages in a genealogy. The time for such a splitting event is drawn from a hazard function of the truncated normal distribution. This allows easy integration into the standard coalescence framework used in programs such as Migrate. We explore the accuracy of the new inference method with simulated population splittings over a wide range of divergence time values and with a reanalysis of a dataset of 5 populations consisting of 3 present-day populations (Africans, Europeans, Asian) and 2 archaic samples (Altai and Ust'Isthim). Evaluations of simple divergence models without subsequent geneflow show high accuracy, whereas the accuracy of the results of isolation with migration models depends on the magnitude of the immigration rate. High immigration rates lead to a time of the most recent common ancestor of the sample that, looking backward in time, predates the divergence time. Even with many independent loci, accurate estimation of the divergence time with high immigration rates becomes problematic. Our comparison to other software tools reveals that our lineage-switching method, implemented in Migrate, is comparable to IMa2p. The software Migrate can run large numbers of sequence loci (>1,000) on computer clusters in parallel.
引用
收藏
页数:9
相关论文
共 26 条
  • [1] Estimating divergence times from molecular data on phylogenetic and population genetic timescales
    Arbogast, BS
    Edwards, SV
    Wakeley, J
    Beerli, P
    Slowinski, JB
    [J]. ANNUAL REVIEW OF ECOLOGY AND SYSTEMATICS, 2002, 33 : 707 - 740
  • [3] Beerli P, 1999, GENETICS, V152, P763
  • [4] Maximum likelihood estimation of a migration matrix and effective population sizes in n subpopulations by using a coalescent approach
    Beerli, P
    Felsenstein, J
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (08) : 4563 - 4568
  • [5] Unified Framework to Evaluate Panmixia and Migration Direction Among Multiple Sampling Locations
    Beerli, Peter
    Palczewski, Michal
    [J]. GENETICS, 2010, 185 (01): : 313 - U463
  • [6] BEAST 2: A Software Platform for Bayesian Evolutionary Analysis
    Bouckaert, Remco
    Heled, Joseph
    Kuehnert, Denise
    Vaughan, Tim
    Wu, Chieh-Hsi
    Xie, Dong
    Suchard, Marc A.
    Rambaut, Andrew
    Drummond, Alexei J.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2014, 10 (04)
  • [7] Edwards SV, 2000, EVOLUTION, V54, P1839
  • [8] EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH
    FELSENSTEIN, J
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) : 368 - 376
  • [9] Bayesian inference of population size history from multiple loci
    Heled, Joseph
    Drummond, Alexei J.
    [J]. BMC EVOLUTIONARY BIOLOGY, 2008, 8 (1)
  • [10] Integration within the Felsenstein equation for improved Markov chain Monte Carlo methods in population genetics
    Hey, Jody
    Nielsen, Rasmus
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (08) : 2785 - 2790