A Dirichlet Process Prior for Estimating Lineage-Specific Substitution Rates

被引:61
作者
Heath, Tracy A. [1 ,2 ]
Holder, Mark T. [1 ]
Huelsenbeck, John P. [2 ]
机构
[1] Univ Kansas, Dept Ecol & Evolutionary Biol, Lawrence, KS 66045 USA
[2] Univ Calif Berkeley, Dept Integrat Biol, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
divergence time estimation; relaxed clock; phylogenetics; Bayesian estimation; Markov chain Monte Carlo; Dirichlet process prior; mixed model; simulation; ESTIMATING DIVERGENCE TIMES; DNA-SEQUENCES; MOLECULAR CLOCK; LIKELIHOOD-ESTIMATION; BAYESIAN-ESTIMATION; EVOLUTIONARY TREES; SAMPLING METHODS; MIXTURE MODEL; INFERENCE; SELECTION;
D O I
10.1093/molbev/msr255
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We introduce a new model for relaxing the assumption of a strict molecular clock for use as a prior in Bayesian methods for divergence time estimation. Lineage-specific rates of substitution are modeled using a Dirichlet process prior (DPP), a type of stochastic process that assumes lineages of a phylogenetic tree are distributed into distinct rate classes. Under the Dirichlet process, the number of rate classes, assignment of branches to rate classes, and the rate value associated with each class are treated as random variables. The performance of this model was evaluated by conducting analyses on data sets simulated under a range of different models. We compared the Dirichlet process model with two alternative models for rate variation: the strict molecular clock and the independent rates model. Our results show that divergence time estimation under the DPP provides robust estimates of node ages and branch rates without significantly reducing power. Further analyses were conducted on a biological data set, and we provide examples of ways to summarize Markov chain Monte Carlo samples under this model.
引用
收藏
页码:939 / 955
页数:17
相关论文
共 69 条
[1]  
Ané C, 2007, MOL BIOL EVOL, V24, P412
[2]  
[Anonymous], 1963, Low-Density Parity-Check Codes
[3]  
[Anonymous], 2009, TRACER V1 5
[4]   MIXTURES OF DIRICHLET PROCESSES WITH APPLICATIONS TO BAYESIAN NONPARAMETRIC PROBLEMS [J].
ANTONIAK, CE .
ANNALS OF STATISTICS, 1974, 2 (06) :1152-1174
[5]   On selecting a prior for the precision parameter of Dirichlet process mixture models [J].
Dorazio, Robert M. .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2009, 139 (09) :3384-3390
[6]   Relaxed Clocks and Inferences of Heterogeneous Patterns of Nucleotide Substitution and Divergence Time Estimates across Whales and Dolphins (Mammalia: Cetacea) [J].
Dornburg, Alex ;
Brandley, Matthew C. ;
McGowen, Michael R. ;
Near, Thomas J. .
MOLECULAR BIOLOGY AND EVOLUTION, 2012, 29 (02) :721-736
[7]   BEAST: Bayesian evolutionary analysis by sampling trees [J].
Drummond, Alexei J. ;
Rambaut, Andrew .
BMC EVOLUTIONARY BIOLOGY, 2007, 7 (1)
[8]   Relaxed phylogenetics and dating with confidence [J].
Drummond, Alexei J. ;
Ho, Simon Y. W. ;
Phillips, Matthew J. ;
Rambaut, Andrew .
PLOS BIOLOGY, 2006, 4 (05) :699-710
[9]   Bayesian random local clocks, or one rate to rule them all [J].
Drummond, Alexei J. ;
Suchard, Marc A. .
BMC BIOLOGY, 2010, 8
[10]   Nonparametric Bayes local partition models for random effects [J].
Dunson, David B. .
BIOMETRIKA, 2009, 96 (02) :249-262