When Trees Grow Too Long: Investigating the Causes of Highly Inaccurate Bayesian Branch-Length Estimates

被引:194
作者
Brown, Jeremy M. [1 ,2 ]
Hedtke, Shannon M. [1 ,2 ]
Lemmon, Alan R. [3 ]
Lemmon, Emily Moriarty [3 ]
机构
[1] Univ Texas Austin, Sect Integrat Biol, Austin, TX 78712 USA
[2] Univ Texas Austin, Ctr Computat Biol & Bioinformat, Austin, TX 78712 USA
[3] Univ Calif Davis, Sect Evolut & Ecol, Davis, CA 95616 USA
基金
美国国家科学基金会;
关键词
Bayesian; branch length; Markov chain Monte Carlo; parameter space; phylogeny; posterior; prior; SPECIES LIMITS; DIVERGENCE; PHYLOGENY;
D O I
10.1093/sysbio/syp081
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
A surprising number of recent Bayesian phylogenetic analyses contain branch-length estimates that are several orders of magnitude longer than corresponding maximum-likelihood estimates. The levels of divergence implied by such branch lengths are unreasonable for studies using biological data and are known to be false for studies using simulated data. We conducted additional Bayesian analyses and studied approximate-posterior surfaces to investigate the causes underlying these large errors. We manipulated the starting parameter values of the Markov chain Monte Carlo (MCMC) analyses, the moves used by the MCMC analyses, and the prior-probability distribution on branch lengths. We demonstrate that inaccurate branch-length estimates result from either 1) poor mixing of MCMC chains or 2) posterior distributions with excessive weight at long tree lengths. Both effects are caused by a rapid increase in the volume of branch-length space as branches become longer. In the former case, both an MCMC move that scales all branch lengths in the tree simultaneously and the use of overdispersed starting branch lengths allow the chain to accurately sample the posterior distribution and should be used in Bayesian analyses of phylogeny. In the latter case, branch-length priors can have strong effects on resulting inferences and should be carefully chosen to reflect biological expectations. We provide a formula to calculate an exponential rate parameter for the branch-length prior that should eliminate inference of biased branch lengths in many cases. In any phylogenetic analysis, the biological plausibility of branch-length output must be carefully considered.
引用
收藏
页码:145 / 161
页数:17
相关论文
共 28 条
  • [1] The importance of data partitioning and the utility of bayes factors in Bayesian phylogenetics
    Brown, Jeremy M.
    Lemmon, Alan R.
    [J]. SYSTEMATIC BIOLOGY, 2007, 56 (04) : 643 - 655
  • [2] Species limits and phylogeography of North American cricket frogs (Acris: Hylidae)
    Gamble, Tony
    Berendzen, Peter B.
    Shaffer, H. Bradley
    Starkey, David E.
    Simons, Andrew M.
    [J]. MOLECULAR PHYLOGENETICS AND EVOLUTION, 2008, 48 (01) : 112 - 125
  • [3] Gelman A., 1995, Bayesian data analysis
  • [4] GEYER CJ, 1991, COMPUTING SCIENCE AND STATISTICS, P156
  • [5] All-male asexuality:: Origin and maintenance of androgenesis in the asian clam corbicula
    Hedtke, Shannon M.
    Stanger-Hall, Kathrin
    Baker, Robert J.
    Hillis, David M.
    [J]. EVOLUTION, 2008, 62 (05) : 1119 - 1136
  • [6] JEFFREYS H, 1939, P R SOC A, V186, P453
  • [7] Larget B, 2005, STAT BIOL HEALTH, P45, DOI 10.1007/0-387-27733-1_3
  • [8] Phylogeny, divergence times and species limits of spiny lizards (Sceloporus magister species group) in western North American deserts and Baja California
    Leache, Adam D.
    Mulcahy, Daniel G.
    [J]. MOLECULAR ECOLOGY, 2007, 16 (24) : 5216 - 5233
  • [9] Phylogeny-based delimitation of species boundaries and contact zones in the trilling chorus frogs (Pseudacris)
    Lemmon, Emily Moriarty
    Lemmon, Alan R.
    Collins, Joseph T.
    Lee-Yaw, Julie A.
    Cannatella, David C.
    [J]. MOLECULAR PHYLOGENETICS AND EVOLUTION, 2007, 44 (03) : 1068 - 1082
  • [10] Geological and climatic forces driving speciation in the continentally distributed trilling chorus frogs (pseudacris)
    Lemmon, Emily Moriarty
    Lemmon, Alan R.
    Cannatella, David C.
    [J]. EVOLUTION, 2007, 61 (09) : 2086 - 2103