Bayesian phylogenetic analysis of combined data

被引:1586
作者
Nylander, JAA
Ronquist, F
Huelsenbeck, JP
Nieves-Aldrey, JL
机构
[1] Uppsala Univ, Evolutionary Biol Ctr, Dept Systemat Zool, SE-75236 Uppsala, Sweden
[2] Univ Calif San Diego, Div Biol Sci, Sect Ecol Behav & Evolut, La Jolla, CA 92093 USA
[3] CSIC, Museo Nacl Ciencias Nat, Dept Bioversidad & Biol Evolut, E-28006 Madrid, Spain
关键词
Bayes factors; Bayesian analysis; combined data; Cynipidae; gall wasps; MCMC; model heterogeneity; model selection;
D O I
10.1080/10635150490264699
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The recent development of Bayesian phylogenetic inference using Markov chain Monte Carlo( MCMC) techniques has facilitated the exploration of parameter-rich evolutionary models. At the same time, stochastic models have become more realistic ( and complex) and have been extended to new types of data, such as morphology. Based on this foundation, we developed a Bayesian MCMC approach to the analysis of combined data sets and explored its utility in inferring relationships among gall wasps based on data from morphology and four genes ( nuclear and mitochondrial, ribosomal and protein coding). Examined models range in complexity from those recognizing only a morphological and a molecular partition to those having complex substitution models with independent parameters for each gene. Bayesian MCMC analysis deals efficiently with complex models: convergence occurs faster and more predictably for complex models, mixing is adequate for all parameters even under very complex models, and the parameter update cycle is virtually unaffected by model partitioning across sites. Morphology contributed only 5% of the characters in the data set but nevertheless influenced the combined-data tree, supporting the utility of morphological data in multigene analyses. We used Bayesian criteria ( Bayes factors) to show that process heterogeneity across data partitions is a significant model component, although not as important as among-site rate variation. More complex evolutionary models are associated with more topological uncertainty and less conflict between morphology and molecules. Bayes factors sometimes favor simpler models over considerably more parameter-rich models, but the best model overall is also the most complex and Bayes factors do not support exclusion of apparently weak parameters from this model. Thus, Bayes factors appear to be useful for selecting among complex models, but it is still unclear whether their use strikes a reasonable balance between model complexity and error in parameter estimates.
引用
收藏
页码:47 / 67
页数:21
相关论文
共 67 条
[1]  
AITKIN M, 1991, J ROY STAT SOC B MET, V53, P111
[2]  
Akaike H., 1973, 2 INT S INFORM THEOR, P267, DOI [DOI 10.1007/978-1-4612-1694-0_15, 10.1007/978-1-4612-1694-0_15]
[3]   Effects of models of rate evolution on estimation of divergence dates with special reference to the metazoan 18S ribosomal RNA Phylogeny [J].
Aris-Brosou, S ;
Yang, ZH .
SYSTEMATIC BIOLOGY, 2002, 51 (05) :703-714
[4]  
Askew R.R., 1984, P223
[5]   The utility of the incongruence length difference test [J].
Barker, FK ;
Lutzoni, FM .
SYSTEMATIC BIOLOGY, 2002, 51 (04) :625-637
[6]  
BARTLETT MS, 1957, BIOMETRIKA, V44, P187
[7]   Bayesian model adequacy and choice in phylogenetics [J].
Bollback, JP .
MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (07) :1171-1180
[8]   Combined data, Bayesian phylogenetics, and the origin of the New Zealand cicada genera [J].
Buckley, TR ;
Arensburger, P ;
Simon, C ;
Chambers, GK .
SYSTEMATIC BIOLOGY, 2002, 51 (01) :4-18
[9]  
Burnham K. P., 2002, MODEL SELECTION MULT
[10]   Can three incongruence tests predict when data should be combined? [J].
Cunningham, CW .
MOLECULAR BIOLOGY AND EVOLUTION, 1997, 14 (07) :733-740