Optimal data partitioning and a test case for ray-finned fishes (Actinopterygii) based on ten nuclear loci

被引:173
作者
Li, Chenhong [1 ]
Lu, Guoqing [2 ]
Orti, Guillermo [1 ]
机构
[1] Univ Nebraska, Sch Biol Sci, Lincoln, NE 68588 USA
[2] Univ Nebraska, Dept Biol, Omaha, NE 68182 USA
基金
美国国家科学基金会;
关键词
cluster analysis; data partitioning; Holostei; nuclear loci; phylogenetics; ray-finned fish; Actinopterygii;
D O I
10.1080/10635150802206883
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Data partitioning, the combined phylogenetic analysis of homogeneous blocks of data, is a common strategy used to accommodate heterogeneities in complex multilocus data sets. Variation in evolutionary rates and substitution patterns among sites are typically addressed by partitioning data by gene, codon position, or both. Excessive partitioning of the data, however, could lead to overparameterization; therefore, it seems critical to define the minimum numbers of partitions necessary to improve the overall fit of the model. We propose a new method, based on cluster analysis, to find an optimal partitioning strategy for multilocus protein-coding data sets. A heuristic exploration of alternative partitioning schemes, based on Bayesian and maximum likelihood (ML) criteria, is shown here to produce an optimal number of partitions. We tested this method using sequence data of 10 nuclear genes collected from 52 ray-finned fish (Actinopterygii) and four tetrapods. The concatenated sequences included 7995 nucleotide sites maximally split into 30 partitions defined a priori based on gene and codon position. Our results show that a model based on only 10 partitions defined by cluster analysis performed better than partitioning by both gene and codon position. Alternative data partitioning schemes also are shown to affect the topologies resulting from phylogenetic analysis, especially when Bayesian methods are used, suggesting that overpartitioning may be of major concern. The phylogenetic relationships among the major clades of ray-finned fish were assessed using the best data-partitioning schemes under ML and Bayesian methods. Some significant results include the monophyly of Holostei (Amia and Lepisosteus), the sister-group relationships between (1) esociforms and salmoniforms and (2) osmeriforms and stomiiforms, the polyphyly of Perciformes, and a close relationship of cichlids and atherinomorphs.
引用
收藏
页码:519 / 539
页数:21
相关论文
共 121 条
[11]   Exploring among-site rate variation models in a maximum likelihood framework using empirical data: Effects of model assumptions on estimates of topology, branch lengths, and bootstrap support [J].
Buckley, TR ;
Simon, C ;
Chambers, GK .
SYSTEMATIC BIOLOGY, 2001, 50 (01) :67-86
[12]   PARTITIONING AND COMBINING DATA IN PHYLOGENETIC ANALYSIS [J].
BULL, JJ ;
HUELSENBECK, JP ;
CUNNINGHAM, CW ;
SWOFFORD, DL ;
WADDELL, PJ .
SYSTEMATIC BIOLOGY, 1993, 42 (03) :384-397
[13]  
Burnham K.P., 2002, Model selection and multimodel inference: a practical information-theoretic approach, DOI 10.1007/978-1-4757-2917-7_3
[14]   Bayesian mixed models and the phylogeny of pitvipers (Viperidae: Serpentes) [J].
Castoe, TA ;
Parkinson, CL .
MOLECULAR PHYLOGENETICS AND EVOLUTION, 2006, 39 (01) :91-110
[15]   Data partitions and complex models in Bayesian analysis: The phylogeny of Gymnophthalmid lizards [J].
Castoe, TA ;
Doan, TM ;
Parkinson, CL .
SYSTEMATIC BIOLOGY, 2004, 53 (03) :448-469
[16]   A partitioned likelihood analysis of swallowtail butterfly phylogeny (Lepidoptera: papilionidae) [J].
Caterino, MS ;
Reed, RD ;
Kuo, MM ;
Sperling, FAH .
SYSTEMATIC BIOLOGY, 2001, 50 (01) :106-127
[17]   Novel evolutionary relationship among four fish model systems [J].
Chen, WJ ;
Ortí, G ;
Meyer, A .
TRENDS IN GENETICS, 2004, 20 (09) :424-431
[18]   Repeatability of clades as a criterion of reliability: a case study for molecular phylogeny of Acanthomorpha (Teleostei) with larger number of taxa [J].
Chen, WJ ;
Bonillo, C ;
Lecointre, G .
MOLECULAR PHYLOGENETICS AND EVOLUTION, 2003, 26 (02) :262-288
[19]   Bichir HoxA cluster sequence reveals surprising trends in ray-finned fish genomic evolution [J].
Chiu, CH ;
Dewar, K ;
Wagner, GP ;
Takahashi, K ;
Ruddle, F ;
Ledje, C ;
Bartsch, P ;
Scemama, JL ;
Stellwag, E ;
Fried, C ;
Prohaska, SJ ;
Stadler, PF ;
Amemiya, CT .
GENOME RESEARCH, 2004, 14 (01) :11-17
[20]  
Cloutier R., 2004, RECENT ADV ORIGIN EA, P217