The Supertree Tool Kit

Cited by: 7
Authors
Davis K.E. [1]
Hill J. [2]
Affiliations
[1] Faculty of Biomedical and Life Sciences, Graham Kerr Building, University of Glasgow
[2] Applied Modelling and Computation Group, Earth Science and Engineering, Imperial College London
Funding
UK Natural Environment Research Council
Keywords
Source Tree; Supertree Method; Taxon List; Perl Module; Large Phylogeny
DOI
10.1186/1756-0500-3-95
Abstract
Background. Large phylogenies are crucial for many areas of biological research. One method of creating such large phylogenies is the supertree method, but creating supertrees containing thousands of taxa, and hence providing a comprehensive phylogeny, requires hundreds or even thousands of source input trees. Managing and processing these data in a systematic and error-free manner is challenging and will become even more so as supertrees contain ever-increasing numbers of taxa. Protocols for processing input source phylogenies have been proposed to ensure data quality, but no robust software implementations of these protocols as yet exist.
Findings. The aim of the Supertree Tool Kit (STK) is to aid in the collection, storage and processing of input source trees for use in supertree analysis. It is therefore invaluable when creating supertrees containing thousands of taxa and hundreds of source trees. The STK is a Perl module with executable scripts to carry out various steps in the processing protocols. To aid processing, we have added meta-data, via XML, to each tree; these meta-data contain information such as the bibliographic source of the tree and how the data were derived, for instance the character data used to carry out the original analysis. These data are essential parts of previously proposed protocols.
Conclusions. The STK is a bioinformatics tool designed to make it easier to process source phylogenies for inclusion in supertree analysis from hundreds or thousands of input source trees, whilst reducing potential errors and enabling easy sharing of such datasets. It has been successfully used to create the largest known supertree to date, containing over 5000 taxa from over 700 source phylogenies.
© 2010 Davis et al.; licensee BioMed Central Ltd.
Related papers
20 in total
[1] Sanderson M.J., Purvis A., Henze C., Phylogenetic supertrees: Assembling the trees of life, Trends in Ecology and Evolution, 13, 3, pp. 105-109, (1998)
[2] Davis K.E., Reweaving the Tapestry: A supertree of birds, PhD Thesis, (2008)
[3] Lloyd G.T., Davis K.E., Pisani D., Tarver J.E., Ruta M., Sakamoto M., Hone D.W.E., Jennings R., Benton M.J., Dinosaurs and the Cretaceous terrestrial revolution, Proceedings of the Royal Society B: Biological Sciences, 275, 1650, pp. 2483-2490, (2008)
[4] Bininda-Emonds O.R.P., Cardillo M., Jones K.E., MacPhee R.D.E., Beck R.M.D., Grenyer R., Price S.A., Vos R.A., Gittleman J.L., Purvis A., The delayed rise of present-day mammals, Nature, 446, pp. 507-512, (2007)
[5] Felsenstein J., Phylogenies and the comparative method, American Naturalist, 125, 1, pp. 1-15, (1985)
[6] Purvis A., Nee S., Harvey P., Macroevolutionary inferences from primate phylogeny, Proceedings of the Royal Society of London B, 260, pp. 329-333, (1995)
[7] Maddison D.R., Swofford D.L., Maddison W.P., NEXUS: An extensible file format for systematic information, Systematic Biology, 46, 4, pp. 590-621, (1997)
[8] Gatesy J., Baker C.R.H., Hayashi C., Inconsistencies in arguments for the supertree approach: Supermatrices versus supertrees of Crocodylia, Systematic Biology, 53, pp. 342-355, (2004)
[9] Bininda-Emonds O.R.P., Jones K.E., Price S.A., Cardillo M., Grenyer R., Purvis A., Garbage in, garbage out: Data issues in supertree construction, Phylogenetic Supertrees: Combining Information to Reveal the Tree of Life, Volume 3 of Computational Biology, pp. 267-280, (2004)
[10] Swofford D.L., PAUP*. Phylogenetic Analysis Using Parsimony (*And Other Methods), (2002)