Construction of a Species-Level Tree of Life for the Insects and Utility in Taxonomic Profiling

被引:33
作者
Chesters, Douglas [1 ]
机构
[1] Chinese Acad Sci, Inst Zool, Key Lab Zool Systemat & Evolut, Beijing 100101, Peoples R China
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Data integration; data mining; insects; phylogenomics; phyloinformatics; tree of life; MULTIPLE SEQUENCE ALIGNMENT; BIODIVERSITY ASSESSMENT; PHYLOGENOMICS RESOLVES; PHYLOGENETIC PLACEMENT; MITOCHONDRIAL GENOMES; MAXIMUM-LIKELIHOOD; EVOLUTION; HYMENOPTERA; GENERATION; DIVERSITY;
D O I
10.1093/sysbio/syw099
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Although comprehensive phylogenies have proven an invaluable tool in ecology and evolution, their construction is made increasingly challenging both by the scale and structure of publically available sequences. The distinct partition between gene-rich (genomic) and species-rich (DNA barcode) data is a feature of data that has been largely overlooked, yet presents a key obstacle to scaling supermatrix analysis. I present a phyloinformatics framework for draft construction of a species-level phylogeny of insects (Class Insecta). Matrix-building requires separately optimized pipelines for nuclear transcriptomic, mitochondrial genomic, and species-rich markers, whereas tree-building requires hierarchical inference in order to capture species-breadth while retaining deep-level resolution. The phylogeny of insects contains 49,358 species, 13,865 genera, 760 families. Deep-level splits largely reflected previous findings for sections of the tree that are data rich or unambiguous, such as inter-ordinal Endopterygota and Dictyoptera, the recently evolved and relatively homogeneous Lepidoptera, Hymenoptera, Brachycera (Diptera), and Cucujiformia (Coleoptera). However, analysis of bias, matrix construction and gene-tree variation suggests confidence in some relationships (such as in Polyneoptera) is less than has been indicated by the matrix bootstrap method. To assess the utility of the insect tree as a tool in query profiling several tree-based taxonomic assignment methods are compared. Using test data sets with existing taxonomic annotations, a tendency is observed for greater accuracy of species-level assignments where using a fixed comprehensive tree of life in contrast to methods generating smaller de novo reference trees. Described herein is a solution to the discrepancy in the way data are fit into supermatrices. The resulting tree facilitates wider studies of insect diversification and application of advanced descriptions of diversity in community studies, among other presumed applications.
引用
收藏
页码:426 / 439
页数:14
相关论文
共 112 条
[1]   Fine-scale phylogenetic architecture of a complex bacterial community [J].
Acinas, SG ;
Klepac-Ceraj, V ;
Hunt, DE ;
Pharino, C ;
Ceraj, I ;
Distel, DL ;
Polz, MF .
NATURE, 2004, 430 (6999) :551-554
[2]   Phylogenetic community ecology of soil biodiversity using mitochondrial metagenomics [J].
Andujar, Carmelo ;
Arribas, Paula ;
Ruzicka, Filip ;
Crampton-Platt, Alex ;
Timmermans, Martijn J. T. N. ;
Vogler, Alfried P. .
MOLECULAR ECOLOGY, 2015, 24 (14) :3603-3617
[3]  
Barraclough TG, 2003, EVOLUTION, V57, P2166
[4]   BlastAlign:: a program that uses blast to align problematic nucleotide sequences [J].
Belshaw, R ;
Katzourakis, A .
BIOINFORMATICS, 2005, 21 (01) :122-123
[5]  
Benson Dennis A, 2006, Nucleic Acids Res, V34, pD16
[6]   Performance, Accuracy, and Web Server for Evolutionary Placement of Short Sequence Reads under Maximum Likelihood [J].
Berger, Simon A. ;
Krompass, Denis ;
Stamatakis, Alexandros .
SYSTEMATIC BIOLOGY, 2011, 60 (03) :291-302
[7]   A review of long-branch attraction [J].
Bergsten, J .
CLADISTICS, 2005, 21 (02) :163-193
[8]   Morphological and molecular evidence converge upon a robust phylogeny of the megadiverse Holometabola [J].
Beutel, Rolf G. ;
Friedrich, Frank ;
Hoernschemeyer, Thomas ;
Pohl, Hans ;
Huenefeld, Frank ;
Beckmann, Felix ;
Meier, Rudolf ;
Misof, Bernhard ;
Whiting, Michael F. ;
Vilhelmsen, Lars .
CLADISTICS, 2011, 27 (04) :341-355
[9]   The evolution of supertrees [J].
Bininda-Emonds, ORP .
TRENDS IN ECOLOGY & EVOLUTION, 2004, 19 (06) :315-322
[10]   Defining operational taxonomic units using DNA barcode data [J].
Blaxter, M ;
Mann, J ;
Chapman, T ;
Thomas, F ;
Whitton, C ;
Floyd, R ;
Abebe, E .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2005, 360 (1462) :1935-1943