Building the Coleoptera tree-of-life for >8000 species: composition of public DNA data and fit with Linnaean classification

Cited by: 218
Authors
Bocak, Ladislav [1, 2]
Barton, Christopher [1, 3]
Crampton-Platt, Alex [1]
Chesters, Douglas [1, 3]
Ahrens, Dirk [1, 4]
Vogler, Alfried P. [1, 3]
Affiliations
[1] NHM, Dept Life Sci, London, England
[2] Fac Sci UP, Dept Zool, Olomouc, Czech Republic
[3] Univ London Imperial Coll Sci Technol & Med, Dept Life Sci, London SW7 2AZ, Berks, England
[4] Zool Forsch Museum A Koenig, Bonn, Germany
Keywords
RIBOSOMAL-RNA; PHYLOGENETIC-RELATIONSHIPS; MOLECULAR PHYLOGENETICS; BASAL RELATIONSHIPS; ADEPHAGAN BEETLES; ALIGNMENT; SEQUENCES; 18S; TAXONOMY; FAMILY
DOI
10.1111/syen.12037
Chinese Library Classification (CLC) number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
The species representation of public databases is growing rapidly and permits increasingly detailed phylogenetic inferences. We present a supermatrix based on all gene sequences of Coleoptera available in GenBank for two nuclear (18S and 28S rRNA) and two mitochondrial (rrnL and cox1) genes. After filtering for unique species names and the addition of 2000 unpublished sequences for cox1 and 18S rRNA, the resulting data matrix included 8441 species-level terminals and 6600 aligned nucleotide positions. The concatenated matrix represents the equivalent of 2.17% of the 390000 described species of Coleoptera and includes 152 beetle families. The remaining 29 families constitute small lineages with 250 known species in total. Taxonomic coverage remains low for several major lineages, including Buprestidae (0.16% of described species), Staphylinidae (1.03%), Tenebrionidae (0.90%) and Cerambycidae (0.58%). The current taxon sampling was strongly biased towards the Northern Hemisphere. Phylogenetic trees obtained from the supermatrix were in very good agreement with the Linnaean classification, in particular at the family level, but lower for the subfamily and lowest for the genus level. The topology supports the basal split of Derodontidae and Scirtoidea from the remaining Polyphaga, and the broad paraphyly of Cucujoidea. The data extraction pipeline and detailed tree provide a framework for placement of any new sequences, including environmental samples, into a DNA-based classification system of Coleoptera.
Pages: 97-110
Page count: 14