Removal of Noisy Characters from Chloroplast Genome-Scale Data Suggests Revision of Phylogenetic Placements of Amborella and Ceratophyllum

被引:25
作者
Goremykin, Vadim V. [1 ]
Viola, Roberto [1 ]
Hellwig, Frank H. [2 ]
机构
[1] IASMA Res Ctr, I-38010 San Michele All Adige, TN, Italy
[2] Univ Jena, Inst Spezielle Bot, D-07743 Jena, Germany
关键词
Chloroplast genomes; Molecular evolution; Angiosperm diversification; BASAL ANGIOSPERM PHYLOGENY; LONG-BRANCH ATTRACTION; EARLIEST ANGIOSPERMS; MAXIMUM-LIKELIHOOD; GENES; SEQUENCES; DNA; MITOCHONDRIAL; INCONGRUENCE; EVOLUTION;
D O I
10.1007/s00239-009-9206-9
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
It is widely appreciated that noisy, highly variable data can impede phylogeney reconstruction. Researchers have for a long time omitted problematic data from phylogenetic analyses, such as the third-codon positions and variable regions. In the analyses of the phylogenetic relations of the angiosperms; however, inclusion of complete gene sequences into genomic-scale alignments has become a common practice. Here we demonstrate that this practice can be misleading. We show that support of the basal-most position of Amborella trichopoda among the angiosperms in the chloroplast genomic data is based only on a tiny subset (< 1% of the total alignment length) of the most variable positions in alignment, exhibiting mean maximum likelihood (ML) distance among the angiosperm operational taxonomic units (OTUs) approximately 36 substitutions/site. Exclusion of these positions leads to disappearance of the basal Amborella branch. Likewise, the recently reported sister-group relationship of Ceratophyllum to the eudicots is based on the presence of 2% of the most variable positions in the genomic alignment, exhibiting, on average, 20 substitutions/site in comparison among the angiosperm OTUs. These observations highlight a need for excluding a certain proportion of saturated positions in alignment from phylogenomic analyses.
引用
收藏
页码:197 / 204
页数:8
相关论文
共 35 条
[1]   Independent and combined analyses of sequences from all three genomic compartments converge on the root of flowering plant phylogeny [J].
Barkman, TJ ;
Chenery, G ;
McNeal, JR ;
Lyons-Weiler, J ;
Ellisens, WJ ;
Moore, G ;
Wolfe, AD ;
dePamphilis, CW .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (24) :13166-13171
[2]   A review of long-branch attraction [J].
Bergsten, J .
CLADISTICS, 2005, 21 (02) :163-193
[3]   Noncoding plastid trnT-trnF sequences reveal a well resolved phylogeny of basal angiosperms [J].
Borsch, T ;
Hilu, KW ;
Quandt, D ;
Wilde, V ;
Neinhuis, C ;
Barthlott, W .
JOURNAL OF EVOLUTIONARY BIOLOGY, 2003, 16 (04) :558-576
[4]   THE ORIGIN AND EARLY DIVERSIFICATION OF ANGIOSPERMS [J].
CRANE, PR ;
FRIIS, EM ;
PEDERSEN, KR .
NATURE, 1995, 374 (6517) :27-33
[5]   Base-calling of automated sequencer traces using phred.: II.: Error probabilities [J].
Ewing, B ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :186-194
[6]   CASES IN WHICH PARSIMONY OR COMPATIBILITY METHODS WILL BE POSITIVELY MISLEADING [J].
FELSENSTEIN, J .
SYSTEMATIC ZOOLOGY, 1978, 27 (04) :401-410
[7]   The chloroplast genome of the "basal" angiosperm Calycanthus fertilis -: structural and phylogenetic analyses [J].
Goremykin, V ;
Hirsch-Ernst, KI ;
Wölfl, S ;
Hellwig, FH .
PLANT SYSTEMATICS AND EVOLUTION, 2003, 242 (1-4) :119-135
[8]   A new test of phylogenetic model fitness addresses the issue of the basal angiosperm phylogeny [J].
Goremykin, Vadim V. ;
Hellwig, Frank H. .
GENE, 2006, 381 :81-91
[9]   Analysis of Acorus calamus chloroplast genome and its phylogenetic implications [J].
Goremykin, VV ;
Holland, B ;
Hirsch-Ernst, KI ;
Hellwig, FH .
MOLECULAR BIOLOGY AND EVOLUTION, 2005, 22 (09) :1813-1822
[10]   The chloroplast genome of Nymphaea alba:: Whole-genome analyses and the problem of identifying the most basal angiosperm [J].
Goremykin, VV ;
Hirsch-Ernst, KI ;
Wölfl, S ;
Hellwig, FH .
MOLECULAR BIOLOGY AND EVOLUTION, 2004, 21 (07) :1445-1454