AliGROOVE - visualization of heterogeneous sequence divergence within multiple sequence alignments and detection of inflated branch support

被引:150
作者
Kueck, Patrick [1 ]
Meid, Sandra A. [1 ]
Gross, Christian [2 ]
Waegele, Johann W. [1 ]
Misof, Bernhard [1 ]
机构
[1] Zool Forsch Museum A Koenig, D-53113 Bonn, Germany
[2] Univ Amsterdam, Amsterdam, Netherlands
关键词
Software; Alignment quality; Sequence heterogeneity; Topological node support; PHYLOGENETIC ANALYSES; MAXIMUM-LIKELIHOOD; TREE; SENSITIVITY; IMPROVEMENT; RANDOMNESS; SELECTION; POSITION; BLOCKS; MITES;
D O I
10.1186/1471-2105-15-294
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Masking of multiple sequence alignment blocks has become a powerful method to enhance the tree-likeness of the underlying data. However, existing masking approaches are insensitive to heterogeneous sequence divergence which can mislead tree reconstructions. We present AliGROOVE, a new method based on a sliding window and a Monte Carlo resampling approach, that visualizes heterogeneous sequence divergence or alignment ambiguity related to single taxa or subsets of taxa within a multiple sequence alignment and tags suspicious branches on a given tree. Results: We used simulated multiple sequence alignments to show that the extent of alignment ambiguity in pairwise sequence comparison is correlated with the frequency of misplaced taxa in tree reconstructions. The approach implemented in AliGROOVE allows to detect nodes within a tree that are supported despite the absence of phylogenetic signal in the underlying multiple sequence alignment. We show that AliGROOVE equally well detects heterogeneous sequence divergence in a case study based on an empirical data set of mitochondrial DNA sequences of chelicerates. Conclusions: The AliGROOVE approach has the potential to identify single taxa or subsets of taxa which show predominantly randomized sequence similarity in comparison with other taxa in a multiple sequence alignment. It further allows to evaluate the reliability of node support in a novel way.
引用
收藏
页数:15
相关论文
共 31 条
[1]   Measuring guide-tree dependency of inferred gaps in progressive aligners [J].
Capella-Gutierrez, Salvador ;
Gabaldon, Toni .
BIOINFORMATICS, 2013, 29 (08) :1011-1017
[2]   trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses [J].
Capella-Gutierrez, Salvador ;
Silla-Martinez, Jose M. ;
Gabaldon, Toni .
BIOINFORMATICS, 2009, 25 (15) :1972-1973
[3]   Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis [J].
Castresana, J .
MOLECULAR BIOLOGY AND EVOLUTION, 2000, 17 (04) :540-552
[4]   BMGE (Block Mapping and Gathering with Entropy): a new software for selection of phylogenetic informative regions from multiple sequence alignments [J].
Criscuolo, Alexis ;
Gribaldo, Simonetta .
BMC EVOLUTIONARY BIOLOGY, 2010, 10
[5]   Molecular phylogeny of acariform mites (Acari, Arachnida): Strong conflict between phylogenetic signal and long-branch attraction artifacts [J].
Dabert, Miroslawa ;
Witalinski, Wojciech ;
Kazmierski, Andrzej ;
Olszanowski, Ziemowit ;
Dabert, Jacek .
MOLECULAR PHYLOGENETICS AND EVOLUTION, 2010, 56 (01) :222-241
[6]   NEW INSIGHTS INTO THE PHYLOGENY OF THE PYRAMIDELLIDAE (GASTROPODA) [J].
Dinapoli, Angela ;
Zinssmeister, Carmen ;
Klussmann-Kolb, Annette .
JOURNAL OF MOLLUSCAN STUDIES, 2011, 77 :1-7
[7]   Noisy:: Identification of problematic columns in multiple sequence alignments [J].
Dress, Andreas W. M. ;
Flamm, Christoph ;
Fritzsch, Guido ;
Gruenewald, Stefan ;
Kruspe, Matthias ;
Prohaska, Sonja J. ;
Stadler, Peter F. .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2008, 3 (1)
[8]   The affinities of mites and ticks: a review [J].
Dunlop, J. A. ;
Alberti, G. .
JOURNAL OF ZOOLOGICAL SYSTEMATICS AND EVOLUTIONARY RESEARCH, 2008, 46 (01) :1-18
[9]   INDELible: A Flexible Simulator of Biological Sequence Evolution [J].
Fletcher, William ;
Yang, Ziheng .
MOLECULAR BIOLOGY AND EVOLUTION, 2009, 26 (08) :1879-1888
[10]   A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood [J].
Guindon, S ;
Gascuel, O .
SYSTEMATIC BIOLOGY, 2003, 52 (05) :696-704