Phylogenetic identification of lateral genetic transfer events

被引:101
作者
Beiko, RG [1 ]
Hamilton, N
机构
[1] Univ Queensland, Inst Mol Biosci, Brisbane, Qld, Australia
[2] Univ Queensland, Adv Computat Modelling Ctr, Brisbane, Qld, Australia
关键词
D O I
10.1186/1471-2148-6-15
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Lateral genetic transfer can lead to disagreements among phylogenetic trees comprising sequences from the same set of taxa. Where topological discordance is thought to have arisen through genetic transfer events, tree comparisons can be used to identify the lineages that may have shared genetic information. An 'edit path' of one or more transfer events can be represented with a series of subtree prune and regraft (SPR) operations, but finding the optimal such set of operations is NP-hard for comparisons between rooted trees, and may be so for unrooted trees as well. Results: Efficient Evaluation of Edit Paths ( EEEP) is a new tree comparison algorithm that uses evolutionarily reasonable constraints to identify and eliminate many unproductive search avenues, reducing the time required to solve many edit path problems. The performance of EEEP compares favourably to that of other algorithms when applied to strictly bifurcating trees with specified numbers of SPR operations. We also used EEEP to recover edit paths from over 19 000 unrooted, incompletely resolved protein trees containing up to 144 taxa as part of a large phylogenomic study. While inferred protein trees were far more similar to a reference supertree than random trees were to each other, the phylogenetic distance spanned by random versus inferred transfer events was similar, suggesting that real transfer events occur most frequently between closely related organisms, but can span large phylogenetic distances as well. While most of the protein trees examined here were very similar to the reference supertree, requiring zero or one edit operations for reconciliation, some trees implied up to 40 transfer events within a single orthologous set of proteins. Conclusion: Since sequence trees typically have no implied root and may contain unresolved or multifurcating nodes, the strategy implemented in EEEP is the most appropriate for phylogenomic analyses. The high degree of consistency among inferred protein trees shows that vertical inheritance is the dominant pattern of evolution, at least for the set of organisms considered here. However, the edit paths inferred using EEEP suggest an important role for genetic transfer in the evolution of microbial genomes as well.
引用
收藏
页数:17
相关论文
共 49 条
  • [1] Addario-Berry L, 2003, LECT N BIOINFORMAT, V2812, P202
  • [2] Allen CR, 2001, CONSERV ECOL, V5
  • [3] [Anonymous], 2001, Proceedings of the 5th Annual International Conference on Research in Computational Molecular Biology, DOI [DOI 10.1145/369133.369188, 10.1145/369133.369188]
  • [4] Bayesian models of episodic evolution support a late Precambrian explosive diversification of the Metazoa
    Aris-Brosou, S
    Yang, ZH
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2003, 20 (12) : 1947 - 1954
  • [5] Highways of gene sharing in prokaryotes
    Beiko, RG
    Harlow, TJ
    Ragan, MA
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (40) : 14332 - 14337
  • [6] Counting consistent phylogenetic trees is #P-complete
    Bordewich, M
    Semple, C
    Talbot, J
    [J]. ADVANCES IN APPLIED MATHEMATICS, 2004, 33 (02) : 416 - 430
  • [7] Bryant J, 2004, HEALTH TECHNOL ASSES, V8, P1
  • [8] The neomuran origin of archaebacteria, the negibacterial root of the universal tree and bacterial megaclassification
    Cavalier-Smith, T
    [J]. INTERNATIONAL JOURNAL OF SYSTEMATIC AND EVOLUTIONARY MICROBIOLOGY, 2002, 52 : 7 - 76
  • [9] Does a tree-like phylogeny only exist at the tips in the prokaryotes?
    Creevey, CJ
    Fitzpatrick, DA
    Philip, GK
    Kinsella, RJ
    O'Connell, MJ
    Pentony, MM
    Travers, SA
    Wilkinson, M
    McInerney, JO
    [J]. PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2004, 271 (1557) : 2551 - 2558
  • [10] On the linear-cost subtree-transfer distance between phylogenetic trees
    DasGupta, B
    He, X
    Jiang, T
    Li, M
    Tromp, J
    [J]. ALGORITHMICA, 1999, 25 (2-3) : 176 - 195