Telling the whole story in a 10,000-genome world

被引:35
作者
Beiko, Robert G. [1 ]
机构
[1] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 1W5, Canada
基金
加拿大创新基金会; 加拿大自然科学与工程研究理事会;
关键词
HORIZONTAL GENE-TRANSFER; GENOME TREES; PHYLOGENETIC POSITION; MICROBIAL GENOMES; PROTEIN; LIFE; NETWORKS; CORE; IDENTIFICATION; EVOLUTIONARY;
D O I
10.1186/1745-6150-6-34
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Genome sequencing has revolutionized our view of the relationships among genomes, particularly in revealing the confounding effects of lateral genetic transfer (LGT). Phylogenomic techniques have been used to construct purported trees of microbial life. Although such trees are easily interpreted and allow the use of a subset of genomes as "proxies" for the full set, LGT and other phenomena impact the positioning of different groups in genome trees, confounding and potentially invalidating attempts to construct a phylogeny-based taxonomy of microorganisms. Network and graph approaches can reveal complex sets of relationships, but applying these techniques to large data sets is a significant challenge. Notwithstanding the question of what exactly it might represent, generating and interpreting a Tree or Network of All Genomes will only be feasible if current algorithms can be improved upon. Results: Complex relationships among even the most-similar genomes demonstrate that proxy-based approaches to simplifying large sets of genomes are not alone sufficient to solve the analysis problem. A phylogenomic analysis of 1173 sequenced bacterial and archaeal genomes generated phylogenetic trees for 159,905 distinct homologous gene sets. The relationships inferred from this set can be heavily dependent on the inclusion of other taxa: for example, phyla such as Spirochaetes, Proteobacteria and Firmicutes are recovered as cohesive groups or split depending on the presence of other specific lineages. Furthermore, named groups such as Acidithiobacillus, Coprothermobacter and Brachyspira show a multitude of affiliations that are more consistent with their ecology than with small subunit ribosomal DNA-based taxonomy. Network and graph representations can illustrate the multitude of conflicting affinities, but all methods impose constraints on the input data and create challenges of construction and interpretation. Conclusions: These complex relationships highlight the need for an inclusive approach to genomic data, and current methods with minor alterations will likely scale to allow the analysis of data sets with 10,000 or more genomes. The main challenges lie in the visualization and interpretation of genomic relationships, and the redefinition of microbial taxonomy when subsets of genomic data are so evidently in conflict with one another, and with the "canonical" molecular taxonomy.
引用
收藏
页数:36
相关论文
共 113 条
  • [1] OMA 2011: orthology inference among 1000 complete genomes
    Altenhoff, Adrian M.
    Schneider, Adrian
    Gonnet, Gaston H.
    Dessimoz, Christophe
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 : D289 - D294
  • [2] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [3] [Anonymous], 1999, Readings in information visualization: using vision to think
  • [4] A kingdom-level phylogeny of eukaryotes based on combined protein data
    Baldauf, SL
    Roger, AJ
    Wenk-Siefert, I
    Doolittle, WF
    [J]. SCIENCE, 2000, 290 (5493) : 972 - 977
  • [5] Alternative methods for concatenation of core genes indicate a lack of resolution in deep nodes of the prokaryotic phylogeny
    Bapteste, E.
    Susko, E.
    Leigh, J.
    Ruiz-Trillo, I.
    Bucknam, J.
    Doolittle, W. F.
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2008, 25 (01) : 83 - 91
  • [6] Phylogenetic identification of lateral genetic transfer events
    Beiko, RG
    Hamilton, N
    [J]. BMC EVOLUTIONARY BIOLOGY, 2006, 6 (1) : 17P
  • [7] Highways of gene sharing in prokaryotes
    Beiko, RG
    Harlow, TJ
    Ragan, MA
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (40) : 14332 - 14337
  • [8] Gene sharing and genome evolution: networks in trees and trees in networks
    Beiko, Robert G.
    [J]. BIOLOGY & PHILOSOPHY, 2010, 25 (04) : 659 - 673
  • [9] The Impact of Reticulate Evolution on Genome Phylogeny
    Beiko, Robert G.
    Doolittle, W. Ford
    Charlebois, Robert L.
    [J]. SYSTEMATIC BIOLOGY, 2008, 57 (06) : 844 - 856
  • [10] Accounting for horizontal gene transfers explains conflicting hypotheses regarding the position of aquificales in the phylogeny of Bacteria
    Boussau, Bastien
    Gueguen, Laurent
    Gouy, Manolo
    [J]. BMC EVOLUTIONARY BIOLOGY, 2008, 8 (1)