xenoGI: reconstructing the history of genomic island insertions in clades of closely related bacteria

被引:9
作者
Bush, Eliot C. [1 ]
Clark, Anne E. [1 ,2 ]
DeRanek, Carissa A. [1 ]
Eng, Alexander [1 ,2 ]
Forman, Juliet [1 ]
Heath, Kevin [1 ,3 ]
Lee, Alexander B. [1 ,4 ]
Stoebel, Daniel M. [1 ]
Wang, Zunyan [1 ]
Wilber, Matthew [1 ]
Wu, Helen [1 ]
机构
[1] Harvey Mudd Coll, Dept Biol, 301 Platt Blvd, Claremont, CA 91711 USA
[2] Univ Washington, Dept Genome Sci, 3720 15th Ave NF, Seattle, WA 98195 USA
[3] Worcester Polytech Inst, Dept Biol & Biotechnol, 100 Inst Rd, Worcester, MA 01609 USA
[4] Georgia Inst Technol, Quantitat Biosci Program, 837 State St, Atlanta, GA 30332 USA
关键词
Genomic island; Horizontal transfer; Synteny; Gene family; GLUTAMATE-DECARBOXYLASE GENES; III SECRETION SYSTEM; PATHOGENICITY ISLANDS; IDENTIFICATION; SEQUENCE; PREDICTION; CLUSTERS; REGIONS; LOCUS; GADE;
D O I
10.1186/s12859-018-2038-0
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Genomic islands play an important role in microbial genome evolution, providing a mechanism for strains to adapt to new ecological conditions. A variety of computational methods, both genome-composition based and comparative, have been developed to identify them. Some of these methods are explicitly designed to work in single strains, while others make use of multiple strains. In general, existing methods do not identify islands in the context of the phylogeny in which they evolved. Even multiple strain approaches are best suited to identifying genomic islands that are present in one strain but absent in others. They do not automatically recognize islands which are shared between some strains in the clade or determine the branch on which these islands inserted within the phylogenetic tree. Results: We have developed a software package, xenoGI, that identifies genomic islands and maps their origin within a clade of closely related bacteria, determining which branch they inserted on. It takes as input a set of sequenced genomes and a tree specifying their phylogenetic relationships. Making heavy use of synteny information, the package builds gene families in a species-tree-aware way, and then attempts to combine into islands those families whose members are adjacent and whose most recent common ancestor is shared. The package provides a variety of text-based analysis functions, as well as the ability to export genomic islands into formats suitable for viewing in a genome browser. We demonstrate the capabilities of the package with several examples from enteric bacteria, including an examination of the evolution of the acid fitness island in the genus Escherichia. In addition we use output from simulations and a set of known genomic islands from the literature to show that xenoGI can accurately identify genomic islands and place them on a phylogenetic tree. Conclusions: xenoGI is an effective tool for studying the history of genomic island insertions in a clade of microbes. It identifies genomic islands, and determines which branch they inserted on within the phylogenetic tree for the clade. Such information is valuable because it helps us understand the adaptive path that has produced living species.
引用
收藏
页数:11
相关论文
共 60 条
[1]   Detection of genomic islands via segmental genome heterogeneity [J].
Arvey, Aaron J. ;
Azad, Rajeev K. ;
Raval, Alpan ;
Lawrence, Jeffrey G. .
NUCLEIC ACIDS RESEARCH, 2009, 37 (16) :5255-5266
[2]   Recent gene conversions between duplicated glutamate decarboxylase genes (gadA and gadB) in pathogenic Escherichia coli [J].
Bergholz, Teresa M. ;
Tarr, Cheryl L. ;
Christensen, Lisa. M. ;
Betting, David J. ;
Whittam, Thomas S. .
MOLECULAR BIOLOGY AND EVOLUTION, 2007, 24 (10) :2323-2333
[3]   IslandViewer 4: expanded prediction of genomic islands for larger-scale datasets [J].
Bertelli, Claire ;
Laird, Matthew R. ;
Williams, Kelly P. ;
Lau, Britney Y. ;
Hoad, Gemma ;
Winsor, Geoffrey L. ;
Brinkman, Fiona S. L. .
NUCLEIC ACIDS RESEARCH, 2017, 45 (W1) :W30-W35
[4]   The complete genome sequence and analysis of Corynebacterium diphtheriae NCTC13129 [J].
Cerdeño-Tárraga, AM ;
Efstratiou, A ;
Dover, LG ;
Holden, MTG ;
Pallen, M ;
Bentley, SD ;
Besra, GS ;
Churcher, C ;
James, KD ;
De Zoysa, A ;
Chillingworth, T ;
Cronin, A ;
Dowd, L ;
Feltwell, T ;
Hamlin, N ;
Holroyd, S ;
Jagels, K ;
Moule, S ;
Quail, MA ;
Rabbinowitsch, E ;
Rutherford, KM ;
Thomson, NR ;
Unwin, L ;
Whitehead, S ;
Barrell, BG ;
Parkhill, J .
NUCLEIC ACIDS RESEARCH, 2003, 31 (22) :6516-6523
[5]   On detection and assessment of statistical significance of Genomic Islands [J].
Chatterjee, Raghunath ;
Chaudhuri, Keya ;
Chaudhuri, Probal .
BMC GENOMICS, 2008, 9 (1)
[6]   GET_HOMOLOGUES, a Versatile Software Package for Scalable and Robust Microbial Pangenome Analysis [J].
Contreras-Moreira, Bruno ;
Vinuesa, Pablo .
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2013, 79 (24) :7696-7701
[7]  
Dai Q, 2016, BRIEF BIOINFORM, V118
[8]   Parasail: SIMD C library for global, semi-global, and local pairwise sequence alignments [J].
Daily, Jeff .
BMC BIOINFORMATICS, 2016, 16
[9]   Mauve: Multiple alignment of conserved genomic sequence with rearrangements [J].
Darling, ACE ;
Mau, B ;
Blattner, FR ;
Perna, NT .
GENOME RESEARCH, 2004, 14 (07) :1394-1403
[10]   A phylogenomic gene cluster resource: the Phylogenetically Inferred Groups (PhIGs) database [J].
Dehal, Paramvir S. ;
Boore, Jeffrey L. .
BMC BIOINFORMATICS, 2006, 7 (1)