Global optimal eBURST analysis of multilocus typing data using a graphic matroid approach

被引:472
作者
Francisco, Alexandre P. [1 ,2 ]
Bugalho, Miguel [1 ,2 ]
Ramirez, Mario [3 ]
Carrico, Joao A. [1 ,3 ]
机构
[1] ID Lisboa, Inst Engn Sistemas & Computadores, P-1000029 Lisbon, Portugal
[2] Univ Tecn Lisboa, Inst Super Tecn, P-1049001 Lisbon, Portugal
[3] Univ Lisbon, Inst Microbiol, Inst Mol Med, Fac Med, P-1649028 Lisbon, Portugal
关键词
STREPTOCOCCUS-PNEUMONIAE; ENTEROCOCCUS-FAECIUM; POPULATION-STRUCTURE; SEROGROUP-C; IDENTIFICATION; EMERGENCE; EXPANSION; OUTBREAK; COMPLEX; CLONES;
D O I
10.1186/1471-2105-10-152
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Multilocus Sequence Typing (MLST) is a frequently used typing method for the analysis of the clonal relationships among strains of several clinically relevant microbial species. MLST is based on the sequence of housekeeping genes that result in each strain having a distinct numerical allelic profile, which is abbreviated to a unique identifier: the sequence type (ST). The relatedness between two strains can then be inferred by the differences between allelic profiles. For a more comprehensive analysis of the possible patterns of evolutionary descent, a set of rules were proposed and implemented in the eBURST algorithm. These rules allow the division of a data set into several clusters of related strains, dubbed clonal complexes, by implementing a simple model of clonal expansion and diversification. Within each clonal complex, the rules identify which links between STs correspond to the most probable pattern of descent. However, the eBURST algorithm is not globally optimized, which can result in links, within the clonal complexes, that violate the rules proposed. Results: Here, we present a globally optimized implementation of the eBURST algorithm-goeBURST. The search for a global optimal solution led to the formalization of the problem as a graphic matroid, for which greedy algorithms that provide an optimal solution exist. Several public data sets of MLST data were tested and differences between the two implementations were found and are discussed for five bacterial species: Enterococcus faecium, Streptococcus pneumoniae, Burkholderia pseudomallei, Campylobacter jejuni and Neisseria spp.. A novel feature implemented in goeBURST is the representation of the level of tiebreak rule reached before deciding if a link should be drawn, which can used to visually evaluate the reliability of the represented hypothetical pattern of descent. Conclusion: goeBURST is a globally optimized implementation of the eBURST algorithm, that identifies alternative patterns of descent for several bacterial species. Furthermore, the algorithm can be applied to any multilocus typing data based on the number of differences between numeric profiles. A software implementation is available at http://goeBURST.phyloviz.net.
引用
收藏
页数:15
相关论文
共 38 条
[1]   Evolution, Population Structure, and Phylogeography of Genetically Monomorphic Bacterial Pathogens [J].
Achtman, Mark .
ANNUAL REVIEW OF MICROBIOLOGY, 2008, 62 :53-70
[2]   Microbial diversity and the genetic nature of microbial species [J].
Achtman, Mark ;
Wagner, Michael .
NATURE REVIEWS MICROBIOLOGY, 2008, 6 (06) :431-440
[3]   Seasonality and outbreak of a predominant Streptococcus pneumoniae serotype 1 clone from The Gambia: Expansion of ST217 hypervirulent clonal complex in West Africa [J].
Antonio, Martin ;
Hakeem, Ishrat ;
Awine, Timothy ;
Secka, Ousman ;
Sankareh, Kawsu ;
Nsekpong, David ;
Lahai, George ;
Akisanya, Abiodun ;
Egere, Uzochukwu ;
Enwere, Godwin ;
Zaman, Syed M. A. ;
Hill, Philip C. ;
Corrah, Tumani ;
Cutts, Felicity ;
Greenwood, Brian M. ;
Adegbola, Richard A. .
BMC MICROBIOLOGY, 2008, 8 (1)
[4]  
*BERK I DES, PREF VIS TOOLK
[5]  
BORUVKA O, 1926, PRACE MORASKE PRIDOV, V3
[6]  
Cormen TH., 2001, Introduction to Algorithms
[7]   Polyclonal population structure of Streptococcus pneumoniae isolates in Spain carrying mef and mef plus erm(B) [J].
de la Pedrosa, Elia Gomez G. ;
Morosini, Maria-Isabel ;
van der Linden, Mark ;
Ruiz-Garbajosa, Patricia ;
Galan, Juan Carlos ;
Baquero, Fernando ;
Reinert, Ralf Rene ;
Canton, Rafael .
ANTIMICROBIAL AGENTS AND CHEMOTHERAPY, 2008, 52 (06) :1964-1969
[8]   High acquisition and environmental contamination rates of CC17 ampicillin-resistant Enterococcus faecium in a Dutch hospital [J].
de Regt, Marieke J. A. ;
van der Wagen, Lotte E. ;
Top, Janetta ;
Blok, Hetty E. M. ;
Hopmans, Titia E. M. ;
Dekker, Adriaan W. ;
Hene, Ronald J. ;
Siersema, Peter D. ;
Willems, Rob J. L. ;
Bonten, Marc J. M. .
JOURNAL OF ANTIMICROBIAL CHEMOTHERAPY, 2008, 62 (06) :1401-1406
[9]   Inference of bacterial microevolution using multilocus sequence data [J].
Didelot, Xavier ;
Falush, Daniel .
GENETICS, 2007, 175 (03) :1251-1266
[10]   Evolution and Diversity of Clonal Bacteria: The Paradigm of Mycobacterium tuberculosis [J].
Dos Vultos, Tiago ;
Mestre, Olga ;
Rauzier, Jean ;
Golec, Marcin ;
Rastogi, Nalin ;
Rasolofo, Voahangy ;
Tonjum, Tone ;
Sola, Christophe ;
Matic, Ivan ;
Gicquel, Brigitte .
PLOS ONE, 2008, 3 (02)