mlplasmids: a user-friendly tool to predict plasmid- and chromosome-derived sequences for single species

被引:135
作者
Arredondo-Alonso, Sergio [1 ]
Rogers, Malbert R. C. [1 ]
Braat, Johanna C. [1 ]
Verschuuren, Tess D. [2 ]
Top, Janetta [1 ]
Corander, Jukka [3 ,4 ,5 ]
Willems, Rob J. L. [1 ]
Schurch, Anita C. [1 ]
机构
[1] Univ Med Ctr Utrecht, Dept Med Microbiol, Utrecht, Netherlands
[2] Univ Med Ctr Utrecht, Julius Ctr Hlth Sci & Primary Care, Utrecht, Netherlands
[3] Univ Oslo, Dept Biostat, Fac Med, Oslo, Norway
[4] Univ Helsinki, Dept Math & Stat, Helsinki, Finland
[5] Wellcome Trust Sanger Inst, Infect Genom, Hinxton, England
基金
欧洲研究理事会;
关键词
whole-genome sequencing; plasmid; chromosome; machine learning; antibiotic resistance; ESCHERICHIA-COLI; GENOME; RESISTANCE; MCR-1;
D O I
10.1099/mgen.0.000224
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Assembly of bacterial short-read whole-genome sequencing data frequently results in hundreds of contigs for which the origin, plasmid or chromosome, is unclear. Complete genomes resolved by long-read sequencing can be used to generate and label short-read contigs. These were used to train several popular machine learning methods to classify the origin of contigs from Enterococcus faecium, Klebsiella pneumoniae and Escherichia colt using pentamer frequencies. We selected support-vector machine (SVM) models as the best classifier for all three bacterial species (Fl-score E. faecium=0.92, F1-score K. pneumoniae=0.90, F1-score E. coli=0.76), which outperformed other existing plasmid prediction tools using a benchmarking set of isolates. We demonstrated the scalability of our models by accurately predicting the plasmidome of a large collection of 1644 E. faecium isolates and illustrate its applicability by predicting the location of antibiotic-resistance genes in all three species. The SVM classifiers are publicly available as an R package and graphical-user interface called 'mlplasmids'. We anticipate that this tool may significantly facilitate research on the dissemination of plasmids encoding antibiotic resistance and/or contributing to host adaptation.
引用
收藏
页数:15
相关论文
共 38 条
[21]  
Loman NJ, 2015, NAT METHODS, V12, P733, DOI [10.1038/NMETH.3444, 10.1038/nmeth.3444]
[22]   Performance comparison of benchtop high-throughput sequencing platforms [J].
Loman, Nicholas J. ;
Misra, Raju V. ;
Dallman, Timothy J. ;
Constantinidou, Chrystala ;
Gharbia, Saheer E. ;
Wain, John ;
Pallen, Mark J. .
NATURE BIOTECHNOLOGY, 2012, 30 (05) :434-+
[23]   Klebsiella pneumoniae: a major worldwide source and shuttle for antibiotic resistance [J].
Navon-Venezia, Shiri ;
Kondratyeva, Kira ;
Carattoli, Alessandra .
FEMS MICROBIOLOGY REVIEWS, 2017, 41 (03) :252-275
[24]   Mash: fast genome and metagenome distance estimation using MinHash [J].
Ondov, Brian D. ;
Treangen, Todd J. ;
Melsted, Pall ;
Mallonee, Adam B. ;
Bergman, Nicholas H. ;
Koren, Sergey ;
Phillippy, Adam M. .
GENOME BIOLOGY, 2016, 17
[25]   Plasmid Classification in an Era of Whole-Genome Sequencing: Application in Studies of Antibiotic Resistance Epidemiology [J].
Orlek, Alex ;
Stoesser, Nicole ;
Anjum, Muna F. ;
Doumith, Michel ;
Ellington, Matthew J. ;
Peto, Tim ;
Crook, Derrick ;
Woodford, Neil ;
Walker, A. Sarah ;
Phan, Hang ;
Sheppard, Anna E. .
FRONTIERS IN MICROBIOLOGY, 2017, 8
[26]  
Pages H, 2016, BIOSTRINGS STRING OB, V42, P1
[27]   A single chromosome assembly of Bacteroides fragilis strain BE1 from Illumina and MinION nanopore sequencing data [J].
Risse, Judith ;
Thomson, Marian ;
Patrick, Sheila ;
Blakely, Garry ;
Koutsovoulos, Georgios ;
Blaxter, Mark ;
Watson, Mick .
GIGASCIENCE, 2015, 4
[28]   Recycler: an algorithm for detecting plasmids from de novo assembly graphs [J].
Rozov, Roye ;
Kav, Aya Brown ;
Bogumil, David ;
Shterzer, Naama ;
Halperin, Eran ;
Mizrahi, Itzhak ;
Shamir, Ron .
BIOINFORMATICS, 2017, 33 (04) :475-482
[29]  
Rstudio, 2014, SHIN EAS WEB APPL R
[30]   Nested Russian Doll-Like Genetic Mobility Drives Rapid Dissemination of the Carbapenem Resistance Gene blaKPC [J].
Sheppard, Anna E. ;
Stoesser, Nicole ;
Wilson, Daniel J. ;
Sebra, Robert ;
Kasarskis, Andrew ;
Anson, Luke W. ;
Giess, Adam ;
Pankhurst, Louise J. ;
Vaughan, Alison ;
Grim, Christopher J. ;
Cox, Heather L. ;
Yeh, Anthony J. ;
Sifri, Costi D. ;
Walker, A. Sarah ;
Peto, Tim E. ;
Crook, Derrick W. ;
Mathers, Amy J. .
ANTIMICROBIAL AGENTS AND CHEMOTHERAPY, 2016, 60 (06) :3767-3778