mlplasmids: a user-friendly tool to predict plasmid- and chromosome-derived sequences for single species

被引:129
作者
Arredondo-Alonso, Sergio [1 ]
Rogers, Malbert R. C. [1 ]
Braat, Johanna C. [1 ]
Verschuuren, Tess D. [2 ]
Top, Janetta [1 ]
Corander, Jukka [3 ,4 ,5 ]
Willems, Rob J. L. [1 ]
Schurch, Anita C. [1 ]
机构
[1] Univ Med Ctr Utrecht, Dept Med Microbiol, Utrecht, Netherlands
[2] Univ Med Ctr Utrecht, Julius Ctr Hlth Sci & Primary Care, Utrecht, Netherlands
[3] Univ Oslo, Dept Biostat, Fac Med, Oslo, Norway
[4] Univ Helsinki, Dept Math & Stat, Helsinki, Finland
[5] Wellcome Trust Sanger Inst, Infect Genom, Hinxton, England
来源
MICROBIAL GENOMICS | 2018年 / 4卷 / 11期
基金
欧洲研究理事会;
关键词
whole-genome sequencing; plasmid; chromosome; machine learning; antibiotic resistance; ESCHERICHIA-COLI; GENOME; RESISTANCE; MCR-1;
D O I
10.1099/mgen.0.000224
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Assembly of bacterial short-read whole-genome sequencing data frequently results in hundreds of contigs for which the origin, plasmid or chromosome, is unclear. Complete genomes resolved by long-read sequencing can be used to generate and label short-read contigs. These were used to train several popular machine learning methods to classify the origin of contigs from Enterococcus faecium, Klebsiella pneumoniae and Escherichia colt using pentamer frequencies. We selected support-vector machine (SVM) models as the best classifier for all three bacterial species (Fl-score E. faecium=0.92, F1-score K. pneumoniae=0.90, F1-score E. coli=0.76), which outperformed other existing plasmid prediction tools using a benchmarking set of isolates. We demonstrated the scalability of our models by accurately predicting the plasmidome of a large collection of 1644 E. faecium isolates and illustrate its applicability by predicting the location of antibiotic-resistance genes in all three species. The SVM classifiers are publicly available as an R package and graphical-user interface called 'mlplasmids'. We anticipate that this tool may significantly facilitate research on the dissemination of plasmids encoding antibiotic resistance and/or contributing to host adaptation.
引用
收藏
页数:15
相关论文
共 38 条
[1]   plasmidSPAdes: assembling plasmids from whole genome sequencing data [J].
Antipov, Dmitry ;
Hartwick, Nolan ;
Shen, Max ;
Raiko, Mikhail ;
Lapidus, Alla ;
Pevzner, Pavel A. .
BIOINFORMATICS, 2016, 32 (22) :3380-3387
[2]  
Arredondo-Alonso S, 2018, GITLAB
[3]   On the (im)possibility of reconstructing plasmids from whole-genome short-read sequencing data [J].
Arredondo-Alonso, Sergio ;
Willems, Rob J. ;
van Schaik, Willem ;
Schurch, Anita C. .
MICROBIAL GENOMICS, 2017, 3 (10)
[4]   SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing [J].
Bankevich, Anton ;
Nurk, Sergey ;
Antipov, Dmitry ;
Gurevich, Alexey A. ;
Dvorkin, Mikhail ;
Kulikov, Alexander S. ;
Lesin, Valery M. ;
Nikolenko, Sergey I. ;
Son Pham ;
Prjibelski, Andrey D. ;
Pyshkin, Alexey V. ;
Sirotkin, Alexander V. ;
Vyahhi, Nikolay ;
Tesler, Glenn ;
Alekseyev, Max A. ;
Pevzner, Pavel A. .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2012, 19 (05) :455-477
[5]  
Bischl B, 2016, J MACH LEARN RES, V17
[6]   In Silico Detection and Typing of Plasmids using PlasmidFinder and Plasmid Multilocus Sequence Typing [J].
Carattoli, Alessandra ;
Zankari, Ea ;
Garcia-Fernandez, Aurora ;
Larsen, Mette Voldby ;
Lund, Ole ;
Villa, Laura ;
Aarestrup, Frank Moller ;
Hasman, Henrik .
ANTIMICROBIAL AGENTS AND CHEMOTHERAPY, 2014, 58 (07) :3895-3903
[7]   Plasmids and the spread of resistance [J].
Carattoli, Alessandra .
INTERNATIONAL JOURNAL OF MEDICAL MICROBIOLOGY, 2013, 303 (6-7) :298-304
[8]  
Clewell D.B., 2014, Extrachromosomal and Mobile Elements in Enterococci: Transmission, Maintenance, and Epidemiology, Massachusetts Eye and Ear Infirmary
[9]  
Clewell DB., 2014, Enterococci: from Commensals to Leading Causes of Drug Resistant Infection, P309
[10]   Single-molecule sequencing to track plasmid diversity of hospital-associated carbapenemase-producing Enterobacteriaceae [J].
Conlan, Sean ;
Thomas, Pamela J. ;
Deming, Clayton ;
Park, Morgan ;
Lau, Anna F. ;
Dekker, John P. ;
Snitkin, Evan S. ;
Clark, Tyson A. ;
Khai Luong ;
Song, Yi ;
Tsai, Yu-Chih ;
Boitano, Matthew ;
Dayal, Jyoti ;
Brooks, Shelise Y. ;
Schmidt, Brian ;
Young, Alice C. ;
Thomas, James W. ;
Bouffard, Gerard G. ;
Blakesley, Robert W. ;
Mullikin, James C. ;
Korlach, Jonas ;
Henderson, David K. ;
Frank, Karen M. ;
Palmore, Tara N. ;
Segre, Julia A. .
SCIENCE TRANSLATIONAL MEDICINE, 2014, 6 (254)