PlasmidHostFinder: Prediction of Plasmid Hosts Using Random Forest

被引:13
作者
Aytan-Aktug, Derya [1 ]
Clausen, Philip T. L. C. [1 ]
Szarvas, Judit [1 ]
Munk, Patrick [1 ]
Otani, Saria [1 ]
Nguyen, Marcus [2 ,3 ]
Davis, James J. [2 ,3 ,4 ]
Lund, Ole [1 ]
Aarestrup, Frank M. [1 ]
机构
[1] Tech Univ Denmark, Natl Food Inst, Lyngby, Denmark
[2] Univ Chicago, Consortium Adv Sci & Engn, Chicago, IL 60637 USA
[3] Argonne Natl Lab, Data Sci & Learning Div, 9700 S Cass Ave, Argonne, IL 60439 USA
[4] Northwestern Argonne Inst Sci & Engn, Evanston, IL USA
基金
美国国家卫生研究院;
关键词
antimicrobial resistance; horizontal gene transfer; machine learning; plasmid; plasmid host; plasmid host range; random forest; EXPANSION;
D O I
10.1128/msystems.01180-21
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Plasmids play a major role facilitating the spread of antimicrobial resistance between bacteria. Understanding the host range and dissemination trajectories of plasmids is critical for surveillance and prevention of antimicrobial resistance. Identification of plasmid host ranges could be improved using automated pattern detection methods compared to homology-based methods due to the diversity and genetic plasticity of plasmids. In this study, we developed a method for predicting the host range of plasmids using machine learning-specifically, random forests. We trained the models with 8,519 plasmids from 359 different bacterial species per taxonomic level; the models achieved Matthews correlation coefficients of 0.662 and 0.867 at the species and order levels, respectively. Our results suggest that despite the diverse nature and genetic plasticity of plasmids, our random forest model can accurately distinguish between plasmid hosts. This tool is available online through the Center for Genomic Epidemiology (https://cge.cbs.dtu.dk/services/PlasmidHostFinder/). IMPORTANCE Antimicrobial resistance is a global health threat to humans and animals, causing high mortality and morbidity while effectively ending decades of success in fighting against bacterial infections. Plasmids confer extra genetic capabilities to the host organisms through accessory genes that can encode antimicrobial resistance and virulence. In addition to lateral inheritance, plasmids can be transferred horizontally between bacterial taxa. Therefore, detection of the host range of plasmids is crucial for understanding and predicting the dissemination trajectories of extrachromosomal genes and bacterial evolution as well as taking effective countermeasures against antimicrobial resistance.
引用
收藏
页数:16
相关论文
共 41 条
[21]   Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation [J].
O'Leary, Nuala A. ;
Wright, Mathew W. ;
Brister, J. Rodney ;
Ciufo, Stacy ;
McVeigh, Diana Haddad Rich ;
Rajput, Bhanu ;
Robbertse, Barbara ;
Smith-White, Brian ;
Ako-Adjei, Danso ;
Astashyn, Alexander ;
Badretdin, Azat ;
Bao, Yiming ;
Blinkova, Olga ;
Brover, Vyacheslav ;
Chetvernin, Vyacheslav ;
Choi, Jinna ;
Cox, Eric ;
Ermolaeva, Olga ;
Farrell, Catherine M. ;
Goldfarb, Tamara ;
Gupta, Tripti ;
Haft, Daniel ;
Hatcher, Eneida ;
Hlavina, Wratko ;
Joardar, Vinita S. ;
Kodali, Vamsi K. ;
Li, Wenjun ;
Maglott, Donna ;
Masterson, Patrick ;
McGarvey, Kelly M. ;
Murphy, Michael R. ;
O'Neill, Kathleen ;
Pujar, Shashikant ;
Rangwala, Sanjida H. ;
Rausch, Daniel ;
Riddick, Lillian D. ;
Schoch, Conrad ;
Shkeda, Andrei ;
Storz, Susan S. ;
Sun, Hanzhen ;
Thibaud-Nissen, Francoise ;
Tolstoy, Igor ;
Tully, Raymond E. ;
Vatsan, Anjana R. ;
Wallin, Craig ;
Webb, David ;
Wu, Wendy ;
Landrum, Melissa J. ;
Kimchi, Avi ;
Tatusova, Tatiana .
NUCLEIC ACIDS RESEARCH, 2016, 44 (D1) :D733-D745
[22]   Ordering the mob: Insights into replicon and MOB typing schemes from analysis of a curated dataset of publicly available plasmids [J].
Orlek, Alex ;
Phan, Hang ;
Sheppard, Anna E. ;
Doumith, Michel ;
Ellington, Matthew ;
Peto, Tim ;
Crook, Derrick ;
Walker, A. Sarah ;
Woodford, Neil ;
Anjum, Muna F. ;
Stoesser, Nicole .
PLASMID, 2017, 91 :42-52
[23]   Plasmid Classification in an Era of Whole-Genome Sequencing: Application in Studies of Antibiotic Resistance Epidemiology [J].
Orlek, Alex ;
Stoesser, Nicole ;
Anjum, Muna F. ;
Doumith, Michel ;
Ellington, Matthew J. ;
Peto, Tim ;
Crook, Derrick ;
Woodford, Neil ;
Walker, A. Sarah ;
Phan, Hang ;
Sheppard, Anna E. .
FRONTIERS IN MICROBIOLOGY, 2017, 8
[24]   Understanding and predicting ciprofloxacin minimum inhibitory concentration in Escherichia coli with machine learning [J].
Pataki, Balint Armin ;
Matamoros, Sebastien ;
van der Putten, Boas C. L. ;
Remondini, Daniel ;
Giampieri, Enrico ;
Aytan-Aktug, Derya ;
Hendriksen, Rene S. ;
Lund, Ole ;
Csabai, Istvan ;
Schultsz, Constance .
SCIENTIFIC REPORTS, 2020, 10 (01)
[25]  
Pedregosa F, 2011, J MACH LEARN RES, V12, P2825
[26]   COPLA, a taxonomic classifier of plasmids [J].
Redondo-Salvo, Santiago ;
Bartomeus-Penalver, Roger ;
Vielva, Luis ;
Tagg, Kaitlin A. ;
Webb, Hattie E. ;
Fernandez-Lopez, Raul ;
de la Cruz, Fernando .
BMC BIOINFORMATICS, 2021, 22 (01)
[27]   Pathways for horizontal gene transfer in bacteria revealed by a global map of their plasmids [J].
Redondo-Salvo, Santiago ;
Fernandez-Lopez, Raul ;
Ruiz, Raul ;
Vielva, Luis ;
de Toro, Maria ;
Rocha, Eduardo P. C. ;
Pilar Garcillan-Barcia, M. ;
de la Cruz, Fernando .
NATURE COMMUNICATIONS, 2020, 11 (01)
[28]   Universal whole-sequence-based plasmid typing and its utility to prediction of host range and epidemiological surveillance [J].
Robertson, James ;
Bessonov, Kyrylo ;
Schonfeld, Justin ;
Nash, John H. E. .
MICROBIAL GENOMICS, 2020, 6 (10) :1-12
[29]   Beyond horizontal gene transfer: the role of plasmids in bacterial evolution [J].
Rodriguez-Beltran, Jeronimo ;
DelaFuente, Javier ;
Leon-Sampedro, Ricardo ;
MacLean, R. Craig ;
San Millan, Alvaro .
NATURE REVIEWS MICROBIOLOGY, 2021, 19 (06) :347-359
[30]   Prediction of the intestinal resistome by a three-dimensional structure-based method [J].
Ruppe, Etienne ;
Ghozlane, Amine ;
Tap, Julien ;
Pons, Nicolas ;
Alvarez, Anne-Sophie ;
Maziers, Nicolas ;
Cuestas, Trinidad ;
Hernando-Amado, Sara ;
Clares, Irene ;
Luis Martinez, Jose ;
Coque, Teresa M. ;
Baquero, Fernando ;
Lanza, Val F. ;
Maiz, Luis ;
Goulenok, Tiphaine ;
de Lastours, Victoire ;
Amor, Nawal ;
Fantin, Bruno ;
Wieder, Ingrid ;
Andremont, Antoine ;
Schaik, Van ;
Rogers, Malbert ;
Zhang, Xinglin ;
Willems, Rob J. L. ;
de Brevern, Alexandre G. ;
Battol, Jean-Michel ;
Blottiere, Herve M. ;
Leonard, Pierre ;
Lejardi, Veronique ;
Letur, Aline ;
Levenez, Florence ;
Weiszer, Kevin ;
Haimet, Florence ;
Dore, Joel ;
Kennedy, Sean P. ;
Ehrlich, S. Dusko .
NATURE MICROBIOLOGY, 2019, 4 (01) :112-+