Evaluation of Machine Learning Models for Predicting Antimicrobial Resistance of Actinobacillus pleuropneumoniae From Whole Genome Sequences

被引:49
作者
Liu, Zhichang [1 ,2 ,3 ,4 ]
Deng, Dun [1 ,2 ,3 ,4 ]
Lu, Huijie [1 ,2 ,3 ,4 ]
Sun, Jian [5 ]
Lv, Luchao [5 ]
Li, Shuhong [1 ,2 ,3 ,4 ]
Peng, Guanghui [1 ,2 ,3 ,4 ]
Ma, Xianyong [1 ,2 ,3 ,4 ]
Li, Jiazhou [1 ,2 ,3 ,4 ]
Li, Zhenming [1 ,2 ,3 ,4 ]
Rong, Ting [1 ,2 ,3 ,4 ]
Wang, Gang [1 ,2 ,3 ,4 ]
机构
[1] Guangdong Acad Agr Sci, Inst Anim Sci, Guangzhou, Peoples R China
[2] State Key Lab Livestock & Poultry Breeding, Guangzhou, Peoples R China
[3] Minist Agr South China, Key Lab Anim Nutr & Feed Sci, Guangzhou, Peoples R China
[4] Guangdong Engn Technol Res Ctr Anim Meat Qual & S, Guangzhou, Peoples R China
[5] South China Agr Univ, Coll Vet Med, Natl Vet Microbiol Drug Resistance Risk Assessmen, Guangzhou, Peoples R China
关键词
machine learning; Support Vector Machine; Set Covering Machine; antimicrobial resistance; Actinobacillus pleuropneumoniae; genomics; IDENTIFICATION; ALGORITHMS;
D O I
10.3389/fmicb.2020.00048
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Antimicrobial resistance (AMR) is becoming a huge problem in countries all over the world, and new approaches to identifying strains resistant or susceptible to certain antibiotics are essential in fighting against antibiotic-resistant pathogens. Genotype-based machine learning methods showed great promise as a diagnostic tool, due to the increasing availability of genomic datasets and AST phenotypes. In this article, Support Vector Machine (SVM) and Set Covering Machine (SCM) models were used to learn and predict the resistance of the five drugs (Tetracycline, Ampicillin, Sulfisoxazole, Trimethoprim, and Enrofloxacin). The SVM model used the number of co-occurring k-mers between the genome of the isolates and the reference genes to learn and predict the phenotypes of the bacteria to a specific antimicrobial, while the SCM model uses a greedy approach to construct conjunction or disjunction of Boolean functions to find the most concise set of k-mers that allows for accurate prediction of the phenotype. Five-fold cross-validation was performed on the training set of the SVM and SCM model to select the best hyperparameter values to avoid model overfitting. The training accuracy (mean cross-validation score) and the testing accuracy of SVM and SCM models of five drugs were above 90% regardless of the resistant mechanism of which were acquired resistant or point mutation in the chromosome. The results of correlation between the phenotype and the model predictions of the five drugs indicated that both SVM and SCM models could significantly classify the resistant isolates from the sensitive isolates of the bacteria (p < 0.01), and would be used as potential tools in antimicrobial resistance surveillance and clinical diagnosis in veterinary medicine.
引用
收藏
页数:7
相关论文
共 18 条
[1]   Whole Genome Sequencing for Surveillance of Antimicrobial Resistance in Actinobacillus pleuropneumoniae [J].
Bosse, Janine T. ;
Li, Yanwen ;
Rogers, Jon ;
Crespo, Roberto Fernandez ;
Li, Yinghui ;
Chaudhuri, Roy R. ;
Holden, Matthew T. G. ;
Maskell, Duncan J. ;
Tucker, Alexander W. ;
Wren, Brendan W. ;
Rycroft, Andrew N. ;
Langford, Paul R. .
FRONTIERS IN MICROBIOLOGY, 2017, 8
[2]   Algorithms for the set covering problem [J].
Caprara, A ;
Toth, P ;
Fischetti, M .
ANNALS OF OPERATIONS RESEARCH, 2000, 98 (1-4) :353-371
[3]   Benchmarking of methods for identification of antimicrobial resistance genes in bacterial whole genome data [J].
Clausen, Philip T. L. C. ;
Zankari, Ea ;
Aarestrup, Frank M. ;
Lund, Ole .
JOURNAL OF ANTIMICROBIAL CHEMOTHERAPY, 2016, 71 (09) :2484-2488
[4]   Prediction of protein-protein interactions between viruses and human by an SVM model [J].
Cui, Guangyu ;
Fang, Chao ;
Han, Kyungsook .
BMC BIOINFORMATICS, 2012, 13
[5]   Phenetic Comparison of Prokaryotic Genomes Using k-mers [J].
Deraspe, Maxime ;
Raymond, Frederic ;
Boisvert, Sebastien ;
Culley, Alexander ;
Roy, Paul H. ;
Laviolette, Francois ;
Corbeil, Jacques .
MOLECULAR BIOLOGY AND EVOLUTION, 2017, 34 (10) :2716-2729
[6]   Predictive computational phenotyping and biomarker discovery using reference-free genome comparisons [J].
Drouin, Alexandre ;
Giguere, Sebastien ;
Deraspe, Maxime ;
Marchand, Mario ;
Tyers, Michael ;
Loo, Vivian G. ;
Bourgault, Anne-Marie ;
Laviolette, Francois ;
Corbeil, Jacques .
BMC GENOMICS, 2016, 17
[7]   Improved annotation of antibiotic resistance determinants reveals microbial resistomes cluster by ecology [J].
Gibson, Molly K. ;
Forsberg, Kevin J. ;
Dantas, Gautam .
ISME JOURNAL, 2015, 9 (01) :207-216
[8]   Molecular serotyping and antimicrobial resistance profiles of Actinobacillus pleuropneumoniae isolated from pigs in South Korea [J].
Kim, Boram ;
Hur, Jin ;
Lee, Ji Yeong ;
Choi, Yoonyoung ;
Lee, John Hwa .
VETERINARY QUARTERLY, 2016, 36 (03) :137-144
[9]   Application of machine learning techniques to tuberculosis drug resistance analysis [J].
Kouchaki, Samaneh ;
Yang, Yang ;
Walker, Timothy M. ;
Walker, A. Sarah ;
Wilson, Daniel J. ;
Peto, Timothy E. A. ;
Crook, Derrick W. ;
Clifton, David A. ;
Hoosdally, Sarah J. ;
Cruz, Ana L. Gibertoni ;
Carter, Joshua ;
Grazian, Clara ;
Fowler, Philip W. ;
Iqbal, Zamin ;
Hunt, Martin ;
Smith, E. Grace ;
Rathod, Priti ;
Jarrett, Lisa ;
Matias, Daniela ;
Cirillo, Daniela M. ;
Borroni, Emanuele ;
Battaglia, Simone ;
Ghodousi, Arash ;
Spitaleri, Andrea ;
Cabibbe, Andrea ;
Tahseen, Sabira ;
Nilgiriwala, Kayzad ;
Shah, Sanchi ;
Rodrigues, Camilla ;
Kambli, Priti ;
Surve, Utkarsha ;
Khot, Rukhsar ;
Niemann, Stefan ;
Kohl, Thomas ;
Merker, Matthias ;
Hoffmann, Harald ;
Molodtsov, Nikolay ;
Plesnik, Sara ;
Ismail, Nazir ;
Omar, Shaheed Vally ;
Joseph, Lavania ;
Marubini, Elliott ;
Thwaites, Guy ;
Thuong Nguyen Thuy Thuong ;
Nhung Hoang Ngoc ;
Srinivasan, Vijay ;
Moore, David ;
Coronel, Jorge ;
Solano, Walter ;
Gao, George F. .
BIOINFORMATICS, 2019, 35 (13) :2276-2282
[10]   Using Support Vector Machine (SVM) for Classification of Selectivity of H1N1 Neuraminidase Inhibitors [J].
Li, Yang ;
Kong, Yue ;
Zhang, Mengdi ;
Yan, Aixia ;
Liu, Zhenming .
MOLECULAR INFORMATICS, 2016, 35 (3-4) :116-124