Component Parts of Bacteriophage Virions Accurately Defined by a Machine-Learning Approach Built on Evolutionary Features

被引:15
作者
Thung, Tze Y. [1 ,2 ,3 ]
White, Murray E. [1 ,2 ,3 ]
Dai, Wei [1 ,2 ,4 ]
Wilksch, Jonathan J. [1 ,2 ,3 ]
Bamert, Rebecca S. [1 ,2 ,3 ]
Rocker, Andrea [1 ,2 ]
Stubenrauch, Christopher J. [1 ,2 ,3 ]
Williams, Daniel [1 ,2 ,3 ]
Huang, Cheng [5 ,6 ,7 ]
Schittelhelm, Ralf [5 ,6 ,7 ]
Barr, Jeremy J. [3 ,8 ]
Jameson, Eleanor [9 ]
McGowan, Sheena [1 ,2 ,3 ]
Zhang, Yanju [4 ]
Wang, Jiawei [1 ,2 ,3 ]
Dunstan, Rhys A. [1 ,2 ,3 ]
Lithgow, Trevor [1 ,2 ,3 ]
机构
[1] Monash Univ, Biomed Discovery Inst, Infect & Immun Program, Clayton, Vic, Australia
[2] Monash Univ, Dept Microbiol, Clayton, Vic, Australia
[3] Monash Univ, Ctr Impact AMR, Clayton, Vic, Australia
[4] Guilin Univ Elect Technol, Sch Comp Sci & Informat Secur, Guilin, Peoples R China
[5] Monash Univ, Monash Prote & Metabol Facil, Clayton, Vic, Australia
[6] Monash Univ, Biomed Discovery Inst, Clayton, Vic, Australia
[7] Monash Univ, Dept Biochem & Mol Biol, Clayton, Vic, Australia
[8] Monash Univ, Sch Biol Sci, Clayton, Vic, Australia
[9] Univ Warwick, Sch Life Sci, Coventry, W Midlands, England
基金
澳大利亚研究理事会;
关键词
antimicrobial resistance; phage therapy; bacteriophage; artificial intelligence; Klebsiella; bacteriophage therapy; bacteriophages; machine learning; virion structure; TAIL MACHINES; PREDICTION; SEQUENCES; SELECTION; SERVER;
D O I
10.1128/mSystems.00242-21
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Antimicrobial resistance (AMR) continues to evolve as a major threat to human health, and new strategies are required for the treatment of AMR infections. Bacteriophages (phages) that kill bacterial pathogens are being identified for use in phage therapies, with the intention to apply these bactericidal viruses directly into the infection sites in bespoke phage cocktails. Despite the great unsampled phage diversity for this purpose, an issue hampering the roll out of phage therapy is the poor quality annotation of many of the phage genomes, particularly for those from infrequently sampled environmental sources. We developed a computational tool called STEP3 to use the "evolutionary features" that can be recognized in genome sequences of diverse phages. These features, when integrated into an ensemble framework, achieved a stable and robust prediction performance when benchmarked against other prediction tools using phages from diverse sources. Validation of the prediction accuracy of STEP3 was conducted with high-resolution mass spectrometry analysis of two novel phages, isolated from a watercourse in the Southern Hemisphere. STEP3 provides a robust computational approach to distinguish specific and universal features in phages to improve the quality of phage cocktails and is available for use at http://step3.erc.monash.edu/. IMPORTANCE In response to the global problem of antimicrobial resistance, there are moves to use bacteriophages (phages) as therapeutic agents. Selecting which phages will be effective therapeutics relies on interpreting features contributing to shelf-life and applicability to diagnosed infections. However, the protein components of the phage virions that dictate these properties vary so much in sequence that best estimates suggest failure to recognize up to 90% of them. We have utilized this diversity in evolutionary features as an advantage, to apply machine learning for prediction accuracy for diverse components in phage virions. We benchmark this new tool showing the accurate recognition and evaluation of phage component parts using genome sequence data of phages from undersampled environments, where the richest diversity of phage still lies.
引用
收藏
页数:17
相关论文
共 72 条
[1]   Pred-BVP-Unb: Fast prediction of bacteriophage Virion proteins using un-biased multi-perspective properties with recursive feature elimination [J].
Arif, Muhammad ;
Ali, Farman ;
Ahmad, Saeed ;
Kabir, Muhammad ;
Ali, Zakir ;
Hayat, Maqsood .
GENOMICS, 2020, 112 (02) :1565-1574
[2]   Molecular assembly and structure of the bacteriophage T4 tail [J].
Arisaka F. ;
Yap M.L. ;
Kanamaru S. ;
Rossmann M.G. .
Biophysical Reviews, 2016, 8 (4) :385-396
[3]   SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing [J].
Bankevich, Anton ;
Nurk, Sergey ;
Antipov, Dmitry ;
Gurevich, Alexey A. ;
Dvorkin, Mikhail ;
Kulikov, Alexander S. ;
Lesin, Valery M. ;
Nikolenko, Sergey I. ;
Son Pham ;
Prjibelski, Andrey D. ;
Pyshkin, Alexey V. ;
Sirotkin, Alexander V. ;
Vyahhi, Nikolay ;
Tesler, Glenn ;
Alekseyev, Max A. ;
Pevzner, Pavel A. .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2012, 19 (05) :455-477
[4]   BlasterJS']JS: A novel interactive Java']JavaScript visualisation component for BLAST alignment results [J].
Blanco-Miguez, Aitor ;
Fdez-Riverola, Florentino ;
Sanchez, Borja ;
Lourenco, Analia .
PLOS ONE, 2018, 13 (10)
[5]   Trimmomatic: a flexible trimmer for Illumina sequence data [J].
Bolger, Anthony M. ;
Lohse, Marc ;
Usadel, Bjoern .
BIOINFORMATICS, 2014, 30 (15) :2114-2120
[6]   Isolation and Characterization of Pectobacterium Phage vB_PatM_CB7: New Insights into the Genus Certrevirus [J].
Buttimer, Colin ;
Lynch, Caoimhe ;
Hendrix, Hanne ;
Neve, Horst ;
Noben, Jean-Paul ;
Lavigne, Rob ;
Coffey, Aidan .
ANTIBIOTICS-BASEL, 2020, 9 (06) :1-24
[7]   A Smooth-Type, Phage-Resistant Klebsiella pneumoniae Mutant Strain Reveals that OmpC Is Indispensable for Infection by Phage GH-K3 [J].
Cai, Ruopeng ;
Wu, Mei ;
Zhang, Hao ;
Zhang, Yufeng ;
Cheng, Mengjun ;
Guo, Zhimin ;
Ji, Yalu ;
Xi, Hengyu ;
Wang, Xinwu ;
Xue, Yibing ;
Sun, Changjiang ;
Feng, Xin ;
Lei, Liancheng ;
Tong, Yigang ;
Liu, Xiaoyun ;
Han, Wenyu ;
Gu, Jingmin .
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2018, 84 (21)
[8]  
Centers for Disease Control and Prevention, 2019, ANTIBIOTIC RESISTANC, DOI DOI 10.15620/CDC:82532
[9]   PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method [J].
Charoenkwan, Phasit ;
Kanthawong, Sakawrat ;
Schaduangrat, Nalini ;
Yana, Janchai ;
Shoombuatong, Watshara .
CELLS, 2020, 9 (02)
[10]   Prediction of protein subcellular locations by incorporating quasi-sequence-order effect [J].
Chou, KC .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2000, 278 (02) :477-483