Validation of a Bioinformatics Workflow for Routine Analysis of Whole-Genome Sequencing Data and Related Challenges for Pathogen Typing in a European National Reference Center: Neisseria meningitidis as a Proof-of-Concept

被引:40
作者
Bogaerts, Bert [1 ]
Winand, Raf [1 ]
Fu, Qiang [1 ]
Van Braekel, Julien [1 ]
Ceyssens, Pieter-Jan [2 ]
Mattheus, Wesley [2 ]
Bertrand, Sophie [2 ]
De Keersmaecker, Sigrid C. J. [1 ]
Roosens, Nancy H. C. [1 ]
Vanneste, Kevin [1 ]
Aminov, Rustam [3 ]
De Toro, Marla [4 ]
Maruyama, Fumito [5 ]
Vanneste, Kevin [1 ]
机构
[1] Sciensano, Transversal Act Appl Genom, Brussels, Belgium
[2] Sciensano, Bacterial Dis, Brussels, Belgium
[3] Univ Aberdeen, Aberdeen, Scotland
[4] Ctr Invest Biomed La Rioja, Logrono, Spain
[5] Hiroshima Univ, Higashihiroshima, Japan
关键词
Neisseria meningitidis; whole-genome sequencing; validation; public health; national reference center; PUBLIC-HEALTH; CLINICAL MICROBIOLOGY; MENINGOCOCCAL DISEASE; QUALITY ASSESSMENT; READ ALIGNMENT; EPIDEMIOLOGY; SURVEILLANCE; IDENTIFICATION; BIOLOGY; TOOL;
D O I
10.3389/fmicb.2019.00362
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Despite being a well-established research method, the use of whole-genome sequencing (WGS) for routine molecular typing and pathogen characterization remains a substantial challenge due to the required bioinformatics resources and/or expertise. Moreover, many national reference laboratories and centers, as well as other laboratories working under a quality system, require extensive validation to demonstrate that employed methods are "fit-for-purpose" and provide high-quality results. A harmonized framework with guidelines for the validation of WGS workflows does currently, however, not exist yet, despite several recent case studies highlighting the urgent need thereof. We present a validation strategy focusing specifically on the exhaustive characterization of the bioinformatics analysis of a WGS workflow designed to replace conventionally employed molecular typing methods for microbial isolates in a representative small-scale laboratory, using the pathogen Neisseria meningitidis as a proof-of-concept. We adapted several classically employed performance metrics specifically toward three different bioinformatics assays: resistance gene characterization (based on the ARG-ANNOT, ResFinder, CARD, and NDARO databases), several commonly employed typing schemas (including, among others, core genome multilocus sequence typing), and serogroup determination. We analyzed a core validation dataset of 67 well-characterized samples typed by means of classical genotypic and/or phenotypic methods that were sequenced in-house, allowing to evaluate repeatability, reproducibility, accuracy, precision, sensitivity, and specificity of the different bioinformatics assays. We also analyzed an extended validation dataset composed of publicly available WGS data for 64 samples by comparing results of the different bioinformatics assays against results obtained from commonly used bioinformatics tools. We demonstrate high performance, with values for all performance metrics >87%, >97%, and >90% for the resistance gene characterization, sequence typing, and serogroup determination assays, respectively, for both validation datasets. Our WGS workflow has been made publicly available as a "push-button" pipeline for Illumina data at https://galaxy.sciensano.be to showcase its implementation for non-profit and/or academic usage. Our validation strategy can be adapted to other WGS workflows for other pathogens of interest and demonstrates the added value and feasibility of employing WGS with the aim of being integrated into routine use in an applied public health setting.
引用
收藏
页数:19
相关论文
共 59 条
[1]   Whole-Genome Sequencing for Routine Pathogen Surveillance in Public Health: a Population Snapshot of Invasive Staphylococcus aureus in Europe [J].
Aanensen, David M. ;
Feil, Edward J. ;
Holden, Matthew T. G. ;
Dordel, Janina ;
Yeats, Corin A. ;
Fedosejev, Artemij ;
Goater, Richard ;
Castillo-Ramirez, Santiago ;
Corander, Jukka ;
Colijn, Caroline ;
Chlebowicz, Monika A. ;
Schouls, Leo ;
Heck, Max ;
Pluister, Gerlinde ;
Ruimy, Raymond ;
Kahlmeter, Gunnar ;
Ahman, Jenny ;
Matuschek, Erika ;
Friedrich, Alexander W. ;
Parkhill, Julian ;
Bentley, Stephen D. ;
Spratt, Brian G. ;
Grundmann, Hajo .
MBIO, 2016, 7 (03)
[2]   The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update [J].
Afgan, Enis ;
Baker, Dannon ;
van den Beek, Marius ;
Blankenberg, Daniel ;
Bouvier, Dave ;
Cech, Martin ;
Chilton, John ;
Clements, Dave ;
Coraor, Nate ;
Eberhard, Carl ;
Gruening, Bjoern ;
Guerler, Aysam ;
Hillman-Jackson, Jennifer ;
Von Kuster, Greg ;
Rasche, Eric ;
Soranzo, Nicola ;
Turaga, Nitesh ;
Taylor, James ;
Nekrutenko, Anton ;
Goecks, Jeremy .
NUCLEIC ACIDS RESEARCH, 2016, 44 (W1) :W3-W10
[3]   The Future of Whole-Genome Sequencing for Public Health and the Clinic [J].
Allard, Marc W. .
JOURNAL OF CLINICAL MICROBIOLOGY, 2016, 54 (08) :1946-1948
[4]  
Angers-Loustau Alexandre, 2018, F1000Res, V7, DOI 10.12688/f1000research.14509.2
[5]   SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing [J].
Bankevich, Anton ;
Nurk, Sergey ;
Antipov, Dmitry ;
Gurevich, Alexey A. ;
Dvorkin, Mikhail ;
Kulikov, Alexander S. ;
Lesin, Valery M. ;
Nikolenko, Sergey I. ;
Son Pham ;
Prjibelski, Andrey D. ;
Pyshkin, Alexey V. ;
Sirotkin, Alexander V. ;
Vyahhi, Nikolay ;
Tesler, Glenn ;
Alekseyev, Max A. ;
Pevzner, Pavel A. .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2012, 19 (05) :455-477
[6]   Trimmomatic: a flexible trimmer for Illumina sequence data [J].
Bolger, Anthony M. ;
Lohse, Marc ;
Usadel, Bjoern .
BIOINFORMATICS, 2014, 30 (15) :2114-2120
[7]   A gene-by-gene population genomics platform: de novo assembly, annotation and genealogical analysis of 108 representative Neisseria meningitidis genomes [J].
Bratcher, Holly B. ;
Corton, Craig ;
Jolley, Keith A. ;
Parkhill, Julian ;
Maiden, Martin C. J. .
BMC GENOMICS, 2014, 15
[8]   Variation of the factor H-binding protein of Neisseria meningitidis [J].
Brehony, Carina ;
Wilson, Daniel J. ;
Maiden, Martin C. J. .
MICROBIOLOGY-SGM, 2009, 155 :4155-4169
[9]   BLAST plus : architecture and applications [J].
Camacho, Christiam ;
Coulouris, George ;
Avagyan, Vahram ;
Ma, Ning ;
Papadopoulos, Jason ;
Bealer, Kevin ;
Madden, Thomas L. .
BMC BIOINFORMATICS, 2009, 10
[10]  
Carriço JA, 2013, EUROSURVEILLANCE, V18, P32