German-Wide Interlaboratory Study Compares Consistency, Accuracy and Reproducibility of Whole-Genome Short Read Sequencing

被引:16
作者
Uelze, Laura [1 ]
Borowiak, Maria [1 ]
Boenn, Markus [2 ]
Brinks, Erik [3 ]
Deneke, Carlus [1 ]
Hankeln, Thomas [4 ]
Kleta, Sylvia [1 ]
Murr, Larissa [5 ]
Stingl, Kerstin [1 ]
Szabo, Kathrin [6 ]
Tausch, Simon H. [1 ]
Woehlke, Anne [7 ]
Malorny, Burkhard [1 ]
机构
[1] German Fed Inst Risk Assessment BfR, Dept Biol Safety, Berlin, Germany
[2] Landesamt Verbraucherschutz Sachsen Anhalt LAV, Halle, Saale, Germany
[3] Max Rubner Inst MRI, Dept Microbiol & Biotechnol, Kiel, Germany
[4] Johannes Gutenberg Univ Mainz, AG Mol Genet & Genome Anal, Inst Organism & Mol Evolut, Mainz, Germany
[5] Bavarian Hlth & Food Safety Author LGL, Oberschleissheim, Germany
[6] Fed Off Consumer Protect & Food Safety BVL, Dept 5, Berlin, Germany
[7] Lower Saxony State Off Consumer Protect & Food Sa, Food & Vet Inst, Braunschweig, Germany
关键词
interlaboratory study; whole-genome sequencing; food safety; illumina; ion torrent; GENERATION; ILLUMINA;
D O I
10.3389/fmicb.2020.573972
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
We compared the consistency, accuracy and reproducibility of next-generation short read sequencing between ten laboratories involved in food safety (research institutes, state laboratories, universities and companies) from Germany and Austria. Participants were asked to sequence six DNA samples of three bacterial species (Campylobacter jejuni,Listeria monocytogenesandSalmonella enterica) in duplicate, according to their routine in-house sequencing protocol. Four different types of Illumina sequencing platforms (MiSeq, NextSeq, iSeq, NovaSeq) and one Ion Torrent sequencing instrument (S5) were involved in the study. Sequence quality parameters were determined for all data sets and centrally compared between laboratories. SNP and cgMLST calling were performed to assess the reproducibility of sequence data collected for individual samples. Overall, we found Illumina short read data to be more accurate (higher base calling accuracy, fewer miss-assemblies) and consistent (little variability between independent sequencing runs within a laboratory) than Ion Torrent sequence data, with little variation between the different Illumina instruments. Two laboratories with Illumina instruments submitted sequence data with lower quality, probably due to the use of a library preparation kit, which shows difficulty in sequencing low GC genome regions. Differences in data quality were more evident after assembling short reads into genome assemblies, with Ion Torrent assemblies featuring a great number of allele differences to Illumina assemblies. Clonality of samples was confirmed through SNP calling, which proved to be a more suitable method for an integrated data analysis of Illumina and Ion Torrent data sets in this study.
引用
收藏
页数:14
相关论文
共 36 条
[1]   Identification of Salmonella for public health surveillance using whole genome sequencing [J].
Ashton, Philip M. ;
Nair, Satheesh ;
Peters, Tansy M. ;
Bale, Janet A. ;
Powell, David G. ;
Painset, Anais ;
Tewolde, Rediat ;
Schaefer, Ulf ;
Jenkins, Claire ;
Dallman, Timothy J. ;
de Pinna, Elizabeth M. ;
Grant, Kathie A. .
PEERJ, 2016, 4
[2]   Summarizing and correcting the GC content bias in high-throughput sequencing [J].
Benjamini, Yuval ;
Speed, Terence P. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (10) :e72
[3]   Complete Genome Sequence of a VIM-1-Producing Salmonella enterica subsp. enterica Serovar Infantis Isolate Derived from Minced Pork Meat [J].
Borowia, Maria ;
Fischer, Jennie ;
Baumann, Beatrice ;
Hammerl, Jens A. ;
Szabo, Istvan ;
Malornya, Burkhard .
GENOME ANNOUNCEMENTS, 2018, 6 (17)
[4]   GC bias affects genomic and metagenomic reconstructions, underrepresenting GC-poor organisms [J].
Browne, Patrick Denis ;
Nielsen, Tue Kjaergaard ;
Kot, Witold ;
Aggerholm, Anni ;
Gilbert, M. Thomas P. ;
Puetz, Lara ;
Rasmussen, Morten ;
Zervas, Athanasios ;
Hansen, Lars Hestbjerg .
GIGASCIENCE, 2020, 9 (02)
[5]   Next generation sequencing technology: Advances and applications [J].
Buermans, H. P. J. ;
den Dunnen, J. T. .
BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR BASIS OF DISEASE, 2014, 1842 (10) :1932-1941
[6]   In Silico Detection and Typing of Plasmids using PlasmidFinder and Plasmid Multilocus Sequence Typing [J].
Carattoli, Alessandra ;
Zankari, Ea ;
Garcia-Fernandez, Aurora ;
Larsen, Mette Voldby ;
Lund, Ole ;
Villa, Laura ;
Aarestrup, Frank Moller ;
Hasman, Henrik .
ANTIMICROBIAL AGENTS AND CHEMOTHERAPY, 2014, 58 (07) :3895-3903
[7]   fastp: an ultra-fast all-in-one FASTQ preprocessor [J].
Chen, Shifu ;
Zhou, Yanqing ;
Chen, Yaru ;
Gu, Jia .
BIOINFORMATICS, 2018, 34 (17) :884-890
[8]  
Chun J, 2018, INT J SYST EVOL MICR, V68, P461, DOI [10.1099/ijsem.0.002516, 10.1099/ijsem.0.002532]
[9]   Standardization and quality management in next-generation sequencing [J].
Endrullat, Christoph ;
Gloekler, Joern ;
Franke, Philipp ;
Frohme, Marcus .
APPLIED AND TRANSLATIONAL GENOMICS, 2016, 10 :2-9
[10]   A comparison of tools for the simulation of genomic next-generation sequencing data [J].
Escalona, Merly ;
Rocha, Sara ;
Posada, David .
NATURE REVIEWS GENETICS, 2016, 17 (08) :459-469