ESSENTIALS: Software for Rapid Analysis of High Throughput Transposon Insertion Sequencing Data

被引:102
作者
Zomer, Aldert [1 ,2 ]
Burghout, Peter [1 ]
Bootsma, Hester J. [1 ]
Hermans, Peter W. M. [1 ]
van Hijum, Sacha A. F. T. [2 ,3 ,4 ,5 ]
机构
[1] Radboud Univ Nijmegen, Lab Pediat Infect Dis, Med Ctr, NL-6525 ED Nijmegen, Netherlands
[2] Radboud Univ Nijmegen, Ctr Mol & Biomol Informat, Nijmegen Ctr Mol Life Sci, Med Ctr, NL-6525 ED Nijmegen, Netherlands
[3] Kluyver Ctr Genom Ind Fermentat, NIZO Food Res, Ede, Netherlands
[4] TI Food & Nutr, Wageningen, Netherlands
[5] Netherlands Bioinformat Ctr, Nijmegen, Netherlands
关键词
DIFFERENTIAL EXPRESSION ANALYSIS; CONDITIONALLY ESSENTIAL GENES; IDENTIFICATION; GENOME;
D O I
10.1371/journal.pone.0043012
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
High-throughput analysis of genome-wide random transposon mutant libraries is a powerful tool for (conditional) essential gene discovery. Recently, several next-generation sequencing approaches, e.g. Tn-seq/INseq, HITS and TraDIS, have been developed that accurately map the site of transposon insertions by mutant-specific amplification and sequence readout of DNA flanking the transposon insertions site, assigning a measure of essentiality based on the number of reads per insertion site flanking sequence or per gene. However, analysis of these large and complex datasets is hampered by the lack of an easy to use and automated tool for transposon insertion sequencing data. To fill this gap, we developed ESSENTIALS, an open source, web-based software tool for researchers in the genomics field utilizing transposon insertion sequencing analysis. It accurately predicts (conditionally) essential genes and offers the flexibility of using different sample normalization methods, genomic location bias correction, data preprocessing steps, appropriate statistical tests and various visualizations to examine the results, while requiring only a minimum of input and hands-on work from the researcher. We successfully applied ESSENTIALS to in-house and published Tn-seq, TraDIS and HITS datasets and we show that the various pre- and post-processing steps on the sequence reads and count data with ESSENTIALS considerably improve the sensitivity and specificity of predicted gene essentiality.
引用
收藏
页数:9
相关论文
共 26 条
[1]   High-throughput phenotyping using parallel sequencing of RNA interference targets in the African trypanosome [J].
Alsford, Sam ;
Turner, Daniel J. ;
Obado, Samson O. ;
Sanchez-Flores, Alejandro ;
Glover, Lucy ;
Berriman, Matthew ;
Hertz-Fowler, Christiane ;
Horn, David .
GENOME RESEARCH, 2011, 21 (06) :915-924
[2]   Development of genomic array footprinting for identification of conditionally essential genes in Streptococcus pneumoniae [J].
Bijlsma, Jetta J. E. ;
Burghout, Peter ;
Moosterman, Tomas G. ;
Bootsma, Flester J. ;
de Jong, Anne ;
Hermans, Peter W. M. ;
Kuipers, Oscar P. .
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2007, 73 (05) :1514-1524
[3]   A comparison of normalization methods for high density oligonucleotide array data based on variance and bias [J].
Bolstad, BM ;
Irizarry, RA ;
Åstrand, M ;
Speed, TP .
BIOINFORMATICS, 2003, 19 (02) :185-193
[4]   MINOMICS: visualizing prokaryote transcriptomics and proteomics data in a genomic context [J].
Brouwer, Rutger W. W. ;
van Hijum, Sacha A. F. T. ;
Kuipers, Oscar P. .
BIOINFORMATICS, 2009, 25 (01) :139-140
[5]   Search for genes essential for pneumococcal transformation: The RadA DNA repair protein plays a role in genomic recombination of donor DNA [J].
Burghout, Peter ;
Bootsma, Hester J. ;
Kloosterman, Tomas G. ;
Bijlsma, Jetta J. E. ;
de Jongh, Christa E. ;
Kuipers, Oscar P. ;
Hermans, Peter W. M. .
JOURNAL OF BACTERIOLOGY, 2007, 189 (18) :6540-6550
[6]   PASS: a program to align short sequences [J].
Campagna, Davide ;
Albiero, Alessandro ;
Bilardi, Alessandra ;
Caniato, Elisa ;
Forcato, Claudio ;
Manavski, Svetlin ;
Vitulo, Nicola ;
Valle, Giorgio .
BIOINFORMATICS, 2009, 25 (07) :967-968
[7]   OGEE: an online gene essentiality database [J].
Chen, Wei-Hua ;
Minguez, Pablo ;
Lercher, Martin J. ;
Bork, Peer .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D901-D906
[8]   The essential genome of a bacterium [J].
Christen, Beat ;
Abeliuk, Eduardo ;
Collier, John M. ;
Kalogeraki, Virginia S. ;
Passarelli, Ben ;
Coller, John A. ;
Fero, Michael J. ;
McAdams, Harley H. ;
Shapiro, Lucy .
MOLECULAR SYSTEMS BIOLOGY, 2011, 7
[9]   Genome-Scale Identification of Resistance Functions in Pseudomonas aeruginosa Using Tn-seq [J].
Gallagher, Larry A. ;
Shendure, Jay ;
Manoil, Colin .
MBIO, 2011, 2 (01)
[10]   Tracking insertion mutants within libraries by deep sequencing and a genome-wide screen for Haemophilus genes required in the lung [J].
Gawronski, Jeffrey D. ;
Wong, Sandy M. S. ;
Giannoukos, Georgia ;
Ward, Doyle V. ;
Akerley, Brian J. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (38) :16422-16427