Normalization of transposon-mutant library sequencing datasets to improve identification of conditionally essential genes

Cited by: 10
Authors
DeJesus, Michael A. [1 ]
Ioerger, Thomas R. [1 ]
Affiliations
[1] Texas A&M Univ, Dept Comp Sci, College Stn, TX 77843 USA
Keywords
Normalization; TnSeq; essentiality; differential expression analysis
DOI
10.1142/S021972001642004X
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Sequencing of transposon-mutant libraries using next-generation sequencing (TnSeq) has become a popular method for determining which genes and non-coding regions are essential for growth under various conditions in bacteria. For methods that rely on quantitative comparison of read counts at transposon insertion sites, proper normalization of TnSeq datasets is vitally important. Real TnSeq datasets are often noisy and exhibit a significant skew that can be dominated by high counts at a small number of sites (often for non-biological reasons). Comparing two datasets that are not appropriately normalized can cause the artifactual appearance of Differentially Essential (DE) genes in a statistical test, constituting type I errors (false positives). In this paper, we propose a novel method for normalization of TnSeq datasets that corrects for the skew of read-count distributions by fitting them to a Beta-Geometric distribution. We show that this read-count correction procedure reduces the number of false positives when comparing replicate datasets grown under the same conditions (for which no genuine differences in essentiality are expected). We compare these results to those obtained with other normalization procedures, and show that the proposed method yields a greater reduction in the number of false positives. In addition, we investigate the effects of normalization on the detection of DE genes.
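To make the skew mentioned in the abstract concrete, the following minimal Python sketch evaluates a Beta-Geometric distribution and compares its tail with that of a plain geometric distribution of the same mean. It is not the paper's fitting or correction procedure. The parameterization (support starting at 1, a geometric success probability drawn from a Beta(alpha, beta) prior) and the values alpha = 3, beta = 2 are assumptions chosen only for illustration, and all function names are hypothetical.

```python
import math


def log_beta(a, b):
    """Natural log of the Beta function B(a, b), via log-gamma for stability."""
    return math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)


def beta_geometric_pmf(k, alpha, beta):
    """P(X = k), k = 1, 2, ..., where X | p ~ Geometric(p) and p ~ Beta(alpha, beta).

    Assumed parameterization: P(X = k) = B(alpha + 1, beta + k - 1) / B(alpha, beta).
    """
    if k < 1:
        return 0.0
    return math.exp(log_beta(alpha + 1, beta + k - 1) - log_beta(alpha, beta))


def beta_geometric_sf(k, alpha, beta):
    """Tail probability P(X > k) = B(alpha, beta + k) / B(alpha, beta)."""
    return math.exp(log_beta(alpha, beta + k) - log_beta(alpha, beta))


def geometric_sf(k, p):
    """Tail probability P(X > k) for a plain geometric distribution on 1, 2, ..."""
    return (1.0 - p) ** k


# Illustrative parameters only (not taken from the paper).
alpha, beta = 3.0, 2.0

# Mean of the Beta-Geometric for alpha > 1: (alpha + beta - 1) / (alpha - 1).
mean = (alpha + beta - 1.0) / (alpha - 1.0)

# A plain geometric distribution with the same mean, for comparison.
p_matched = 1.0 / mean

for k in (5, 10, 20):
    print(k, beta_geometric_sf(k, alpha, beta), geometric_sf(k, p_matched))
```

With matched means, the Beta-Geometric tail decays polynomially while the geometric tail decays exponentially, which is one way to picture a read-count distribution dominated by high counts at a small number of sites.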
Pages: 20