SigmoID: a user-friendly tool for improving bacterial genome annotation through analysis of transcription control signals

被引:10
作者
Nikolaichik, Yevgeny [1 ]
Damienikan, Aliaksandr U. [1 ]
机构
[1] Belarusian State Univ, Dept Mol Biol, Minsk, BELARUS
关键词
Transcription factor binding site; Promoter; Terminator; Genome browser; Genome annotation; Sequence logo; Pectobacterium atrosepticum; PV. TOMATO DC3000; ESCHERICHIA-COLI; 2-COMPONENT SYSTEM; GENE-EXPRESSION; WEB SERVER; SEQUENCE; REGULON; DNA; IDENTIFICATION; DISCOVERY;
D O I
10.7717/peerj.2056
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The majority of bacterial genome annotations are currently automated and based on a 'gene by gene' approach. Regulatory signals and operon structures are rarely taken into account which often results in incomplete and even incorrect gene function assignments. Here we present SigmoID, a cross-platform (OS X, Linux and Windows) open-source application aiming at simplifying the identification of transcription regulatory sites (promoters, transcription factor binding sites and terminators) in bacterial genomes and providing assistance in correcting annotations in accordance with regulatory information. SigmoID combines a user-friendly graphical interface to well known command line tools with a genome browser for visualising regulatory elements in genomic context. Integrated access to online databases with regulatory information (RegPrecise and RegulonDB) and web-based search engines speeds up genome analysis and simplifies correction of genome annotation. We demonstrate some features of SigmoID by constructing a series of regulatory protein binding site profiles for two groups of bacteria: Soft Rot Enterobacteriaceae (Pectobacterium and Dickeya spp.) and Pseudomonas spp. Furthermore, we inferred over 900 transcription factor binding sites and alternative sigma factor promoters in the annotated genome of Pectobacterium afrosepticum. These regulatory signals control putative transcription units covering about 40% of the P. afrosepticum chromosome. Reviewing the annotation in cases where it didn't fit with regulatory information allowed us to correct product and gene names for over 300 loci.
引用
收藏
页数:21
相关论文
共 63 条
[1]   A role for the Rcs phosphorelay in regulating expression of plant cell wall degrading enzymes in Pectobacterium carotovorum subsp carotovorum [J].
Andresen, Liis ;
Sala, Erki ;
Koiv, Viia ;
Maee, Andres .
MICROBIOLOGY-SGM, 2010, 156 :1323-1334
[2]  
[Anonymous], 1994, MOL BIOL
[3]   The RAST server: Rapid annotations using subsystems technology [J].
Aziz, Ramy K. ;
Bartels, Daniela ;
Best, Aaron A. ;
DeJongh, Matthew ;
Disz, Terrence ;
Edwards, Robert A. ;
Formsma, Kevin ;
Gerdes, Svetlana ;
Glass, Elizabeth M. ;
Kubal, Michael ;
Meyer, Folker ;
Olsen, Gary J. ;
Olson, Robert ;
Osterman, Andrei L. ;
Overbeek, Ross A. ;
McNeil, Leslie K. ;
Paarmann, Daniel ;
Paczian, Tobias ;
Parrello, Bruce ;
Pusch, Gordon D. ;
Reich, Claudia ;
Stevens, Rick ;
Vassieva, Olga ;
Vonstein, Veronika ;
Wilke, Andreas ;
Zagnitko, Olga .
BMC GENOMICS, 2008, 9 (1)
[4]   The MEME Suite [J].
Bailey, Timothy L. ;
Johnson, James ;
Grant, Charles E. ;
Noble, William S. .
NUCLEIC ACIDS RESEARCH, 2015, 43 (W1) :W39-W49
[5]   Combining evidence using p-values: application to sequence homology searches [J].
Bailey, TL ;
Gribskov, M .
BIOINFORMATICS, 1998, 14 (01) :48-54
[6]   Compilation and analysis of σ54-dependent promoter sequences [J].
Barrios, H ;
Valderrama, B ;
Morett, E .
NUCLEIC ACIDS RESEARCH, 1999, 27 (22) :4305-4313
[7]   Genome sequence of the enterobacterial phytopathogen Erwinia carotovora subsp atroseptica and characterization of virulence factors [J].
Bell, KS ;
Sebaihia, M ;
Pritchard, L ;
Holden, MTG ;
Hyman, LJ ;
Holeva, MC ;
Thomson, NR ;
Bentley, SD ;
Churcher, LJC ;
Mungall, K ;
Atkin, R ;
Bason, N ;
Brooks, K ;
Chillingworth, T ;
Clark, K ;
Doggett, J ;
Fraser, A ;
Hance, Z ;
Hauser, H ;
Jagels, K ;
Moule, S ;
Norbertczak, H ;
Ormond, D ;
Price, C ;
Quail, MA ;
Sanders, M ;
Walker, D ;
Whitehead, S ;
Salmond, GPC ;
Birch, PRJ ;
Parkhill, J ;
Toth, IK .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (30) :11105-11110
[8]   Artemis: an integrated platform for visualization and analysis of high-throughput sequence-based experimental data [J].
Carver, Tim ;
Harris, Simon R. ;
Berriman, Matthew ;
Parkhill, Julian ;
McQuillan, Jacqueline A. .
BIOINFORMATICS, 2012, 28 (04) :464-469
[9]   RegTransBase - a database of regulatory sequences and interactions based on literature: a resource for investigating transcriptional regulation in prokaryotes [J].
Cipriano, Michael J. ;
Novichkov, Pavel N. ;
Kazakov, Alexey E. ;
Rodionov, Dmitry A. ;
Arkin, Adam P. ;
Gelfand, Mikhail S. ;
Dubchak, Inna .
BMC GENOMICS, 2013, 14
[10]   GenBank [J].
Clark, Karen ;
Karsch-Mizrachi, Ilene ;
Lipman, David J. ;
Ostell, James ;
Sayers, Eric W. .
NUCLEIC ACIDS RESEARCH, 2016, 44 (D1) :D67-D72