PromoterSweep: a tool for identification of transcription factor binding sites

被引:2
作者
del Val, Coral [1 ,2 ]
Pelz, Oliver [1 ]
Glatting, Karl-Heinz [1 ]
Barta, Endre [1 ,3 ]
Hotz-Wagenblatt, Agnes [1 ]
机构
[1] DKFZ, German Canc Res Ctr, D-69120 Heidelberg, Germany
[2] Univ Granada, Fac Informat, E-18071 Granada, Spain
[3] Agr Biotechnol Ctr, H-2100 Godollo, Hungary
关键词
Promoter; Transcription factor; Motif discovery; Annotation; DATABASE; DISCOVERY; PREDICTION; INTERFACE; ALGORITHM; SEQUENCES; FRAMEWORK; MOTIFS; W2H;
D O I
10.1007/s00214-009-0643-8
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
There are many tools available for the prediction of potential promoter regions and the transcription factor binding sites (TFBS) harboured by them. Unfortunately, these tools cannot really avoid the prediction of vast amounts of false positives, the greatest problem in promoter analysis. The combination of different methods and algorithms has shown an improvement in prediction accuracy for similar biological problems such as gene prediction. The web-tool presented here uses this approach to perform an exhaustive integrative analysis, identification and annotation of potential promoter regions. The combination of methods employed includes searches in different experimental promoter databases to identify promoter regions and their orthologs, use of TFBS databases and search tools, and a phylogenetic footprinting strategy, combining multiple alignment of genomic sequences together with motif discovery tools that were tested previously in order to get the best method combination. The pipeline is available for academic users at the HUSAR open server "http://genius.embnet.dkfz-heidelberg.de/menu/biounit/open-husar/" . It integrates all of this information and identifies among the huge number of TFBS predictions those, which are more likely to be potentially functional.
引用
收藏
页码:583 / 591
页数:9
相关论文
共 31 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
Bailey T L, 1995, Proc Int Conf Intell Syst Mol Biol, V3, P21
[3]   Performance assessment of promoter predictions on ENCODE regions in the EGASP experiment [J].
Bajic, Vladimir B. ;
Brent, Michael R. ;
Brown, Randall H. ;
Frankish, Adam ;
Harrow, Jennifer ;
Ohler, Uwe ;
Solovyev, Victor V. ;
Tan, Sin Lam .
GENOME BIOLOGY, 2006, 7 (Suppl 1)
[4]   DoOP:: Databases of orthologous promoters, collections of clusters of orthologous upstream sequences from chordates and plants [J].
Barta, E ;
Sebestyén, E ;
Pálfy, TB ;
Tóth, G ;
Ortutay, CP ;
Patthy, L .
NUCLEIC ACIDS RESEARCH, 2005, 33 :D86-D90
[5]   DNA dynamically directs its own transcription initiation [J].
Choi, CH ;
Kalosakas, G ;
Rasmussen, KO ;
Hiromura, M ;
Bishop, AR ;
Usheva, A .
NUCLEIC ACIDS RESEARCH, 2004, 32 (04) :1584-1590
[6]  
ENDRE B, 2007, METHOD MOL BIOL, V395, P319
[7]   A task framework for the web interface W2H [J].
Ernst, P ;
Glatting, KH ;
Suhai, S .
BIOINFORMATICS, 2003, 19 (02) :278-282
[8]   A Gibbs sampler for identification of symmetrically structured, spaced DNA motifs with improved estimation of the signal length [J].
Favorov, AV ;
Gelfand, MS ;
Gerasimova, AV ;
Ravcheev, DA ;
Mironov, AA ;
Makeev, VJ .
BIOINFORMATICS, 2005, 21 (10) :2240-2245
[9]   Ensembl 2009 [J].
Hubbard, T. J. P. ;
Aken, B. L. ;
Ayling, S. ;
Ballester, B. ;
Beal, K. ;
Bragin, E. ;
Brent, S. ;
Chen, Y. ;
Clapham, P. ;
Clarke, L. ;
Coates, G. ;
Fairley, S. ;
Fitzgerald, S. ;
Fernandez-Banet, J. ;
Gordon, L. ;
Graf, S. ;
Haider, S. ;
Hammond, M. ;
Holland, R. ;
Howe, K. ;
Jenkinson, A. ;
Johnson, N. ;
Kahari, A. ;
Keefe, D. ;
Keenan, S. ;
Kinsella, R. ;
Kokocinski, F. ;
Kulesha, E. ;
Lawson, D. ;
Longden, I. ;
Megy, K. ;
Meidl, P. ;
Overduin, B. ;
Parker, A. ;
Pritchard, B. ;
Rios, D. ;
Schuster, M. ;
Slater, G. ;
Smedley, D. ;
Spooner, W. ;
Spudich, G. ;
Trevanion, S. ;
Vilella, A. ;
Vogel, J. ;
White, S. ;
Wilder, S. ;
Zadissa, A. ;
Birney, E. ;
Cunningham, F. ;
Curwen, V. .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D690-D697
[10]   Measuring similarities between transcription factor binding sites [J].
Kielbasa, SM ;
Gonze, D ;
Herzel, H .
BMC BIOINFORMATICS, 2005, 6 (1)