MatrixCatch - a novel tool for the recognition of composite regulatory elements in promoters

被引:22
作者
Deyneko, Igor V. [1 ,2 ]
Kel, Alexander E. [2 ,3 ]
Kel-Margoulis, Olga V. [2 ]
Deineko, Elena V. [4 ]
Wingender, Edgar [2 ,5 ]
Weiss, Siegfried [1 ]
机构
[1] Helmholtz Ctr Infect Res, Dept Mol Immunol, Braunschweig, Germany
[2] GeneXplain GmbH, Wolfenbuttel, Germany
[3] Inst Chem Biol & Fundamental Med SB RAS, Novosibirsk, Russia
[4] Inst Cytol & Genet SB RAS, Lab Plant Bioengn, Novosibirsk, Russia
[5] Univ Med Ctr Gottingen, Inst Bioinformat, Gottingen, Germany
关键词
PROSTATE-CANCER CELLS; TRANSCRIPTION FACTOR; GENE-EXPRESSION; IDENTIFICATION; ACTIVATION; MOTIF;
D O I
10.1186/1471-2105-14-241
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Accurate recognition of regulatory elements in promoters is an essential prerequisite for understanding the mechanisms of gene regulation at the level of transcription. Composite regulatory elements represent a particular type of such transcriptional regulatory elements consisting of pairs of individual DNA motifs. In contrast to the present approach, most available recognition techniques are based purely on statistical evaluation of the occurrence of single motifs. Such methods are limited in application, since the accuracy of recognition is greatly dependent on the size and quality of the sequence dataset. Methods that exploit available knowledge and have broad applicability are evidently needed. Results: We developed a novel method to identify composite regulatory elements in promoters using a library of known examples. In depth investigation of regularities encoded in known composite elements allowed us to introduce a new characteristic measure and to improve the specificity compared with other methods. Tests on an established benchmark and real genomic data show that our method outperforms other available methods based either on known examples or statistical evaluations. In addition to better recognition, a practical advantage of this method is first the ability to detect a high number of different types of composite elements, and second direct biological interpretation of the identified results. The program is available at http://gnaweb.helmholtz-hzi.de/cgi-bin/MCatch/MatrixCatch.pl and includes an option to extend the provided library by user supplied data. Conclusions: The novel algorithm for the identification of composite regulatory elements presented in this paper was proved to be superior to existing methods. Its application to tissue specific promoters identified several highly specific composite elements with relevance to their biological function. This approach together with other methods will further advance the understanding of transcriptional regulation of genes.
引用
收藏
页数:10
相关论文
共 19 条
[11]   A predictive model for regulatory sequences directing liver-specific transcription [J].
Krivan, W ;
Wasserman, WW .
GENOME RESEARCH, 2001, 11 (09) :1559-1566
[12]   TRANSFAC® and its module TRANSCompel®:: transcriptional gene regulation in eukaryotes [J].
Matys, V. ;
Kel-Margoulis, O. V. ;
Fricke, E. ;
Liebich, I. ;
Land, S. ;
Barre-Dirrie, A. ;
Reuter, I. ;
Chekmenev, D. ;
Krull, M. ;
Hornischer, K. ;
Voss, N. ;
Stegmaier, P. ;
Lewicki-Potapov, B. ;
Saxel, H. ;
Kel, A. E. ;
Wingender, E. .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D108-D110
[13]  
Shelest Ekaterina, 2003, In Silico Biology, V3, P71
[14]   AUTOCRINE GROWTH INDUCED BY KINASE TYPE ONCOGENES IN MYELOID CELLS REQUIRES AP-1 AND NF-M, A MYELOID SPECIFIC, C-EBP-LIKE FACTOR [J].
STERNECK, E ;
MULLER, C ;
KATZ, S ;
LEUTZ, A .
EMBO JOURNAL, 1992, 11 (01) :115-126
[15]   Computational methods for the detection of cis-regulatory modules [J].
Van Loo, Peter ;
Marynen, Peter .
BRIEFINGS IN BIOINFORMATICS, 2009, 10 (05) :509-524
[16]   Composite Module Analyst: identification of transcription factor binding site combinations using genetic algorithm [J].
Waleev, T. ;
Shtokalo, D. ;
Konovalova, T. ;
Voss, N. ;
Cheremushkin, E. ;
Stegmaier, P. ;
Kel-Margoulis, O. ;
Wingender, E. ;
Kel, A. .
NUCLEIC ACIDS RESEARCH, 2006, 34 :W541-W545
[17]   Identification of regulatory regions which confer muscle-specific gene expression [J].
Wasserman, WW ;
Fickett, JW .
JOURNAL OF MOLECULAR BIOLOGY, 1998, 278 (01) :167-181
[18]   coMOTIF: a mixture framework for identifying transcription factor and a coregulator motif in ChIP-seq Data [J].
Xu, Mengyuan ;
Weinberg, Clarice R. ;
Umbach, David M. ;
Li, Leping .
BIOINFORMATICS, 2011, 27 (19) :2625-2632
[19]   Deletion of ETS-1, a gene in the Jacobsen syndrome critical region, causes ventricular septal defects and abnormal ventricular morphology in mice [J].
Ye, Maoqing ;
Coldren, Chris ;
Liang, Xingqun ;
Mattina, Teresa ;
Goldmuntz, Elizabeth ;
Benson, D. Woodrow ;
Ivy, Dunbar ;
Perryman, M. B. ;
Garrett-Sinha, Lee Ann ;
Grossfeld, Paul .
HUMAN MOLECULAR GENETICS, 2010, 19 (04) :648-656