Composite Module Analyst: identification of transcription factor binding site combinations using genetic algorithm

被引:37
作者
Waleev, T.
Shtokalo, D.
Konovalova, T.
Voss, N.
Cheremushkin, E.
Stegmaier, P.
Kel-Margoulis, O.
Wingender, E.
Kel, A.
机构
[1] BIOBASE GmbH, D-38304 Wolfenbuttel, Germany
[2] AP Ershovs Inst Informat Syst, Novosibirsk 630090, Russia
[3] Univ Gottingen, Dept Bioinformat, UKG, D-37077 Gottingen, Germany
关键词
D O I
10.1093/nar/gkl342
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Composite Module Analyst (CMA) is a novel software tool aiming to identify promoter-enhancer models based on the composition of transcription factor (TF) binding sites and their pairs. CMA is closely interconnected with the TRANSFAC (R) database. In particular, CMA uses the positional weight matrix (PWM) library collected in TRANSFAC (R) and therefore provides the possibility to search for a large variety of different TF binding sites. We model the structure of the long gene regulatory regions by a Boolean function that joins several local modules, each consisting of co-localized TF binding sites. Having as an input a set of co-regulated genes, CMA builds the promoter model and optimizes the parameters of the model automatically by applying a genetic-regression algorithm. We use a multicomponent fitness function of the algorithm which includes several statistical criteria in a weighted linear function. We show examples of successful application of CMA to a microarray data on transcription profiling of TNF-alpha stimulated primary human endothelial cells. The CMA web server is freely accessible at http://www.gene-regulation.com/pub/programs/cma/CMA.html. An advanced version of CMA is also a part of the commercial system ExPlain (TM) (www.biobase.de) designed for causal analysis of gene expression data.
引用
收藏
页码:W541 / W545
页数:5
相关论文
共 29 条
[1]   Computational detection of cis-regulatory modules [J].
Aerts, Stein ;
Van Loo, Peter ;
Thijs, Gert ;
Moreau, Yves ;
De Moor, Bart .
BIOINFORMATICS, 2003, 19 :II5-II14
[2]  
Boehlk S, 2000, EUR J IMMUNOL, V30, P1102, DOI 10.1002/(SICI)1521-4141(200004)30:4<1102::AID-IMMU1102>3.0.CO
[3]  
2-X
[4]  
BRAZMA A, 1997, COMPUTER SCI BIOL, P57
[5]   The early growth response protein (EGR-1) regulates interleukin-2 transcription by synergistic interaction with the nuclear factor of activated T cells [J].
Decker, EL ;
Skerka, C ;
Zipfel, PF .
JOURNAL OF BIOLOGICAL CHEMISTRY, 1998, 273 (41) :26923-26930
[6]   Human housekeeping genes are compact [J].
Eisenberg, E ;
Levanon, EY .
TRENDS IN GENETICS, 2003, 19 (07) :362-365
[7]  
Eskin E., 2002, BIOINFORMATICS, V18, P354, DOI DOI 10.1093/BIOINFORMATICS/18.SUPPL_1.S354
[8]  
Fessele S, 2001, FASEB J, V15, P577
[9]  
Frech K, 1998, In Silico Biol, V1, P29
[10]   Identifying target sites for cooperatively binding factors [J].
GuhaThakurta, D ;
Stormo, GD .
BIOINFORMATICS, 2001, 17 (07) :608-621