De-Novo Discovery of Differentially Abundant Transcription Factor Binding Sites Including Their Positional Preference

被引:46
作者
Keilwagen, Jens [1 ]
Grau, Jan [2 ]
Paponov, Ivan A. [3 ,4 ]
Posch, Stefan [2 ]
Strickert, Marc [1 ]
Grosse, Ivo [2 ]
机构
[1] Leibniz Inst Plant Genet & Crop Plant Res IPK, Gatersleben, Germany
[2] Univ Halle Wittenberg, Inst Comp Sci, Halle, Germany
[3] Univ Freiburg, Inst Biol Bot 2, Fac Biol, Freiburg, Germany
[4] Univ Freiburg, Ctr Biol Signalling Studies BIOSS, Freiburg, Germany
关键词
PROTEIN-DNA INTERACTIONS; AUXIN RESPONSE FACTORS; GENOME; EUKARYOTES; ELEMENTS; DATABASE; SAMPLER; TOOLS;
D O I
10.1371/journal.pcbi.1001070
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Transcription factors are a main component of gene regulation as they activate or repress gene expression by binding to specific binding sites in promoters. The de-novo discovery of transcription factor binding sites in target regions obtained by wet-lab experiments is a challenging problem in computational biology, which has not been fully solved yet. Here, we present a de-novo motif discovery tool called Dispom for finding differentially abundant transcription factor binding sites that models existing positional preferences of binding sites and adjusts the length of the motif in the learning process. Evaluating Dispom, we find that its prediction performance is superior to existing tools for de-novo motif discovery for 18 benchmark data sets with planted binding sites, and for a metazoan compendium based on experimental data from microarray, ChIP-chip, ChIP-DSL, and DamID as well as Gene Ontology data. Finally, we apply Dispom to find binding sites differentially abundant in promoters of auxin-responsive genes extracted from Arabidopsis thaliana microarray data, and we find a motif that can be interpreted as a refined auxin responsive element predominately positioned in the 250-bp region upstream of the transcription start site. Using an independent data set of auxin-responsive genes, we find in genome-wide predictions that the refined motif is more specific for auxin-responsive genes than the canonical auxin-responsive element. In general, Dispom can be used to find differentially abundant motifs in sequences of any origin. However, the positional distribution learned by Dispom is especially beneficial if all sequences are aligned to some anchor point like the transcription start site in case of promoter sequences. We demonstrate that the combination of searching for differentially abundant motifs and inferring a position distribution from the data is beneficial for de-novo motif discovery. Hence, we make the tool freely available as a component of the open-source Java framework Jstacs and as a stand-alone application at http://www.jstacs.de/index.php/Dispom.
引用
收藏
页数:13
相关论文
共 41 条
  • [1] [Anonymous], FITTING MIXTURE MODE
  • [2] Environmentally induced foregut remodeling by PHA-4/FoxA and DAF-12/NHR
    Ao, W
    Gaudet, J
    Kent, WJ
    Muttumu, S
    Mango, SE
    [J]. SCIENCE, 2004, 305 (5691) : 1743 - 1746
  • [3] Nonisotopic quantitative analysis of protein-DNA interactions at equilibrium
    Benotmane, AM
    Hoylaerts, MF
    Collen, D
    Belayew, A
    [J]. ANALYTICAL BIOCHEMISTRY, 1997, 250 (02) : 181 - 185
  • [4] JASPAR, the open access database of transcription factor-binding profiles: new content and tools in the 2008 update
    Bryne, Jan Christian
    Valen, Eivind
    Tang, Man-Hung Eric
    Marstrand, Troels
    Winther, Ole
    da Piedade, Isabelle
    Krogh, Anders
    Lenhard, Boris
    Sandelin, Albin
    [J]. NUCLEIC ACIDS RESEARCH, 2008, 36 : D102 - D106
  • [5] Cerquides J, 2005, LECT NOTES ARTIF INT, V3720, P72, DOI 10.1007/11564096_12
  • [6] Davis J., 2006, P 23 INT C MACH LEAR, P233, DOI [10.1145/1143844.1143874, DOI 10.1145/1143844.1143874]
  • [7] A universal framework for regulatory element discovery across all Genomes and data types
    Elemento, Olivier
    Slonim, Noam
    Tavazoie, Saeed
    [J]. MOLECULAR CELL, 2007, 28 (02) : 337 - 350
  • [8] DNAASE FOOTPRINTING - SIMPLE METHOD FOR DETECTION OF PROTEIN-DNA BINDING SPECIFICITY
    GALAS, DJ
    SCHMITZ, A
    [J]. NUCLEIC ACIDS RESEARCH, 1978, 5 (09) : 3157 - 3170
  • [9] Auxin response factors
    Guilfoyle, Toni J.
    Hagen, Gretchen
    [J]. CURRENT OPINION IN PLANT BIOLOGY, 2007, 10 (05) : 453 - 460
  • [10] Transcriptional regulatory code of a eukaryotic genome
    Harbison, CT
    Gordon, DB
    Lee, TI
    Rinaldi, NJ
    Macisaac, KD
    Danford, TW
    Hannett, NM
    Tagne, JB
    Reynolds, DB
    Yoo, J
    Jennings, EG
    Zeitlinger, J
    Pokholok, DK
    Kellis, M
    Rolfe, PA
    Takusagawa, KT
    Lander, ES
    Gifford, DK
    Fraenkel, E
    Young, RA
    [J]. NATURE, 2004, 431 (7004) : 99 - 104