Discovering cis-regulatory modules by optimizing barbecues

被引:0
|
作者
Mosig, Axel [1 ,2 ]
Biyikoglu, Tuerker [2 ,3 ]
Prohaska, Sonja J. [4 ,5 ,6 ,7 ,8 ]
Stadler, Peter F. [4 ,5 ,6 ,7 ,9 ]
机构
[1] Shanghai Inst Biol Sci, CAS MPG Partner Inst Computat Biol, Shanghai 200031, Peoples R China
[2] Max Planck Inst Math Sci, D-04103 Leipzig, Germany
[3] Isik Univ, TR-34980 Istanbul, Turkey
[4] Univ Vienna, Dept Theoret Chem, A-1090 Vienna, Austria
[5] Univ Leipzig, Bioinformat Grp, Dept Comp Sci, D-04107 Leipzig, Germany
[6] Univ Leipzig, Interdisciplinary Ctr Bioinformat, D-04107 Leipzig, Germany
[7] Santa Fe Inst, Santa Fe, NM 87501 USA
[8] Arizona State Univ, Dept Biomed Informat, Sch Comp & Informat, Tempe, AZ 85287 USA
[9] Fraunhofer Inst Zelltherapie & Immunol, D-04103 Leipzig, Germany
关键词
Gene regulation; cis-regulatory modules (CRMs); Best barbecue problem; NP-completeness; Branch-and-bound algorithms; Itemset mining; EVOLUTION; ELEMENTS; GENES; IDENTIFICATION; DATABASE;
D O I
10.1016/j.dam.2008.06.042
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Gene expression in eukaryotic cells is regulated by a complex network of interactions, in which transcription factors and their binding sites on the genomic DNA play a determining role. As transcription factors rarely, if ever, act in isolation, binding sites of interacting factors are typically arranged in close proximity forming so-called cis-regulatory modules. Even when the individual binding sites are known, module discovery remains a hard combinatorial problem, which we formalize here as the Best Barbecue Problem. It asks for simultaneously stabbing a maximum number of differently colored intervals from K arrangements of colored intervals. This geometric problem turns out to be an elementary, yet previously unstudied combinatorial optimization problem of detecting common edges in a family of hypergraphs, a decision version of which we show here to be NP-complete. Due to its relevance in biological applications, we propose algorithmic variations that are suitable for the analysis of real data sets comprising either many sequences or many binding sites. Being based on set systems induced by interval arrangements, our problem setting generalizes to discovering patterns of co-localized itemsets in non-sequential objects that consist of corresponding arrangements or induce set systems of co-localized items. In fact, our optimization problem is a generalization of the popular concept of frequent itemset mining. (c) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:2458 / 2468
页数:11
相关论文
共 50 条
  • [31] BICORN: An R package for integrative inference of de novo cis-regulatory modules
    Chen, Xi
    Gu, Jinghua
    Neuwald, Andrew F.
    Hilakivi-Clarke, Leena
    Clarke, Robert
    Xuan, Jianhua
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [32] Spatially varying cis-regulatory divergence in Drosophila embryos elucidates cis-regulatory logic
    Combs, Peter A.
    Fraser, Hunter B.
    PLOS GENETICS, 2018, 14 (11):
  • [33] A map of the cis-regulatory sequences in the mouse genome
    Shen, Yin
    Yue, Feng
    McCleary, David F.
    Ye, Zhen
    Edsall, Lee
    Kuan, Samantha
    Wagner, Ulrich
    Dixon, Jesse
    Lee, Leonard
    Lobanenkov, Victor V.
    Ren, Bing
    NATURE, 2012, 488 (7409) : 116 - 120
  • [34] Epistatic Interactions in the Arabinose Cis-Regulatory Element
    Lagator, Mato
    Igler, Claudia
    Moreno, Anaisa B.
    Guet, Calin C.
    Bollback, Jonathan P.
    MOLECULAR BIOLOGY AND EVOLUTION, 2016, 33 (03) : 761 - 769
  • [35] cis-Regulatory elements in plant cell signaling
    Priest, Henry D.
    Filichkin, Sergei A.
    Mockler, Todd C.
    CURRENT OPINION IN PLANT BIOLOGY, 2009, 12 (05) : 643 - 649
  • [36] Cis-Regulatory Elements in Mammals
    Liu, Xingyu
    Chen, Mengjie
    Qu, Xiuwen
    Liu, Wenjing
    Dou, Yuting
    Liu, Qingyou
    Shi, Deshun
    Jiang, Mingsheng
    Li, Hui
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (01)
  • [37] OncoCis: annotation of cis-regulatory mutations in cancer
    Perera, Dilmi
    Chacon, Diego
    Thoms, Julie A. I.
    Poulos, Rebecca C.
    Shlien, Adam
    Beck, Dominik
    Campbell, Peter J.
    Pimanda, John E.
    Wong, Jason W. H.
    GENOME BIOLOGY, 2014, 15 (10)
  • [38] Erroneous attribution of relevant transcription factor binding sites despite successful prediction of cis-regulatory modules
    Halfon, Marc S.
    Zhu, Qianqian
    Brennan, Elizabeth R.
    Zhou, Yiyun
    BMC GENOMICS, 2011, 12
  • [39] Multiplex cis-regulatory analysis
    Nam, Jongmin
    ECHINODERMS, PT B, 2019, 151 : 159 - 176
  • [40] Imogene: identification of motifs and cis-regulatory modules underlying gene co-regulation
    Rouault, Herve
    Santolini, Marc
    Schweisguth, Francois
    Hakim, Vincent
    NUCLEIC ACIDS RESEARCH, 2014, 42 (10) : 6128 - 6145