Discovering cis-regulatory modules by optimizing barbecues

被引:0
|
作者
Mosig, Axel [1 ,2 ]
Biyikoglu, Tuerker [2 ,3 ]
Prohaska, Sonja J. [4 ,5 ,6 ,7 ,8 ]
Stadler, Peter F. [4 ,5 ,6 ,7 ,9 ]
机构
[1] Shanghai Inst Biol Sci, CAS MPG Partner Inst Computat Biol, Shanghai 200031, Peoples R China
[2] Max Planck Inst Math Sci, D-04103 Leipzig, Germany
[3] Isik Univ, TR-34980 Istanbul, Turkey
[4] Univ Vienna, Dept Theoret Chem, A-1090 Vienna, Austria
[5] Univ Leipzig, Bioinformat Grp, Dept Comp Sci, D-04107 Leipzig, Germany
[6] Univ Leipzig, Interdisciplinary Ctr Bioinformat, D-04107 Leipzig, Germany
[7] Santa Fe Inst, Santa Fe, NM 87501 USA
[8] Arizona State Univ, Dept Biomed Informat, Sch Comp & Informat, Tempe, AZ 85287 USA
[9] Fraunhofer Inst Zelltherapie & Immunol, D-04103 Leipzig, Germany
关键词
Gene regulation; cis-regulatory modules (CRMs); Best barbecue problem; NP-completeness; Branch-and-bound algorithms; Itemset mining; EVOLUTION; ELEMENTS; GENES; IDENTIFICATION; DATABASE;
D O I
10.1016/j.dam.2008.06.042
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Gene expression in eukaryotic cells is regulated by a complex network of interactions, in which transcription factors and their binding sites on the genomic DNA play a determining role. As transcription factors rarely, if ever, act in isolation, binding sites of interacting factors are typically arranged in close proximity forming so-called cis-regulatory modules. Even when the individual binding sites are known, module discovery remains a hard combinatorial problem, which we formalize here as the Best Barbecue Problem. It asks for simultaneously stabbing a maximum number of differently colored intervals from K arrangements of colored intervals. This geometric problem turns out to be an elementary, yet previously unstudied combinatorial optimization problem of detecting common edges in a family of hypergraphs, a decision version of which we show here to be NP-complete. Due to its relevance in biological applications, we propose algorithmic variations that are suitable for the analysis of real data sets comprising either many sequences or many binding sites. Being based on set systems induced by interval arrangements, our problem setting generalizes to discovering patterns of co-localized itemsets in non-sequential objects that consist of corresponding arrangements or induce set systems of co-localized items. In fact, our optimization problem is a generalization of the popular concept of frequent itemset mining. (c) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:2458 / 2468
页数:11
相关论文
共 50 条
  • [1] A New Algorithm for Identifying Cis-Regulatory Modules Based on Hidden Markov Model
    Guo, Haitao
    Huo, Hongwei
    BIOMED RESEARCH INTERNATIONAL, 2017, 2017
  • [2] Deciphering the transcriptional cis-regulatory code
    Yanez-Cuna, J. Omar
    Kvon, Evgeny Z.
    Stark, Alexander
    TRENDS IN GENETICS, 2013, 29 (01) : 11 - 22
  • [3] Predicting Cis-regulatory Modules by Method Integration
    Chang, Darby Tien-Hao
    Shiu, Guan-Yu
    Sun, You-Jie
    11TH IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2014, : 491 - 494
  • [4] Computational methods for the detection of cis-regulatory modules
    Van Loo, Peter
    Marynen, Peter
    BRIEFINGS IN BIOINFORMATICS, 2009, 10 (05) : 509 - 524
  • [5] Modeling the cis-regulatory modules of genes expressed in developmental stages of Drosophila melanogaster
    Lopez, Yosvany
    Vandenbon, Alexis
    Nose, Akinao
    Nakai, Kenta
    PEERJ, 2017, 5
  • [6] Evolution of lineage-specific functions in ancient cis-regulatory modules
    Pauls, Stefan
    Goode, Debbie K.
    Petrone, Libero
    Oliveri, Paola
    Elgar, Greg
    OPEN BIOLOGY, 2015, 5 (11)
  • [7] SMCis: An Effective Algorithm for Discovery of Cis-Regulatory Modules
    Guo, Haitao
    Huo, Hongwei
    Yu, Qiang
    PLOS ONE, 2016, 11 (09):
  • [8] BestCRM: An Exhaustive Search for Optimal Cis-Regulatory Modules in Promoters Accelerated by the Multidimensional Hash Function
    Deyneko, Igor V.
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (03)
  • [9] Motifs and cis-regulatory modules mediating the expression of genes co-expressed in presynaptic neurons
    Liu, Rui
    Hannenhalli, Sridhar
    Bucan, Maja
    GENOME BIOLOGY, 2009, 10 (07):
  • [10] Exploring a Drosophila Transcription Factor Interaction Network to Identify Cis-Regulatory Modules
    Mahmud, A. K. M. Firoj
    Yang, Doo
    Stenberg, Per
    Ioshikhes, Ilya
    Nandi, Soumyadeep
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2020, 27 (08) : 1313 - 1328