Discovering cis-regulatory modules by optimizing barbecues

被引:0
|
作者
Mosig, Axel [1 ,2 ]
Biyikoglu, Tuerker [2 ,3 ]
Prohaska, Sonja J. [4 ,5 ,6 ,7 ,8 ]
Stadler, Peter F. [4 ,5 ,6 ,7 ,9 ]
机构
[1] Shanghai Inst Biol Sci, CAS MPG Partner Inst Computat Biol, Shanghai 200031, Peoples R China
[2] Max Planck Inst Math Sci, D-04103 Leipzig, Germany
[3] Isik Univ, TR-34980 Istanbul, Turkey
[4] Univ Vienna, Dept Theoret Chem, A-1090 Vienna, Austria
[5] Univ Leipzig, Bioinformat Grp, Dept Comp Sci, D-04107 Leipzig, Germany
[6] Univ Leipzig, Interdisciplinary Ctr Bioinformat, D-04107 Leipzig, Germany
[7] Santa Fe Inst, Santa Fe, NM 87501 USA
[8] Arizona State Univ, Dept Biomed Informat, Sch Comp & Informat, Tempe, AZ 85287 USA
[9] Fraunhofer Inst Zelltherapie & Immunol, D-04103 Leipzig, Germany
关键词
Gene regulation; cis-regulatory modules (CRMs); Best barbecue problem; NP-completeness; Branch-and-bound algorithms; Itemset mining; EVOLUTION; ELEMENTS; GENES; IDENTIFICATION; DATABASE;
D O I
10.1016/j.dam.2008.06.042
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Gene expression in eukaryotic cells is regulated by a complex network of interactions, in which transcription factors and their binding sites on the genomic DNA play a determining role. As transcription factors rarely, if ever, act in isolation, binding sites of interacting factors are typically arranged in close proximity forming so-called cis-regulatory modules. Even when the individual binding sites are known, module discovery remains a hard combinatorial problem, which we formalize here as the Best Barbecue Problem. It asks for simultaneously stabbing a maximum number of differently colored intervals from K arrangements of colored intervals. This geometric problem turns out to be an elementary, yet previously unstudied combinatorial optimization problem of detecting common edges in a family of hypergraphs, a decision version of which we show here to be NP-complete. Due to its relevance in biological applications, we propose algorithmic variations that are suitable for the analysis of real data sets comprising either many sequences or many binding sites. Being based on set systems induced by interval arrangements, our problem setting generalizes to discovering patterns of co-localized itemsets in non-sequential objects that consist of corresponding arrangements or induce set systems of co-localized items. In fact, our optimization problem is a generalization of the popular concept of frequent itemset mining. (c) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:2458 / 2468
页数:11
相关论文
共 50 条
  • [41] Lmx1b-targeted cis-regulatory modules involved in limb dorsalization
    Haro, Endika
    Watson, Billy A.
    Feenstra, Jennifer M.
    Tegeler, Luke
    Pira, Charmaine U.
    Mohan, Subburaman
    Oberg, Kerby C.
    DEVELOPMENT, 2017, 144 (11): : 2009 - 2020
  • [42] Genome-wide analysis of the cis-regulatory modules of divergent gene pairs in yeast
    Su, Chien-Hao
    Shih, Ching-Hua
    Chang, Tien-Hsien
    Tsai, Huai-Kuang
    GENOMICS, 2010, 96 (06) : 352 - 361
  • [43] Discovering sequences with potential regulatory characteristics
    Bina, Minou
    Wyss, Phillip
    Lazarus, Sheryl A.
    Shah, Syed R.
    Ren, Wenhui
    Szpankowski, Wojciech
    Crawford, Gregory E.
    Park, Sang P.
    Song, Xiaohui C.
    GENOMICS, 2009, 93 (04) : 314 - 322
  • [44] A novel method for predicting activity of cis-regulatory modules, based on a diverse training set
    Yang, Wei
    Sinha, Saurabh
    BIOINFORMATICS, 2017, 33 (01) : 1 - 7
  • [45] CFA: An explainable deep learning model for annotating the transcriptional roles of cis-regulatory modules based on epigenetic codes
    Yang, Tzu-Hsien
    Yu, Yu-Huai
    Wu, Sheng-Hang
    Zhang, Fang-Yuan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 152
  • [46] Predicting tissue specific cis-regulatory modules in the human genome using pairs of co-occurring motifs
    Girgis, Hani Z.
    Ovcharenko, Ivan
    BMC BIOINFORMATICS, 2012, 13
  • [47] Cis-regulatory Module Detection using Constraint Programming
    Guns, Tias
    Sun, Hong
    Marchal, Kathleen
    Nijssen, Siegfried
    2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2010, : 363 - 368
  • [48] Bacterial cis-regulatory RNA structures
    Gelfand M.S.
    Molecular Biology, 2006, 40 (4) : 541 - 550
  • [49] Bacterial cis-regulatory RNA structures
    Gelfand, M. S.
    MOLECULAR BIOLOGY, 2006, 40 (04) : 609 - 619
  • [50] Cis-regulatory logic in archaeal transcription
    Peeters, Eveline
    Peixeiro, Nuno
    Sezonov, Guennadi
    BIOCHEMICAL SOCIETY TRANSACTIONS, 2013, 41 : 326 - 331