Assessing the significance of sets of words

Cited by: 0
Authors
Boeva, V
Clément, J
Régnier, M
Vandenbogaert, M
Affiliations
[1] Moscow MV Lomonosov State Univ, Moscow, Russia
[2] Univ Marne La Vallee, IGM, Marne La Vallee, France
[3] Inst Natl Rech Informat & Automat, F-78153 Le Chesnay, France
[4] Univ Basel, Biozentrum, Basel, Switzerland
Source
COMBINATORIAL PATTERN MATCHING, PROCEEDINGS | 2005 / Volume 3537
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Various criteria have been defined to evaluate the significance of sets of words, the computation of them often being difficult. We provide explicit expressions for the waiting time in such a context. In order to assess the significance of a cluster of potential binding sites, we extend them to the co-occurrence problem. We point out that these criteria values depend on a few fundamental parameters. We provide efficient algorithms to compute them, that rely on a combinatorial interpretation of the formulae. We show that our results are very tight in the so-called twilight zone and improve on previous rough approximations. One assumes that the text is generated according to a Markov stationary process. These results are developed for an extended model of consensus.
Pages: 358-370
Page count: 13
Related papers
26 in total
  • [1] EFFICIENT STRING MATCHING - AID TO BIBLIOGRAPHIC SEARCH
    AHO, AV
    CORASICK, MJ
    [J]. COMMUNICATIONS OF THE ACM, 1975, 18 (06) : 333 - 340
  • [2] [Anonymous], 1978, INDAGATIONES MATH
  • [3] [Anonymous], 2001, Proceedings of the fifth annual international conference on Computational biology, RECOMB '01
  • [4] Patterns of variant polyadenylation signal usage in human genes
    Beaudoing, E
    Freier, S
    Wyatt, JR
    Claverie, JM
    Gautheret, D
    [J]. GENOME RESEARCH, 2000, 10 (07) : 1001 - 1010
  • [5] THE DISTRIBUTION OF SUBWORD COUNTS IS USUALLY NORMAL
    BENDER, EA
    KOCHMAN, F
    [J]. EUROPEAN JOURNAL OF COMBINATORICS, 1993, 14 (04) : 265 - 275
  • [6] BLANCHETTE M, 2001, BIOINFORMATICS, V817, P30
  • [7] Phylogenetically and spatially conserved word pairs associated with gene-expression changes in yeasts
    Chiang, DY
    Moses, AM
    Kellis, M
    Lander, ES
    Eisen, MB
    [J]. GENOME BIOLOGY, 2003, 4 (07)
  • [8] CHRYSAPHINOU C, 1990, THEOR PROBAB APPL, V79, P167
  • [9] Crochemore M., 2002, JEWELS STRINGOLOGY
  • [10] FLAJOLET P, 1996, ANAL ALGORITHMS