WHAT CAN WE LEARN ABOUT SUFFIX TREES FROM INDEPENDENT TRIES

被引:0
|
作者
JACQUET, P [1 ]
SZPANKOWSKI, W [1 ]
机构
[1] PURDUE UNIV, DEPT COMP SCI, W LAFAYETTE, IN 47907 USA
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
A suffix tree of a word is a digital tree that is built from suffixes of the underlying word. We consider words that are random sequences built from independent symbols over a finite alphabet. Our main finding shows that the depths in a suffix tree are asymptotically equivalent to the depths in a digital tree that stores independent keys (i.e., independent digital trees known also as tries). More precisely, we prove that the depths in a suffix tree build from the first n suffixes of a random word are normally distributed with the mean asymptotically equivalent to 1/h1 log n and the variance alpha.log n, where h1 is the entropy of the alphabet, and alpha is a parameter of the probabilistic model. Our results provide new insights into asymptotic properties of compression schemes, and therefore find direct applications in computer sciences and telecommunications, most notably in coding theory, theory of languages, and design and analysis of algorithms.
引用
收藏
页码:228 / 239
页数:12
相关论文
共 50 条