Distances in random digital search trees

被引:3
作者
Aguech, Rafik
Lasmar, Nabil
Mahmoud, Hosam [1 ]
机构
[1] George Washington Univ, Dept Stat, Washington, DC 20052 USA
[2] Fac Sci Monastir, Dept Math, Monastir 5019, Tunisia
[3] IPEIT, Dept Math, Tunis, Tunisia
关键词
random trees; recurrence; Mellin Transform; poissonization; fixed point; contraction method;
D O I
10.1007/s00236-006-0019-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Distances between nodes in random trees is a popular topic, and several classes of trees have recently been investigated. We look into this matter in digital search trees. By analytic techniques, such as the Mellin Transform and poissonization, we describe a program to determine the moments of these distances. The program is illustrated on the mean and variance. One encounters delayed Mellin transform equations, which we solve by inspection. In addition to various asymptotics, we give an exact expression for the mean and for the variance in the unbiased case. Interestingly, the unbiased case gives a bounded variance, whereas the biased case gives a variance growing with the number of keys. It is therefore possible in the biased case to show that an appropriately normalized version of the distance converges to a limit. The complexity of moment calculation increases substantially with each higher moment; it is prudent to seek a shortcut to the limit via a method that avoids the computation of all moments. Toward this end, we utilize the contraction method to show that in biased digital search trees the distribution of a suitably normalized version of the distances approaches a limit that is the fixed-point solution of a distributional equation (distances being measured in the Wasserstein metric space). An explicit solution to the fixed-point equation is readily demonstrated to be Gaussian.
引用
收藏
页码:243 / 264
页数:22
相关论文
共 26 条
[1]   An asymptotic theory for Cauchy-Euler differential equations with applications to the analysis of algorithms [J].
Chern, HH ;
Hwang, HK ;
Tsai, TH .
JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC, 2002, 44 (01) :177-225
[2]   The oscillatory distribution of distances in random tries [J].
Christophi, CA ;
Mahmoud, HM .
ANNALS OF APPLIED PROBABILITY, 2005, 15 (02) :1536-1564
[3]   FILE STRUCTURES USING HASHING FUNCTIONS [J].
COFFMAN, EG ;
EVE, J .
COMMUNICATIONS OF THE ACM, 1970, 13 (07) :427-&
[4]   Distances and finger search in random binary search trees [J].
Devroye, L ;
Neininger, R .
SIAM JOURNAL ON COMPUTING, 2004, 33 (03) :647-658
[5]   MELLIN TRANSFORMS AND ASYMPTOTICS - HARMONIC SUMS [J].
FLAJOLET, P ;
GOURDON, X ;
DUMAS, P .
THEORETICAL COMPUTER SCIENCE, 1995, 144 (1-2) :3-58
[6]   DIGITAL SEARCH-TREES REVISITED [J].
FLAJOLET, P ;
SEDGWICK, R .
SIAM JOURNAL ON COMPUTING, 1986, 15 (03) :748-767
[7]   Analytical dePoissonization and its applications [J].
Jacquet, P ;
Szpankowski, W .
THEORETICAL COMPUTER SCIENCE, 1998, 201 (1-2) :1-62
[8]   DIGITAL SEARCH-TREES AGAIN REVISITED - THE INTERNAL PATH-LENGTH PERSPECTIVE [J].
KIRSCHENHOFER, P ;
PRODINGER, H ;
SZPANKOWSKI, W .
SIAM JOURNAL ON COMPUTING, 1994, 23 (03) :598-616
[9]   FURTHER RESULTS ON DIGITAL SEARCH-TREES [J].
KIRSCHENHOFER, P ;
PRODINGER, H .
THEORETICAL COMPUTER SCIENCE, 1988, 58 (1-3) :143-154
[10]   AVERAGE PROFILE AND LIMITING DISTRIBUTION FOR A PHRASE SIZE IN THE LEMPEL-ZIV PARSING ALGORITHM [J].
LOUCHARD, G ;
SZPANKOWSKI, W .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1995, 41 (02) :478-488