On the Limiting Distribution of Lempel-Ziv'78 Redundancy for Memoryless Sources

被引:3
|
作者
Jacquet, Philippe [1 ]
Szpankowski, Wojciech [2 ,3 ]
机构
[1] Alcatel Lucent Bell Labs, F-91620 Nozay, France
[2] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
[3] Gdansk Univ Technol, Fac Elect Telecommun & Informat, PL-80233 Gdansk, Poland
基金
美国国家科学基金会;
关键词
Lempel-Zv; 78; redundancy; digital search trees; analytic information theory; ZIV PARSING SCHEME; PROBABILITY; SEQUENCES;
D O I
10.1109/TIT.2014.2358679
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study the Lempel-Ziv'78 algorithm and show that its (normalized) redundancy rate tends to a Gaussian distribution for memoryless sources. We accomplish it by extending findings from our 1995 paper, in particular, by presenting a new simplified proof of the central limit theorem (CLT) for the number of phrases in the LZ'78 algorithm. We first analyze the asymptotic behavior of the total path length in the associated digital search tree built from independent sequences. Then, a renewal theory type argument yields CLT for LZ'78 scheme. Here, we extend our analysis of LZ'78 algorithm to present new results on the convergence of moments, moderate and large deviations, and CLT for the (normalized) redundancy. In particular, we confirm that the average redundancy rate decays as 1/log n, and we find that the variance is of order 1/n, where n is the length of the text.
引用
收藏
页码:6917 / 6930
页数:14
相关论文
共 19 条
  • [1] Limiting Distribution of Lempel Ziv'78 Redundancy
    Jacquet, Philippe
    Szpankowski, Wojciech
    2011 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS (ISIT), 2011, : 1509 - 1513
  • [2] On the average redundancy rate of the Lempel-Ziv code
    Louchard, G
    Szpankowski, W
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1997, 43 (01) : 2 - 8
  • [3] AVERAGE PROFILE AND LIMITING DISTRIBUTION FOR A PHRASE SIZE IN THE LEMPEL-ZIV PARSING ALGORITHM
    LOUCHARD, G
    SZPANKOWSKI, W
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1995, 41 (02) : 478 - 488
  • [4] On the average redundancy rate of the Lempel-Ziv code with the k-error protocol
    Reznik, YA
    Szpankowski, W
    INFORMATION SCIENCES, 2001, 135 (1-2) : 57 - 70
  • [5] A Lempel-Ziv Compressed Structure for Document Listing
    Ferrada, Hector
    Navarro, Gonzalo
    STRING PROCESSING AND INFORMATION RETRIEVAL (SPIRE 2013), 2013, 8214 : 116 - 128
  • [6] Stronger Lempel-Ziv Based Compressed Text Indexing
    Arroyuelo, Diego
    Navarro, Gonzalo
    Sadakane, Kunihiko
    ALGORITHMICA, 2012, 62 (1-2) : 54 - 101
  • [7] On the Nonlinear complexity and Lempel-Ziv complexity of finite length sequences
    Limniotis, Konstantinos
    Kolokotronis, Nicholas
    Kalouptsidis, Nicholas
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2007, 53 (11) : 4293 - 4302
  • [8] Average Profile of the Lempel-Ziv Parsing Scheme for a Markovian Source
    P. Jacquet
    W. Szpankowski
    J. Tang
    Algorithmica, 2001, 31 : 318 - 360
  • [9] Nonlinear complexity of binary sequences and connections with Lempel-Ziv compression
    Limniotis, Konstantinos
    Kolokotronis, Nicholas
    Kalouptsidis, Nicholas
    SEQUENCES AND THEIR APPLICATIONS - SETA 2006, 2006, 4086 : 168 - 179
  • [10] Average profile of the Lempel-Ziv parsing scheme for a Markovian source
    Jacquet, P
    Szpankowski, W
    Tang, J
    ALGORITHMICA, 2001, 31 (03) : 318 - 360