An analysis of the Burrows-Wheeler Transform

Cited by: 212
Author
Manzini, G. [1]
Affiliation
[1] Univ Piemonte Orientale, Dipartimento Sci & Tecnol Avanzate, I-15100 Alessandria, Italy
Keywords
algorithms; performance; block sorting; Burrows-Wheeler Transform; move-to-front encoding; worst-case analysis of compression
DOI
10.1145/382780.382782
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The Burrows-Wheeler Transform (also known as Block-Sorting) is at the base of compression algorithms that are the state of the art in lossless data compression. In this paper, we analyze two algorithms that use this technique. The first one is the original algorithm described by Burrows and Wheeler, which, despite its simplicity, outperforms the Gzip compressor. The second one uses an additional run-length encoding step to improve compression. We prove that the compression ratio of both algorithms can be bounded in terms of the kth order empirical entropy of the input string for any k greater than or equal to 0. We make no assumptions on the input, and we obtain bounds that hold in the worst case, that is, for every possible input string. All previous results for Block-Sorting algorithms were concerned with the average compression ratio and were established assuming that the input comes from a finite-order Markov source.
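For readers unfamiliar with the terms in the abstract, the following is a minimal Python sketch of the three ingredients it mentions: the Burrows-Wheeler Transform, move-to-front encoding, and the kth order empirical entropy. It is an illustration only, not the algorithms analyzed in the paper. The function names, the `$` end-of-string marker, and the naive rotation sort are assumptions made here for brevity. The entropy function follows the usual definition, in which the zeroth-order entropies of the symbols following each length-k context are summed and divided by the string length.

```python
from collections import Counter, defaultdict
from math import log2


def bwt(s, eos="$"):
    """Naive BWT: sort all rotations of s + eos and read the last column.

    Assumption: eos does not occur in s and sorts before every symbol of s.
    """
    assert eos not in s
    t = s + eos
    rotations = sorted(t[i:] + t[:i] for i in range(len(t)))
    return "".join(r[-1] for r in rotations)


def move_to_front(s):
    """Replace each symbol by its position in a list that moves it to the front."""
    alphabet = sorted(set(s))  # initial list order is an illustrative choice
    out = []
    for ch in s:
        i = alphabet.index(ch)
        out.append(i)
        alphabet.insert(0, alphabet.pop(i))
    return out


def _h0_total_bits(counts):
    """n * H_0 for a multiset of symbols given by its counts."""
    n = sum(counts.values())
    return -sum(c * log2(c / n) for c in counts.values())


def empirical_entropy(s, k=0):
    """kth order empirical entropy of s, in bits per symbol."""
    if not s:
        return 0.0
    followers = defaultdict(Counter)
    for i in range(k, len(s)):
        followers[s[i - k:i]][s[i]] += 1  # symbol following each length-k context
    return sum(_h0_total_bits(c) for c in followers.values()) / len(s)


if __name__ == "__main__":
    transformed = bwt("banana")
    print(transformed)
    print(move_to_front(transformed))
    print(empirical_entropy("abababab", 0), empirical_entropy("abababab", 1))
```

The transform tends to group equal symbols together, so the move-to-front output contains many small integers that an entropy coder can encode cheaply. The string `abababab` shows why higher-order entropy matters: its zeroth-order entropy is 1 bit per symbol, but its first-order entropy is 0 because each symbol is fully determined by its predecessor.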
Pages: 407-430
Number of pages: 24