Bit Catastrophes for the Burrows-Wheeler Transform

被引:5
|
作者
Giuliani, Sara [1 ]
Inenaga, Shunsuke [2 ]
Liptak, Zsuzsanna [1 ]
Romana, Giuseppe [3 ]
Sciortino, Marinella [3 ]
Urbina, Cristian [4 ]
机构
[1] Univ Verona, Verona, Italy
[2] Kyushu Univ, Fukuoka, Japan
[3] Univ Palermo, Palermo, Italy
[4] Univ Chile, Santiago, Chile
来源
DEVELOPMENTS IN LANGUAGE THEORY, DLT 2023 | 2023年 / 13911卷
关键词
Burrows-Wheeler transform; Equal-letter run; Repetitiveness measure; Sensitivity;
D O I
10.1007/978-3-031-33264-7_8
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A bit catastrophe, loosely defined, is when a change in just one character of a string causes a significant change in the size of the compressed string. We study this phenomenon for the Burrows-Wheeler Transform (BWT), a string transform at the heart of several of the most popular compressors and aligners today. The parameter determining the size of the compressed data is the number of equal-letter runs of the BWT, commonly denoted r. We exhibit infinite families of strings in which insertion, deletion, resp. substitution of one character increases r from constant to Theta(log n), where n is the length of the string. These strings can be interpreted both as examples for an increase by a multiplicative or an additive Theta(log n)-factor. As regards multiplicative factor, they attain the upper bound given by Akagi, Funakoshi, and Inenaga [Inf & Comput. 2023] of O( log n log r), since here r = O(1). We then give examples of strings in which insertion, deletion, resp. substitution of a character increases r by a Theta(root n) additive factor. These strings significantly improve the best known lower bound for an additive factor of Omega(log n) [Giuliani et al., SOFSEM 2021].
引用
收藏
页码:86 / 99
页数:14
相关论文
共 50 条
  • [21] On the number of equal-letter runs of the bijective Burrows-Wheeler transform
    Biagi, Elena
    Cenzato, Davide
    Liptak, Zsuzsanna
    Romana, Giuseppe
    THEORETICAL COMPUTER SCIENCE, 2025, 1027
  • [22] A High Performance Architecture for Computing Burrows-Wheeler Transform on FPGAs
    Cheema, Umer I.
    Khokhar, Ashfaq A.
    2013 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2013,
  • [23] Application of the Burrows-Wheeler Transform for Searching for Approximate Tandem Repeats
    Danek, Agnieszka
    Pokrzywa, Rafal
    Makalowska, Izabela
    Polanski, Andrzej
    PATTERN RECOGNITION IN BIOINFORMATICS, 2012, 7632 : 255 - 266
  • [24] Can burrows-Wheeler transform be replaced in chain code compression?
    Zalik, Borut
    Mongus, Domen
    Lukac, Niko
    Zalik, Krista Rizman
    INFORMATION SCIENCES, 2020, 525 (525) : 109 - 118
  • [25] Pipelined Processor for Image Compression through Burrows-Wheeler Transform
    Bafna, Chirag Babulal
    Berde, Ankit
    Jashnani, Karan
    Chaudhari, Monali
    Sharma, Simeran
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 383 - 387
  • [26] Parallel lossless data compression using the Burrows-Wheeler Transform
    Gilchrist, Jeff
    Cuhadar, Aysegul
    INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2008, 4 (01) : 117 - 135
  • [27] Computing the Burrows-Wheeler transform of a string and its reverse in parallel
    Ohlebusch, Enno
    Beller, Timo
    Abouelhoda, Mohamed I.
    JOURNAL OF DISCRETE ALGORITHMS, 2014, 25 : 21 - 33
  • [28] Efficient pattern matching in degenerate strings with the Burrows-Wheeler transform
    Daykin, J. W.
    Groult, R.
    Guesnet, Y.
    Lecroq, T.
    Lefebvre, A.
    Leonard, M.
    Mouchard, L.
    Prieur-Gaston, E.
    Watson, B.
    INFORMATION PROCESSING LETTERS, 2019, 147 : 82 - 87
  • [29] High performance OpenCL realization of Burrows-Wheeler transform on GPU
    Kartsev, Petr F.
    IWOCL'18: PROCEEDINGS OF THE INTERNATIONAL WORKSHOP ON OPENCL, 2018, : 83 - 84
  • [30] A four-stage algorithm for updating a Burrows-Wheeler transform
    Salson, M.
    Lecroq, T.
    Leonard, M.
    Mouchard, L.
    THEORETICAL COMPUTER SCIENCE, 2009, 410 (43) : 4350 - 4359