Dynamic rank/select structures with applications to run-length encoded texts

被引:8
作者
Lee, Sunho [1 ]
Park, Kunsoo [1 ]
机构
[1] Seoul Natl Univ, Sch Engn & Comp Sci, Seoul 151742, South Korea
关键词
Succinct data structures; Dynamic rank/select structures; Full-text index; Run-length encoding;
D O I
10.1016/j.tcs.2009.07.021
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Given an n-length text over a sigma-size alphabet, we propose a framework for dynamic rank/select structures on the text and some of its applications. For a small alphabet with a <= log n, we propose a two-level structure consisting of a counting scheme and a storing scheme that supports O(log n) worst-case time rank/select operations and O(log n) amortized time insert/delete operations. For a large alphabet with log n < sigma <= n, we extend it to obtain O((1+ log sigma/log log n) worst-case time rank/select and O((1 + log sigma/log log n) log n) amortized time insert/delete. Our structure provides a simple representation of an index for a collection of texts. In addition, we present rank/select structures on run-length encoding (RLE) of a text. For the n'-length RLE of an n-length text, our static version provides O(1) time select and O(log log sigma) time rank using n' log sigma + O(n) bits and our dynamic version gives O((1 + log sigma/log log n) log n) time operations in n' log sigma + o(n' log sigma ) + O(n) bits. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:4402 / 4413
页数:12
相关论文
共 15 条
  • [1] Distinct squares in run-length encoded strings
    Liu, J. J.
    THEORETICAL COMPUTER SCIENCE, 2010, 411 (49) : 4235 - 4241
  • [2] Hardness of comparing two run-length encoded strings
    Chen, Kuan-Yu
    Hsu, Ping-Hui
    Chao, Kun-Mao
    JOURNAL OF COMPLEXITY, 2010, 26 (04) : 364 - 374
  • [3] An algorithm for the rapid computation of boundaries of run-length encoded regions
    Quek, FKH
    PATTERN RECOGNITION, 2000, 33 (10) : 1637 - 1649
  • [4] Efficient retrieval of approximate palindromes in a run-length encoded string
    Chen, Kuan-Yu
    Hsu, Ping-Hui
    Chao, Kun-Mao
    THEORETICAL COMPUTER SCIENCE, 2012, 432 : 28 - 37
  • [5] Fast algorithms for computing the constrained LCS of run-length encoded strings
    Ann, Hsing-Yen
    Yang, Chang-Biau
    Tseng, Chiou-Ting
    Hor, Chiou-Yi
    THEORETICAL COMPUTER SCIENCE, 2012, 432 : 1 - 9
  • [6] Lyndon Factorization Algorithms for Small Alphabets and Run-Length Encoded Strings
    Ghuman, Sukhpal Singh
    Giaquinta, Emanuele
    Tarhio, Jorma
    ALGORITHMS, 2019, 12 (06)
  • [7] Computing similarity of run-length encoded strings with affine gap penalty
    Kim, Jin Wook
    Amir, Amihood
    Landau, Gad A.
    Park, Kunsoo
    THEORETICAL COMPUTER SCIENCE, 2008, 395 (2-3) : 268 - 282
  • [8] Binary jumbled string matching for highly run-length compressible texts
    Badkobeh, Golnaz
    Fici, Gabriele
    Kroon, Steve
    Liptak, Zsuzsanna
    INFORMATION PROCESSING LETTERS, 2013, 113 (17) : 604 - 608
  • [9] A fast algorithm for finding the positions of all squares in a run-length encoded string
    Liu, J. J.
    Huang, G. S.
    Wang, Y. L.
    THEORETICAL COMPUTER SCIENCE, 2009, 410 (38-40) : 3942 - 3948
  • [10] Approximate Matching for Run-Length Encoded Strings Is 3SUM-Hard
    Chen, Kuan-Yu
    Hsu, Ping-Hui
    Chao, Kun-Mao
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2009, 5577 : 168 - 179