Near-Optimal Search Time in δ-Optimal Space, and Vice Versa

被引:0
作者
Kociumaka, Tomasz [1 ]
Navarro, Gonzalo [2 ]
Olivares, Francisco [2 ]
机构
[1] Max Planck Inst Informat, Saarbrucken, Germany
[2] Univ Chile, CeBiB Ctr Biotechnol & Bioengn, Dept Comp Sci, Santiago, Chile
关键词
Text indexing; Pattern matching; Substring complexity; Repetitive sequences;
D O I
10.1007/s00453-023-01186-0
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Two recent lower bounds on the compressibility of repetitive sequences, delta <= gamma, have received much attention. It has been shown that a length -n string S over an alphabet of size sigma can be represented within the optimal O(delta log n log / delta log n ) space, and further, that within that space one can find all the occ occurrences in S of any length -m pattern in time O(m log n + occ log(epsilon) n) for any constant epsilon > 0. Instead, the near-optimal search time O(m + (occ + 1) log(epsilon) n) has been achieved only within O(gamma log n/gamma ) space. Both results are based on considerably different locally consistent parsing techniques. The question of whether the better search time could be supported within the delta-optimal space remained open. In this paper, we prove that both techniques can indeed be combined to obtain the best of both worlds: O(m + (occ + 1) log(epsilon) n) search time within O(delta log n log sigma delta log n) space. Moreover, the number of occurrences can be computed in O(m + log(2+epsilon) n) time within O(delta log n log sigma/ delta log n ) space. We also show that an extra sublogarithmic factor on top of this space enables optimal O(m + occ) search time, whereas an extra logarithmic factor enables optimal O(m) counting time.
引用
收藏
页码:1031 / 1056
页数:26
相关论文
共 28 条
[1]   New data structures for orthogonal range searching [J].
Alstrup, S ;
Brodal, GS ;
Rauhe, T .
41ST ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 2000, :198-207
[2]  
Batu T, 2005, LECT NOTES COMPUT SC, V3572, P22
[3]   Alphabet-Independent Compressed Text Indexing [J].
Belazzougui, Djamal ;
Navarro, Gonzalo .
ACM TRANSACTIONS ON ALGORITHMS, 2014, 10 (04)
[4]  
Birenzwige O, 2020, PROCEEDINGS OF THE THIRTY-FIRST ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS (SODA'20), P607
[5]  
Chan TM, 2011, COMPUTATIONAL GEOMETRY (SCG 11), P1
[6]   Optimal-Time Dictionary-Compressed Indexes [J].
Christiansen, Anders Roy ;
Ettienne, Mikko Berggren ;
Kociumaka, Tomasz ;
Navarro, Gonzalo ;
Prezza, Nicola .
ACM TRANSACTIONS ON ALGORITHMS, 2021, 17 (01)
[7]   Grammar-compressed indexes with logarithmic search time [J].
Claude, Francisco ;
Navarro, Gonzalo ;
Pacheco, Alejandro .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2021, 118 :53-74
[8]  
Claude F, 2012, LECT NOTES COMPUT SC, V7608, P180, DOI 10.1007/978-3-642-34109-0_19
[9]  
Cole R., 1986, STOC '86: Proceedings of the eighteenth annual ACM symposium on Theory of computing, P206, DOI DOI 10.1145/12130.12151
[10]   UNIQUENESS THEOREMS FOR PERIODIC FUNCTIONS [J].
FINE, NJ ;
WILF, HS .
PROCEEDINGS OF THE AMERICAN MATHEMATICAL SOCIETY, 1965, 16 (01) :109-&