k-Approximate Quasiperiodicity Under Hamming and Edit Distance

被引:1
作者
Kedzierski, Aleksander [1 ,2 ]
Radoszewski, Jakub [1 ,2 ]
机构
[1] Univ Warsaw, Inst Informat, Warsaw, Poland
[2] Samsung R&D Inst Poland, Warsaw, Poland
关键词
Quasiperiodicity; Approximate cover; Approximate seed; Hamming distance; Edit distance; ALGORITHMS; COVER;
D O I
10.1007/s00453-021-00842-7
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Quasiperiodicity in strings was introduced almost 30 years ago as an extension of string periodicity. The basic notions of quasiperiodicity are cover and seed. A cover of a text T is a string whose occurrences in T cover all positions of T. A seed of text T is a cover of a superstring of T. In various applications exact quasiperiodicity is still not sufficient due to the presence of errors. We consider approximate notions of quasiperiodicity, for which we allow approximate occurrences in T with a small Hamming, Levenshtein or weighted edit distance. In previous work Sim et al. (J Korea Inf Sci Soc 29(1):16-21, 2002) and Christodoulakis et al. (J Autom Lang Comb 10(5/6), 609-626, 2005) showed that computing approximate covers and seeds, respectively, under weighted edit distance is NP-hard. They, therefore, considered restricted approximate covers and seeds which need to be factors of the original string T and presented polynomial-time algorithms for computing them. Further algorithms, considering approximate occurrences with Hamming distance bounded by k, were given in several contributions by Guth et al. They also studied relaxed approximate quasiperiods. We present more efficient algorithms for computing restricted approximate covers and seeds. In particular, we improve upon the complexities of many of the aforementioned algorithms, also for relaxed quasiperiods. Our solutions are especially efficient if the number (or total cost) of allowed errors is small. We also show conditional lower bounds for computing restricted approximate covers and prove NP-hardness of computing non-restricted approximate covers and seeds under the Hamming distance.
引用
收藏
页码:566 / 589
页数:24
相关论文
共 36 条
  • [1] Approximate cover of strings
    Amir, A.
    Levy, A.
    Lubin, R.
    Porat, E.
    [J]. THEORETICAL COMPUTER SCIENCE, 2019, 793 : 59 - 69
  • [2] Amir A, 2018, LIPICS, VLIPIcs 105, DOI [10.4230/LIPIcs.CPM.2018.4, DOI 10.4230/LIPICS.CPM.2018.4]
  • [3] Can We Recover the Cover?
    Amir, Amihood
    Levy, Avivit
    Lewenstein, Moshe
    Lubin, Ronit
    Porat, Benny
    [J]. ALGORITHMICA, 2019, 81 (07) : 2857 - 2875
  • [4] EFFICIENT DETECTION OF QUASIPERIODICITIES IN STRINGS
    APOSTOLICO, A
    EHRENFEUCHT, A
    [J]. THEORETICAL COMPUTER SCIENCE, 1993, 119 (02) : 247 - 265
  • [5] OPTIMAL SUPERPRIMITIVITY TESTING FOR STRINGS
    APOSTOLICO, A
    FARACH, M
    ILIOPOULOS, CS
    [J]. INFORMATION PROCESSING LETTERS, 1991, 39 (01) : 17 - 20
  • [6] AN ONLINE STRING SUPERPRIMITIVITY TEST
    BRESLAUER, D
    [J]. INFORMATION PROCESSING LETTERS, 1992, 44 (06) : 345 - 347
  • [7] Quadratic Conditional Lower Bounds for String Problems and Dynamic TimeWarping
    Bringmann, Karl
    Kuennemann, Marvin
    [J]. 2015 IEEE 56TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, 2015, : 79 - 97
  • [8] Christodoulakis Manolis, 2005, Journal of Automata, Languages and Combinatorics, V10, P609
  • [9] CROCHEMORE M., 2003, Jewels of Stringology, DOI DOI 10.1142/4838
  • [10] Experimental evaluation of algorithms for computing quasiperiods
    Czajka, Patryk
    Radoszewski, Jakub
    [J]. THEORETICAL COMPUTER SCIENCE, 2021, 854 : 17 - 29