Periodicity in Data Streams with Wildcards

被引:1
|
作者
Ergun, Funda [1 ]
Grigorescu, Elena [2 ]
Azer, Erfan Sadeqi [1 ]
Zhou, Samson [1 ,2 ]
机构
[1] Indiana Univ, Sch Informat & Comp, Bloomington, IN 47405 USA
[2] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
基金
美国国家科学基金会;
关键词
Streaming algorithms; Periodicity; Wildcards; Randomized algorithms; String algorithms;
D O I
10.1007/s00224-019-09950-y
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We investigate the problem of detecting periodic trends within a string S of length n, arriving in the streaming model, containing at most k wildcard characters, where k = o(n). A wildcard character is a special character that can be assigned any other character. We say that S has wildcard-period p if there exists an assignment to each of the wildcard characters so that in the resulting stream the prefix of length n - p equals the suffix of length n - p. We present a two-pass streaming algorithm that computes wildcard-periods of S using O(k3 polylog n) bits of space, while we also show that this problem cannot be solved in sublinear space in one pass. We also give a one-pass randomized streaming algorithm that computes all wildcard-periods p of S withp < n2 and no wildcard characters appearing in the last p symbols of S, using O(k3 log9 n) space.
引用
收藏
页码:177 / 197
页数:21
相关论文
共 50 条
  • [31] Refactoring Java']Java Generics by Inferring Wildcards, In Practice
    Altidor, John
    Smaragdakis, Yannis
    ACM SIGPLAN NOTICES, 2014, 49 (10) : 271 - 290
  • [32] Periodicity detection in small-sample gene-expression data
    Mahata, Kaushik
    Mahata, Pritha
    PROCEEDINGS OF THE 2007 15TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, 2007, : 111 - +
  • [33] BPBM: An Algorithm for String Matching with Wildcards and Length Constraints
    Hong, Xiao-Li
    Wu, Xindong
    Hu, Xue-Gang
    Liu, Ying-Ling
    Gao, Jun
    Wu, Gong-Qing
    ROUGH SETS, FUZZY SETS, DATA MINING AND GRANULAR COMPUTING, PROCEEDINGS, 2009, 5908 : 518 - 525
  • [34] Efficient pattern matching with periodical wildcards in uncertain sequences
    Liu, Huiting
    Wang, Lili
    Liu, Zhizhong
    Zhao, Peng
    Wu, Xindong
    INTELLIGENT DATA ANALYSIS, 2018, 22 (04) : 829 - 842
  • [35] Deterministic Sampling and Range Counting in Geometric Data Streams
    Bagchi, Amitabha
    Chaudhary, Amitabh
    Eppstein, David
    Goodrich, Michael T.
    ACM TRANSACTIONS ON ALGORITHMS, 2007, 3 (02)
  • [36] Truly Perfect Samplers for Data Streams and Sliding Windows
    Jayaram, Rajesh
    Woodruff, David P.
    Zhou, Samson
    PROCEEDINGS OF THE 41ST ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS (PODS '22), 2022, : 29 - 40
  • [37] One-pass wavelet decompositions of data streams
    Gilbert, AC
    Kotidis, Y
    Muthukrishnan, S
    Strauss, MJ
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2003, 15 (03) : 541 - 554
  • [38] Weighted sampling without replacement from data streams
    Braverman, Vladimir
    Ostroysky, Rafail
    Vorsanger, Gregory
    INFORMATION PROCESSING LETTERS, 2015, 115 (12) : 923 - 926
  • [39] Periodicity and arithmetic-periodicity in hexadecimal games
    Howse, S
    Nowakowski, RJ
    THEORETICAL COMPUTER SCIENCE, 2004, 313 (03) : 463 - 472
  • [40] A Bit-Parallel Algorithm for Sequential Pattern Matching with Wildcards
    Guo, Dan
    Hong, Xiao-Li
    Hu, Xue-Gang
    Gao, Jun
    Liu, Ying-Ling
    Wu, Gong-Qing
    Wu, Xindong
    CYBERNETICS AND SYSTEMS, 2011, 42 (06) : 382 - 401