Periodicity and repetitions in parameterized strings

被引:23
作者
Apostolico, Alberto [1 ,2 ]
Giancarlo, Raffaele [3 ]
机构
[1] Univ Padua, Dipartimento Ingn Informaz, Padua, Italy
[2] Georgia Inst Technol, Coll Comp, Atlanta, GA 30332 USA
[3] Univ Palermo, Dipartimento Matemat & Applicata, I-90133 Palermo, Italy
关键词
combinatorics on words; string periodicity; parameterized strings;
D O I
10.1016/j.dam.2006.11.017
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
One of the most beautiful and useful notions in the Mathematical Theory of Strings is that of a Period, i.e., an initial piece of a given string that can generate that string by repeating itself at regular intervals. Periods have an elegant mathematical structure and a wealth of applications [F. Mignosi and A. Restivo, Periodicity, Algebraic Combinatorics on Words, in: M. Lothaire (Ed.), Cambridge University Press, Cambridge, pp. 237-274, 2002]. At the hearth of their theory, there are two Periodicity Lemmas: one due to Lyndon and Schutzenberger [The equation a(M) = b(N)C(P) in a free group, Michigan Math. J. 9 (1962) 289-298], referred to as the Weak Version, and the other due to Fine and Wilf [Uniqueness theorems for periodic functions, Proc. Amer. Math. Soc. 16 (1965) 109-114]. In this paper, we investigate the notion of periodicity and the closely related one of repetition in connection with parameterized strings as introduced by Baker [Parameterized pattern matching: algorithms and applications, J. Comput. System Sci. 52(1) (1996) 28-42; Parameterized duplication in strings: algorithms and an application to software maintenance, SIAM J. Comput. 26(5) (1997) 1343-1362]. In such strings, the notion of pairwise match or "equivalence" of symbols is more relaxed than the usual one, in that it rests on some mapping, rather than identity, of symbols. It seems natural to try and extend notions of periods and periodicities to encompass parameterized strings. However, we know of no previous attempt in this direction. Our preliminary investigation yields results as follows. For periodicity, we get (a) a generalization of the Weak Version of the Periodicity Lemma for parameterized strings, showing that it is essential that the two mappings inducing the periodicity must commute; (b) a proof that an analogous of the Lemma by Fine and Wilf [Uniqueness theorems for periodic functions, Proc. Amer. Math. Soc. 16 (1965) 109-114] cannot hold for parameterized strings, even if the mappings inducing the periodicity "commute", in a sense to be specified below; (c) a proof that parameterized strings over an alphabet of at least three letters may have a set of periods which differ from those of any binary string of the same length-whereby the parameterized analog of a classic result by Guibas and Odlyzko [String overlaps, pattern matching, and nontransitive games, J. Combin. Theory Ser. A 30 (1981) 183-208] cannot hold. We also derive necessary and sufficient conditions characterizing parameterized repetitions, which are patterns of length at least twice that of the period, and show how the notion of root differs from the standard case, and highlight some of the implications on extending algorithmic criteria previously adopted for string searching, detection of repetitions and the likes. Finally, as a corollary of our main results, we also show that binary parameterized strings behave much in the same way as non-parameterized ones with respect to periodicity and repetitions, while there is a substantial difference for strings over alphabets of at least three symbols. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:1389 / 1398
页数:10
相关论文
共 19 条
[1]   Two-dimensional periodicity in rectangular arrays [J].
Amir, A ;
Benson, G .
SIAM JOURNAL ON COMPUTING, 1998, 27 (01) :90-106
[2]   ALPHABET DEPENDENCE IN PARAMETERIZED MATCHING [J].
AMIR, A ;
FARACH, M ;
MUTHUKRISHNAN, S .
INFORMATION PROCESSING LETTERS, 1994, 49 (03) :111-115
[3]  
AMIR A, 2003, P 30 ICALP, P929
[4]   STRUCTURAL-PROPERTIES OF THE STRING STATISTICS PROBLEM [J].
APOSTOLICO, A ;
PREPARATA, FP .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1985, 31 (03) :394-411
[5]   OPTIMAL OFF-LINE DETECTION OF REPETITIONS IN A STRING [J].
APOSTOLICO, A ;
PREPARATA, FP .
THEORETICAL COMPUTER SCIENCE, 1983, 22 (03) :297-315
[6]  
Apostolico A, 1996, ALGORITHMICA, V15, P481, DOI 10.1007/BF01955046
[7]   THE BOYER-MOORE-GALIL STRING SEARCHING STRATEGIES REVISITED [J].
APOSTOLICO, A ;
GIANCARLO, R .
SIAM JOURNAL ON COMPUTING, 1986, 15 (01) :98-105
[8]  
Apostolico A, 1997, Pattern Matching Algorithms
[9]  
Baker Brenda S., 1999, P 10 ANN ACM SIAM S, P854
[10]   Sparse dynamic programming for longest common subsequence from fragments [J].
Baker, BS ;
Giancarlo, R .
JOURNAL OF ALGORITHMS, 2002, 42 (02) :231-254