Position heaps: A simple and dynamic text indexing data structure

被引:24
|
作者
Ehrenfeucht, Andrzej [1 ]
McConnell, Ross M. [2 ]
Osheim, Nissa [2 ]
Woo, Sung-Whan [2 ]
机构
[1] Univ Colorado Boulder, Dept Comp Sci, Boulder, CO 80309 USA
[2] Colorado State Univ, Dept Comp Sci, Ft Collins, CO 80523 USA
关键词
Position heap; String searching;
D O I
10.1016/j.jda.2010.12.001
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We address the problem of finding the locations of all instances of a string P in a text T, where preprocessing of T is allowed in order to facilitate the queries. Previous data structures for this problem include the suffix tree, the suffix array, and the compact DAWG. We modify a data structure called a sequence tree, which was proposed by Coffman and Eve (1970) [ 3] for hashing, and adapt it to the new problem. We can then produce a list of k occurrences of any string P in T in O(parallel to P parallel to + k) time. Because of properties shared by suffixes of a text that are not shared by arbitrary hash keys, we can build the structure in O(parallel to T parallel to) time, which is much faster than Coffman and Eve's algorithm. These bounds are as good as those for the suffix tree, suffix array, and the compact DAWG. The advantages are the elementary nature of some of the algorithms for constructing and using the data structure and the asymptotic bounds we can give for updating the data structure when the text is edited. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:100 / 121
页数:22
相关论文
共 50 条
  • [1] Contracted Suffix Trees: A Simple and Dynamic Text Indexing Data Structure
    Ehrenfeucht, Andrzej
    McConnell, Ross M.
    Woo, Sung-Whan
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2009, 5577 : 41 - +
  • [2] Improved dynamic text indexing
    Ferragina, P
    Grossi, R
    JOURNAL OF ALGORITHMS, 1999, 31 (02) : 291 - 319
  • [3] An Indexing Structure for Dynamic Multidimensional Data in Vector Space
    Mikhaylova, Elena
    Novikov, Boris
    Volokhov, Anton
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, 2013, 186 : 185 - 193
  • [4] DYNAMIC INDEXING FOR THE ANALYSIS OF LITERARY TEXT
    WEISS, H
    INTERNATIONAL CLASSIFICATION, 1991, 18 (04): : 200 - 204
  • [5] FA-Tree - A dynamic indexing structure for spatial data
    Chang, CC
    Shen, JJ
    Chou, YC
    SOFT COMPUTING AS TRANSDISCIPLINARY SCIENCE AND TECHNOLOGY, 2005, : 1071 - 1080
  • [6] DyST: Dynamic and scalable temporal text indexing
    Norvag, Kjetil
    Nybo, Albert Overskeid
    TIME 2006: THIRTEENTH INTERNATIONAL SYMPOSIUM ON TEMPORAL REPRESENTATION AND REASONING, PROCEEDINGS, 2006, : 204 - +
  • [7] Dynamic text indexing under string updates
    Ferragina, P
    JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC, 1997, 22 (02): : 296 - 328
  • [8] A POINTER-FREE DATA STRUCTURE FOR MERGING HEAPS AND MIN-MAX HEAPS
    GAMBOSI, G
    NARDELLI, E
    TALAMO, M
    THEORETICAL COMPUTER SCIENCE, 1991, 84 (01) : 107 - 126
  • [9] A POINTER-FREE DATA STRUCTURE FOR MERGING HEAPS AND MIN-MAX HEAPS
    GAMBOSI, G
    NARDELLI, E
    TALAMO, M
    LECTURE NOTES IN COMPUTER SCIENCE, 1989, 372 : 405 - 438
  • [10] A-Tree: A Dynamic Data Structure for Efficiently Indexing Arbitrary Boolean Expressions
    Ji, Shuping
    Jacobsen, Hans-Arno
    SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 817 - 829