Position heaps: A simple and dynamic text indexing data structure

被引:24
|
作者
Ehrenfeucht, Andrzej [1 ]
McConnell, Ross M. [2 ]
Osheim, Nissa [2 ]
Woo, Sung-Whan [2 ]
机构
[1] Univ Colorado Boulder, Dept Comp Sci, Boulder, CO 80309 USA
[2] Colorado State Univ, Dept Comp Sci, Ft Collins, CO 80523 USA
关键词
Position heap; String searching;
D O I
10.1016/j.jda.2010.12.001
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We address the problem of finding the locations of all instances of a string P in a text T, where preprocessing of T is allowed in order to facilitate the queries. Previous data structures for this problem include the suffix tree, the suffix array, and the compact DAWG. We modify a data structure called a sequence tree, which was proposed by Coffman and Eve (1970) [ 3] for hashing, and adapt it to the new problem. We can then produce a list of k occurrences of any string P in T in O(parallel to P parallel to + k) time. Because of properties shared by suffixes of a text that are not shared by arbitrary hash keys, we can build the structure in O(parallel to T parallel to) time, which is much faster than Coffman and Eve's algorithm. These bounds are as good as those for the suffix tree, suffix array, and the compact DAWG. The advantages are the elementary nature of some of the algorithms for constructing and using the data structure and the asymptotic bounds we can give for updating the data structure when the text is edited. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:100 / 121
页数:22
相关论文
共 50 条
  • [41] Semantic indexing-based data augmentation for filtering undesired short text messages
    Lochter, Johannes V.
    Silva, Renato M.
    Almeida, Tiago A.
    Yamakami, Akebo
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 1034 - 1039
  • [42] A simple approach to integration of acceleration data for dynamic soil-structure interaction analysis
    Yang, J.
    Li, J. B.
    Lin, G.
    SOIL DYNAMICS AND EARTHQUAKE ENGINEERING, 2006, 26 (08) : 725 - 734
  • [43] FORMAL METHODS OF TEXT STRUCTURE-ANALYSIS USED FOR AUTOMATIC-INDEXING AND ABSTRACTING
    BONDARENKO, GV
    NAUCHNO-TEKHNICHESKAYA INFORMATSIYA SERIYA 2-INFORMATSIONNYE PROTSESSY I SISTEMY, 1976, (05): : 26 - 37
  • [44] SIMPLE DYNAMIC STRUCTURE METHOD.
    Leung, A.Y.T.
    Earthquake Engineering & Structural Dynamics, 1988, 16 (06): : 827 - 837
  • [45] An Efficient Data Retrieval Indexing Structure for Wireless Broadcasting System
    Lin, Lien-Fa
    Chen, Chao-Chun
    ISDA 2008: EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 2, PROCEEDINGS, 2008, : 651 - +
  • [46] A succinct data structure for self-indexing ternary relations
    Alvarez-Garcia S.
    de Bernardo G.
    Brisaboa N.R.
    Navarro G.
    Journal of Discrete Algorithms, 2017, 43 : 38 - 53
  • [48] A new dynamic indexing structure for searching time-series patterns
    Lin, ZY
    Xue, YS
    Lv, XH
    CONCEPTUAL MODELING FOR ADVANCED APPLICATION DOMAINS, PROCEEDINGS, 2004, 3289 : 290 - 301
  • [49] A Viewable Indexing Structure for the Interactive Exploration of Dynamic and Large Image Collections
    Rayar, Frederic
    Barrat, Sabine
    Bouali, Fatma
    Venturini, Gilles
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2018, 12 (01)
  • [50] Dynamic Position Calibration by Road Structure Detection
    Speth, Thomas
    Doric, Igor
    Riedel, Helmut
    Brandmeier, Thomas
    Jumar, Ulrich
    2015 IEEE INTERNATIONAL CONFERENCE ON VEHICULAR ELECTRONICS AND SAFETY (ICVES), 2015, : 104 - 109