On-line construction of compact directed acyclic word graphs

被引:27
作者
Inenaga, S [1 ]
Hoshino, H
Shinohara, A
Takeda, M
Arikawa, S
Mauri, G
Pavesi, G
机构
[1] Kyushu Univ, Dept Informat, Fukuoka 812, Japan
[2] Univ Milan Biccoca, Dept Comp Sci Syst & Commun, Milan, Italy
关键词
pattern matching on strings; compact directed acyclic word graphs; directed acyclic word graphs; suffix trees; on-line and; linear-time algorithms;
D O I
10.1016/j.dam.2004.04.012
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Many different index structures, providing efficient solutions to problems related to pattern matching, have been introduced so far. Examples of these structures are suffix trees and directed acyclic word graphs (DAWGs), which can be efficiently constructed in linear time and space. Compact directed acyclic word graphs (CDAWGs) are an index structure preserving some features of both suffix trees and DAWGs, and require less space than both of them. An algorithm which directly constructs CDAWGs in linear time and space was first introduced by Crochemore and Verin, based on McCreight's algorithm for constructing suffix trees. In this work, we present a novel on-line linear-time algorithm that builds the CDAWG for a single string as well as for a set of strings, inspired by Ukkonen's on-line algorithm for constructing suffix trees. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:156 / 179
页数:24
相关论文
共 36 条