Incremental construction and maintenance of minimal finite-state automata

被引:27
作者
Carrasco, RC [1 ]
Forcada, ML [1 ]
机构
[1] Univ Alacant, Dept Llenguatges & Sistemes Informat, E-03071 Alacant, Spain
关键词
D O I
10.1162/089120102760173652
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Daciuk et al. [Computational Linguistics 26(1): 3-16 (2000)] describe a method for constructing incrementally minimal, deterministic, acyclic finite-state automata (dictionaries) from sets of strings. But acyclic finite-state automata have limitations: For instance, if one wants a linguistic application to accept all possible integer numbers or Internet addresses, the corresponding finite-state automaton has to be cyclic. In this article, we describe a simple and equally efficient method for modifying any minimal finite-state automaton (be it acyclic or not) so that a string is added to or removed from the language it accepts; both operations are very important when dictionary maintenance is performed and solve the dictionary construction problem addressed by Daciuk et al. as a special case. The algorithms proposed here may be straightforwardly derived from the customary textbook constructions for the intersection and the complementation of finite-state automata; the algorithms exploit the special properties of the automata resulting from the intersection operation when one of the finite-state automata accepts a single string.
引用
收藏
页码:207 / 216
页数:10
相关论文
共 6 条
[1]  
[Anonymous], PROCESAMIENTO LENGUA
[2]   Incremental construction of minimal acyclic finite-state automata [J].
Daciuk, J ;
Mihov, S ;
Watson, BW ;
Watson, RE .
COMPUTATIONAL LINGUISTICS, 2000, 26 (01) :3-16
[3]  
Hopcroft J. E., 2007, Introduction to Automata Theory, Languages and Computation
[4]  
KARAKOSTAS G, 2000, P 15 ANN IEEE C COMP
[5]  
REVUZ D, 2000, PREPROCEEDINGS CIAA, P226
[6]  
Roche E, 1997, LANG SPEECH & COMMUN, P1