Finite-state transducer cascades to extract named entities in texts

Cited by: 34
Authors
Friburger, N [1]
Maurel, D [1]
Affiliation
[1] Lab Informat Tours, F-37000 Tours, France
Keywords
Finite-State Transducer; Finite-State Cascade; named entity; proper names; pattern matching; MUC;
DOI
10.1016/j.tcs.2003.10.007
CLC number
TP301 [Theory, methods];
Discipline classification code
081202;
Abstract
Many named entity extraction systems have been developed for English, spurred by the MUC conferences. This article describes a finite-state transducer cascade for extracting named entities from French journalistic texts. Finite-state cascades are widely used in natural language processing: a cascade is a series of finite-state transducers applied to a text one after another, each transforming it. Such transducer cascades allow the implementation of syntactic analysis, translation memory and information extraction. We present our general system, named CasSys, which uses the natural language processing features of INTEX to implement a transducer cascade. CasSys is not dedicated to named entity extraction; we use it for this task, but thanks to INTEX it also supports syntactic analysis, information extraction and other tasks. (C) 2003 Published by Elsevier B.V.
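To make the idea of a cascade concrete, here is a minimal sketch in Python. It is not CasSys and does not use INTEX: regular-expression substitutions stand in for the finite-state transducers, and the two patterns, the tag names and the function name `apply_cascade` are all assumptions chosen for illustration. The point it shows is only that each stage rewrites the text and hands its output to the next stage.

```python
import re

# Hypothetical two-stage cascade. Each stage is a (pattern, rewrite) pair
# standing in for one transducer; real INTEX transducers are graphs, not regexes.
CASCADE = [
    # Stage 1: a civility title followed by a capitalised word is tagged as a person.
    (re.compile(r"\b(M\.|Mme|Dr) ([A-Z]\w+)"), r"<person>\1 \2</person>"),
    # Stage 2: the preposition "à" followed by a capitalised word is tagged as a location.
    (re.compile(r"\bà ([A-Z]\w+)"), r"à <location>\1</location>"),
]


def apply_cascade(text: str) -> str:
    """Apply every stage in order; each stage sees the previous stage's output."""
    for pattern, rewrite in CASCADE:
        text = pattern.sub(rewrite, text)
    return text


if __name__ == "__main__":
    print(apply_cascade("M. Dupont habite à Tours."))
    # prints: <person>M. Dupont</person> habite à <location>Tours</location>.
```

Because the stages run in a fixed order, an earlier stage can mark up text so that later stages either build on the tags or skip the tagged spans; this ordering is the design choice a cascade relies on.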
Pages: 93-104
Page count: 12