A system for named entity recognition based on local grammars

被引:13
|
作者
Krstev, Cvetana [1 ]
Obradovic, Ivan [2 ]
Utvic, Milos [1 ]
Vitas, Dusko [3 ]
机构
[1] Univ Belgrade, Fac Philol, Belgrade 11000, Serbia
[2] Univ Belgrade, Fac Min & Geol, Belgrade 11000, Serbia
[3] Univ Belgrade, Fac Math, Belgrade 11000, Serbia
关键词
Lexical resources; finite-state transducers; local grammars; named entity recognition; Serbian language; system evaluation;
D O I
10.1093/logcom/exs079
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The existence of large-scale lexical resources for Serbian, e-dictionaries in particular, coupled with local grammars in the form of finite-state transducers, enabled the development of a complex system for named entity recognition and tagging. The system is not general in nature, but targets some specific types of name, temporal and numerical expressions. In order to improve the precision of recognition we used local grammars to describe the context of named entities. In the case of personal names the widest context was used to include the recognition of nominal phrases describing a person's position. The evaluation of our system was performed twice on a corpus of 3,000 short agency news. Results obtained by the system were manually evaluated, all omissions and incorrect recognitions precisely identified, and most of them corrected before the second evaluation. The overall recall R = 0.88 for types and R = 0.94 for tokens, and overall precision P = 0.96 for types and P = 0.98 for tokens indicated that our system gives priority to precision. The evaluation of recognition of surnames only, with and without positions, and also names of distinguished persons such as royalty and church dignitaries confirmed this fact, albeit with less satisfactory results for both precision and recall.
引用
收藏
页码:473 / 489
页数:17
相关论文
共 50 条
  • [1] Portuguese Named Entity Recognition using Conditional Random Fields and Local Grammars
    Pirovani, Juliana P. C.
    de Oliveira, Elias
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 4452 - 4456
  • [2] Named entity recognition for Arabic using syntactic grammars
    Mesfar, Slim
    Natural Language Processing and Information Systems, Proceedings, 2007, 4592 : 305 - 316
  • [3] Learning the Morphological and Syntactic Grammars for Named Entity Recognition
    Sun, Mengtao
    Yang, Qiang
    Wang, Hao
    Pasquine, Mark
    Hameed, Ibrahim A.
    INFORMATION, 2022, 13 (02)
  • [4] Advanced grammars for state-of-the-art Named Entity Recognition (NER)
    Sayle, Roger
    Lowe, Daniel
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 253
  • [5] Biomedical named entity recognition system
    Patrick, J. (jonpat@it.usyd.edu.au), 2005, School of Information Technologies
  • [6] A Named Entity Recognition system for Dutch
    De Meulder, F
    Daelemans, W
    Hoste, V
    COMPUTATIONAL LINGUISTICS IN THE NETHERLANDS 2001, 2002, (45): : 77 - 88
  • [7] Chinese medical named entity recognition model based on local enhancement
    Chen, Jing
    Xing, Kexuan
    Meng, Weilun
    Guo, Jingfeng
    Feng, Jianzhou
    Tongxin Xuebao/Journal on Communications, 45 (07): : 171 - 183
  • [8] Wikipedia-based Named Entity Recognition System for Turkish
    Kucuk, Dogan
    Arici, Nursal
    JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2016, 19 (03): : 325 - 332
  • [9] Named entity recognition in a Hungarian NL based QA system
    Tikk, Domonkos
    Szidarovszky, P. Ferenc
    Kardkovacs, Zsolt T.
    Magyar, Gabor
    ADVANCES IN INFORMATION SYSTEMS DEVELOPMENT, VOL 1 AND 2: BRIDGING THE GAP BETWEEN ACADEMIA AND INDUSTRY, 2006, : 879 - +
  • [10] Named Entity Recognition for Tibetan Texts Using Case-auxiliary Grammars
    Yu, Hongzhi
    Jiang, Tao
    Ma, Ning
    INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS (IMECS 2010), VOLS I-III, 2010, : 601 - 604