A system for named entity recognition based on local grammars

被引:13
|
作者
Krstev, Cvetana [1 ]
Obradovic, Ivan [2 ]
Utvic, Milos [1 ]
Vitas, Dusko [3 ]
机构
[1] Univ Belgrade, Fac Philol, Belgrade 11000, Serbia
[2] Univ Belgrade, Fac Min & Geol, Belgrade 11000, Serbia
[3] Univ Belgrade, Fac Math, Belgrade 11000, Serbia
关键词
Lexical resources; finite-state transducers; local grammars; named entity recognition; Serbian language; system evaluation;
D O I
10.1093/logcom/exs079
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The existence of large-scale lexical resources for Serbian, e-dictionaries in particular, coupled with local grammars in the form of finite-state transducers, enabled the development of a complex system for named entity recognition and tagging. The system is not general in nature, but targets some specific types of name, temporal and numerical expressions. In order to improve the precision of recognition we used local grammars to describe the context of named entities. In the case of personal names the widest context was used to include the recognition of nominal phrases describing a person's position. The evaluation of our system was performed twice on a corpus of 3,000 short agency news. Results obtained by the system were manually evaluated, all omissions and incorrect recognitions precisely identified, and most of them corrected before the second evaluation. The overall recall R = 0.88 for types and R = 0.94 for tokens, and overall precision P = 0.96 for types and P = 0.98 for tokens indicated that our system gives priority to precision. The evaluation of recognition of surnames only, with and without positions, and also names of distinguished persons such as royalty and church dignitaries confirmed this fact, albeit with less satisfactory results for both precision and recall.
引用
收藏
页码:473 / 489
页数:17
相关论文
共 50 条
  • [41] Chinese named entity recognition based on adaptive transformer
    Yan Yang
    Yin, Guozhe
    2022 5TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND NATURAL LANGUAGE PROCESSING, MLNLP 2022, 2022, : 327 - 331
  • [42] Knowledge-based Named Entity Recognition in Polish
    Pohl, Aleksander
    2013 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2013, : 145 - 151
  • [43] Chinese Named Entity Recognition and Disambiguation Based on Wikipedia
    Yu Miao
    Lv Yajuan
    Liu Qun
    Su Jinsong
    Xiong Hao
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, 2012, 333 : 272 - 283
  • [44] A named entity recognition model based on ensemble learning
    Zhu, Xinghui
    Zou, Zhuoyang
    Qiao, Bo
    Fang, Kui
    Chen, Yiming
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2021, 21 (02) : 475 - 486
  • [45] Pattern based bootstrapping method for named entity recognition
    Ekbal, Asif
    Bandyopadhyay, Sivaji
    PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, 2007, : 349 - +
  • [46] NAMED ENTITY RECOGNITION FOR POLISH
    Marcinczuk, Michal
    Wawer, Aleksander
    POZNAN STUDIES IN CONTEMPORARY LINGUISTICS, 2019, 55 (02): : 239 - 269
  • [47] NAMED ENTITY RECOGNITION FOR ROMANIAN
    Iftene, Adrian
    Trandabat, Diana
    Toader, Mihai
    Corici, Marius
    KEPT 2011: KNOWLEDGE ENGINEERING PRINCIPLES AND TECHNIQUES, 2011, : 49 - 60
  • [48] Named Entity Recognition Approaches
    Mansouri, Alireza
    Affendey, Lilly Suriani
    Mamat, Ali
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2008, 8 (02): : 339 - 344
  • [49] An Overview of Named Entity Recognition
    Sun, Peng
    Yang, Xuezhen
    Zhao, Xiaobing
    Wang, Zhijuan
    2018 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2018, : 273 - 278
  • [50] Arabic Named Entity Recognition
    Benajiba, Yassine
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2010, (44): : 151 - 152