A system for named entity recognition based on local grammars

被引:13
|
作者
Krstev, Cvetana [1 ]
Obradovic, Ivan [2 ]
Utvic, Milos [1 ]
Vitas, Dusko [3 ]
机构
[1] Univ Belgrade, Fac Philol, Belgrade 11000, Serbia
[2] Univ Belgrade, Fac Min & Geol, Belgrade 11000, Serbia
[3] Univ Belgrade, Fac Math, Belgrade 11000, Serbia
关键词
Lexical resources; finite-state transducers; local grammars; named entity recognition; Serbian language; system evaluation;
D O I
10.1093/logcom/exs079
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The existence of large-scale lexical resources for Serbian, e-dictionaries in particular, coupled with local grammars in the form of finite-state transducers, enabled the development of a complex system for named entity recognition and tagging. The system is not general in nature, but targets some specific types of name, temporal and numerical expressions. In order to improve the precision of recognition we used local grammars to describe the context of named entities. In the case of personal names the widest context was used to include the recognition of nominal phrases describing a person's position. The evaluation of our system was performed twice on a corpus of 3,000 short agency news. Results obtained by the system were manually evaluated, all omissions and incorrect recognitions precisely identified, and most of them corrected before the second evaluation. The overall recall R = 0.88 for types and R = 0.94 for tokens, and overall precision P = 0.96 for types and P = 0.98 for tokens indicated that our system gives priority to precision. The evaluation of recognition of surnames only, with and without positions, and also names of distinguished persons such as royalty and church dignitaries confirmed this fact, albeit with less satisfactory results for both precision and recall.
引用
收藏
页码:473 / 489
页数:17
相关论文
共 50 条
  • [31] Named entity recognition in Bengali using system combination
    Ekbal, Asif
    Bandyopadhyay, Sivaji
    LINGUISTICAE INVESTIGATIONES, 2014, 37 (01): : 1 - 22
  • [32] JAPANESE NAMED ENTITY RECOGNITION FOR QUESTION ANSWERING SYSTEM
    Liu, Ye
    Ren, Fuji
    2011 IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS, 2011, : 402 - 406
  • [33] Cross Domains Arabic Named Entity Recognition System
    Al-Ahmari, S. Saad
    Al-Johar, B. Abdullatif
    FIRST INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2016, 0011
  • [34] A Hybrid Named Entity Recognition System for Aviation Text
    Bharathi, A.
    Ramdin, Robin
    Babu, Preeja
    Menon, Vijay Krishna
    Jayaramakrishnan, Chandrasekhar
    Lakshmikumar, Sudarsan
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2024, 11 (01)
  • [35] ChemSpot: a hybrid system for chemical named entity recognition
    Rocktaschel, Tim
    Weidlich, Michael
    Leser, Ulf
    BIOINFORMATICS, 2012, 28 (12) : 1633 - 1640
  • [36] Hindi named entity recognition using system combination
    Sarkar, Kamal
    INTERNATIONAL JOURNAL OF APPLIED PATTERN RECOGNITION, 2018, 5 (01) : 11 - 39
  • [37] A Named Entity Recognition Model Based on Entity Trigger Reinforcement Learning
    Wang, Ping
    Si, Nong
    Tong, Haopeng
    2022 IEEE 2ND INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND ARTIFICIAL INTELLIGENCE (CCAI 2022), 2022, : 43 - 48
  • [38] Named Entity Recognition for Vietnamese
    Dat Ba Nguyen
    Son Huu Hoang
    Son Bao Pham
    Thai Phuong Nguyen
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT II, PROCEEDINGS, 2010, 5991 : 205 - 214
  • [39] Persian Named Entity Recognition
    Dashtipour, Kia
    Gogate, Mandar
    Adeel, Ahsan
    Algarafi, Abdulrahman
    Howard, Newton
    Hussain, Amir
    2017 IEEE 16TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC), 2017, : 79 - 83
  • [40] Named Entity Recognition for Tweets
    Liu, Xiaohua
    Wei, Furu
    Zhang, Shaodian
    Zhou, Ming
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2013, 4 (01)