Latent semantics in Named Entity Recognition

被引:47
作者
Konkol, Michal [1 ]
Brychcin, Tomas
Konopik, Miloslav
机构
[1] Univ W Bohemia, Fac Sci Appl, Dept Comp Sci & Engn, Plzen 30614, Czech Republic
关键词
Named Entity Recognition; Information extraction; Stemming; Semantic analysis; Semantic spaces; Latent Dirichlet allocation; INFORMATION; MODELS; SPACES;
D O I
10.1016/j.eswa.2014.12.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose new features for Named Entity Recognition (NER) based on latent semantics. Furthermore, we explore the effect of unsupervised morphological information on these methods and on the NER system in general. The newly created NER system is fully language-independent thanks to the unsupervised nature of the proposed features. We evaluate the system on English, Spanish, Dutch and Czech corpora and study the difference between weakly and highly inflectional languages. Our system achieves the same or even better results than state-of-the-art language dependent systems. The proposed features proved to be very useful and are the main reason of our promising results. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3470 / 3479
页数:10
相关论文
共 61 条
  • [1] [Anonymous], 2007, P 24 INT C MACH LEAR, DOI DOI 10.1145/1273496.1273577
  • [2] [Anonymous], 2002, TECHNICAL REPORT
  • [3] [Anonymous], 2003, P 7 C NAT LANG LEARN, DOI [10.3115/1119176.1119197, DOI 10.3115/1119176.1119197]
  • [4] [Anonymous], 1957, STUDIES LINGUISTIC A
  • [5] [Anonymous], 2005, ACL, DOI 10.3115/1219840.1219885
  • [6] A multi-strategy approach to biological named entity recognition
    Atkinson, John
    Bull, Veronica
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (17) : 12968 - 12974
  • [7] Multi-document summarization based on the Yago ontology
    Baralis, Elena
    Cagliero, Luca
    Jabeen, Saima
    Fiori, Alessandro
    Shah, Sajid
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (17) : 6976 - 6984
  • [8] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [9] Borthwick Andrew Eliot, 1999, A Maximum Entropy Approach to Named Entity Recognition
  • [10] Brown P. F., 1992, Computational Linguistics, V18, P467