Headword-Oriented Entity Linking: A Special Entity Linking Task with Dataset and Baseline

被引:0
作者
Yang, Mu [1 ]
Chen, Chi-Yen [1 ]
Lee, Yi-Hui [1 ]
Zeng, Qian-Hui [1 ]
Ma, Wei-Yun [1 ]
Shih, Chen-Yang [2 ]
Chen, Wei-Jhih [2 ]
机构
[1] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
[2] PIXNET Corp, R&D Ctr, Taipei, Taiwan
来源
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020) | 2020年
关键词
Corpus; Information Extraction; Distant Supervision;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we design headword-oriented entity linking (HEL), a specialized entity linking problem in which only the headwords of the entities are to be linked to knowledge bases; mention scopes of the entities do not need to be identified in the problem setting. This special task is motivated by the fact that in many articles referring to specific products, the complete full product names are rarely written; instead, they are often abbreviated to shorter, irregular versions or even just to their headwords, which are usually their product types, such as "stick" or "mask" in a cosmetic context. To fully design the special task, we construct a labeled cosmetic corpus as a public benchmark for this problem, and propose a product embedding model to address the task, where each product corresponds to a dense representation to encode the different information on products and their context jointly. Besides, to increase training data, we propose a special transfer learning framework in which distant supervision with heuristic patterns is first utilized, followed by supervised learning using a small amount of manually labeled data. The experimental results show that our model provides a strong benchmark performance on the special task.
引用
收藏
页码:1910 / 1917
页数:8
相关论文
共 28 条
  • [1] Agirre E., 2009, TAC
  • [2] Bunescu R. C., 2006, P 11 C EUR CHAPT ASS
  • [3] Chen Z., 2011, Proc. of the 2011 Conf. on Empirical Methods in Natural Language Process, P771
  • [4] Dredze M., 2010, P 23 INT C COMP LING
  • [5] Nominal Coreference Resolution Using Semantic Knowledge
    Fonseca, Evandro
    Vanin, Aline
    Vieira, Renata
    [J]. COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2018, 2018, 11122 : 37 - 45
  • [6] Francis-Landau M., 2016, P 2016 C N AM CHAPTE, DOI [DOI 10.18653/V1/N16-1150, 10.18653/v1/n 16-1150]
  • [7] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.8.1735, 10.1007/978-3-642-24797-2, 10.1162/neco.1997.9.1.1]
  • [8] Gupta N., 2017, Proceedings of the 2017 conference on empirical methods in natural language processing, P2681, DOI [DOI 10.18653/V1/D17-1284, 10.18653/v1/d17-1284]
  • [9] Hsieh Y.- M., 2012, P 2 CIPS SIGHAN JOIN, P216
  • [10] Hsieh Y.- M., 2007, INT J COMPUTATIONAL, V19, P195