Learning Entity Linking Features for Emerging Entities

Cited by: 0
Authors
Ran, Chenwei [1]
Shen, Wei [2]
Gao, Jianbo [2]
Li, Yuhan [2]
Wang, Jianyong [1,3]
Jia, Yantao [4]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Nankai Univ, TMCC, TKLNDST, Coll Comp Sci, Tianjin 300350, Peoples R China
[3] Jiangsu Normal Univ, Jiangsu Collaborat Innovat Ctr Language Abil, Xuzhou 221008, Jiangsu, Peoples R China
[4] Huawei Technol Co Ltd, Beijing 100077, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Encyclopedias; Optimization; Online services; Internet; Task analysis; Numerical models; Data models; Entity linking; entity linking feature; emerging entity; self-training;
DOI
10.1109/TKDE.2022.3197707
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Entity linking (EL) is the process of linking entity mentions appearing in text with their corresponding entities in a knowledge base. EL features of entities (e.g., prior probability, relatedness score, and entity embedding) are usually estimated from Wikipedia. However, newly emerging entities (EEs) that have just been discovered in news may not yet be included in Wikipedia. As a consequence, the required EL features for those EEs cannot be obtained from Wikipedia, and EL models will fail to link ambiguous mentions to those EEs correctly because their EL features are absent. To deal with this problem, in this paper we focus on a new task of learning EL features for emerging entities in a general way. We propose a novel approach called STAMO to learn high-quality EL features for EEs automatically. It needs just a small number of labeled documents for each EE collected from the Web, as it can further leverage the knowledge hidden in the unlabeled data. STAMO is mainly based on self-training, which allows it to be flexibly integrated with any EL feature or EL model, but also makes it prone to the error reinforcement problem caused by mislabeled data. Instead of using common self-training strategies that try to discard mislabeled data explicitly, we regard self-training as a multiple optimization process with respect to the EL features of EEs, and propose both intra-slot and inter-slot optimizations to alleviate the error reinforcement problem implicitly. We construct two EL datasets involving selected EEs to evaluate the quality of the obtained EL features for EEs, and the experimental results show that our approach significantly outperforms other baseline methods of learning EL features.
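To make the error reinforcement problem mentioned in the abstract concrete, the following is a minimal sketch of plain self-training applied to one EL feature, the prior probability P(entity | mention). It is not STAMO and does not implement the intra-slot or inter-slot optimizations; the function names, the confidence threshold, and the toy "Jaguar" data are all hypothetical and chosen only for illustration.

```python
from collections import Counter, defaultdict


def estimate_prior(labeled):
    """Estimate P(entity | mention) by relative frequency over (mention, entity) pairs."""
    counts = defaultdict(Counter)
    for mention, entity in labeled:
        counts[mention][entity] += 1
    return {
        m: {e: c / sum(ents.values()) for e, c in ents.items()}
        for m, ents in counts.items()
    }


def self_train(seed, unlabeled, threshold=0.6, rounds=3):
    """Generic self-training (an illustrative assumption, not the paper's method).

    Each round pseudo-labels every unlabeled mention whose most likely entity
    reaches `threshold`, adds those pseudo-labels to the training set, and
    re-estimates the prior from the enlarged set.
    """
    labeled = list(seed)
    pool = list(unlabeled)
    prior = estimate_prior(labeled)
    for _ in range(rounds):
        remaining = []
        added = False
        for mention in pool:
            dist = prior.get(mention, {})
            if dist:
                entity, prob = max(dist.items(), key=lambda kv: kv[1])
                if prob >= threshold:
                    labeled.append((mention, entity))
                    added = True
                    continue
            remaining.append(mention)
        pool = remaining
        if not added:
            break  # nothing confident enough was found; stop early
        prior = estimate_prior(labeled)
    return prior


# Hypothetical toy data: two seed labels for one entity, one for another.
seed = [
    ("Jaguar", "Jaguar_Cars"),
    ("Jaguar", "Jaguar_Cars"),
    ("Jaguar", "Jaguar_(animal)"),
]
unlabeled = ["Jaguar", "Jaguar", "Jaguar"]
prior = self_train(seed, unlabeled)
```

In this toy run the seed prior for "Jaguar_Cars" is 2/3, which clears the threshold, so all three unlabeled mentions receive that pseudo-label regardless of their true referent and the re-estimated prior rises to 5/6. That drift toward whatever the model already believes is the kind of error reinforcement the abstract says needs to be alleviated.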
Pages: 7088-7102
Page count: 15