Chinese Short Text Entity Linking Based On Semantic Similarity and Entity Correlation

Cited by: 3
Authors
Zhao, Yan [1]
Wang, Yun [1]
Yang, Na [1]
Affiliation
[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing, Peoples R China
Source
2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI) | 2020
Keywords
Entity linking; Random Walk with Restart; BERT; Natural language processing; Deep learning;
DOI
10.1109/ICTAI50040.2020.00073
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
A growing share of the text on the Internet consists of short texts, such as QA queries, social media posts, and news titles, which makes entity linking for short texts important. However, heavy colloquialism and insufficient context make entity linking for Chinese short text especially difficult. In this paper we propose an entity linking model named BERT-RWR, which aims to improve the accuracy of selecting the correct target entity from the candidate set by integrating a deep neural network with a graph model. Specifically, a deep neural network based on fine-tuned BERT computes the mention-entity semantic similarity, and the Random Walk with Restart (RWR) algorithm captures the correlation between candidate entities of different mentions. To select the target entity, BERT-RWR combines (1) the semantic similarity score between each mention and its candidate entities, (2) the prior probability, and (3) the correlation between different candidate entities. To improve the recall of candidate entities, we propose a three-method fusion strategy for candidate generation. Experimental results show that our model outperforms state-of-the-art results for entity linking on Chinese short text datasets.
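The abstract does not give the exact formulation used in BERT-RWR, so the following is only a minimal sketch of generic Random Walk with Restart over a candidate-entity graph. It rests on several assumptions that are not from the paper: the function names, the toy correlation weights, the restart probability of 0.15, and the choice to build the restart vector from a per-candidate local score (standing in for semantic similarity combined with prior probability) are all hypothetical.

```python
def random_walk_with_restart(adjacency, restart, restart_prob=0.15,
                             tol=1e-10, max_iter=1000):
    """Iterate r <- (1 - c) * W r + c * e until the L1 change is below tol.

    adjacency: square list of lists of non-negative edge weights.
    restart:   restart distribution e (non-negative, sums to 1).
    """
    n = len(adjacency)
    # Column-normalise so that each non-empty column of W sums to 1.
    col_sums = [sum(adjacency[i][j] for i in range(n)) for j in range(n)]
    transition = [
        [adjacency[i][j] / col_sums[j] if col_sums[j] > 0 else 0.0
         for j in range(n)]
        for i in range(n)
    ]
    scores = list(restart)
    for _ in range(max_iter):
        updated = [
            (1.0 - restart_prob)
            * sum(transition[i][j] * scores[j] for j in range(n))
            + restart_prob * restart[i]
            for i in range(n)
        ]
        if sum(abs(a - b) for a, b in zip(updated, scores)) < tol:
            return updated
        scores = updated
    return scores


def link_mentions(scores, candidates_by_mention):
    """Pick, for each mention, the candidate index with the highest score."""
    return {
        mention: max(indices, key=lambda i: scores[i])
        for mention, indices in candidates_by_mention.items()
    }


# Toy data (hypothetical): two mentions, two candidate entities each.
candidates_by_mention = {"mention_a": [0, 1], "mention_b": [2, 3]}

# Symmetric correlation weights between candidates of different mentions.
correlation = [
    [0.0, 0.0, 0.9, 0.1],
    [0.0, 0.0, 0.1, 0.1],
    [0.9, 0.1, 0.0, 0.0],
    [0.1, 0.1, 0.0, 0.0],
]

# Assumed local score per candidate, e.g. similarity combined with prior.
local_scores = [0.4, 0.6, 0.5, 0.5]
total = sum(local_scores)
restart = [s / total for s in local_scores]

scores = random_walk_with_restart(correlation, restart)
linked = link_mentions(scores, candidates_by_mention)
print(linked)  # {'mention_a': 0, 'mention_b': 2}
```

In this toy graph, candidate 1 has the higher local score for mention_a, yet candidate 0 is selected because of its strong correlation with candidate 2. That is the kind of cross-mention evidence the abstract attributes to the RWR component.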
Pages: 426-431
Number of pages: 6