Entity-aware Transformers for Entity Search

被引:12
|
作者
Gerritse, Emma J. [1 ]
Hasibi, Faegheh [1 ]
de Vries, Arjen P. [1 ]
机构
[1] Radboud Univ Nijmegen, Nijmegen, Netherlands
来源
PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22) | 2022年
关键词
Entity retrieval; transformers; BERT; entity embeddings;
D O I
10.1145/3477495.3531971
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Pre-trained language models such as BERT have been a key ingredient to achieve state-of-the-art results on a variety of tasks in natural language processing and, more recently, also in information retrieval. Recent research even claims that BERT is able to capture factual knowledge about entity relations and properties, the information that is commonly obtained from knowledge graphs. This paper investigates the following question: Do BERT-based entity retrieval models benefit from additional entity information stored in knowledge graphs? To address this research question, we map entity embeddings into the same input space as a pre-trained BERT model and inject these entity embeddings into the BERT model. This entity-enriched language model is then employed on the entity retrieval task. We show that the entity-enriched BERT model improves effectiveness on entity-oriented queries over a regular BERT model, establishing a new state-of-the-art result for the entity retrieval task, with substantial improvements for complex natural language queries and queries requesting a list of entities with a certain property. Additionally, we show that the entity information provided by our entity-enriched model particularly helps queries related to less popular entities. Last, we observe empirically that the entity-enriched BERT models enable fine-tuning on limited training data, which otherwise would not be feasible due to the known instabilities of BERT in few-sample fine-tuning, thereby contributing to data-efficient training of BERT for entity search.
引用
收藏
页码:1455 / 1465
页数:11
相关论文
共 50 条
  • [1] Entity-aware Image Caption Generation
    Lu, Di
    Whitehead, Spencer
    Huang, Lifu
    Ji, Heng
    Chang, Shih-Fu
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 4013 - 4023
  • [2] Entity-Aware Biaffine Attention for Constituent Parsing
    Bai, Xinyi
    Yin, Nan
    Zhang, Xiang
    Wang, Xin
    Luo, Zhigang
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT I, 2021, 12891 : 191 - 203
  • [3] LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention
    Yamada, Ikuya
    Asai, Akari
    Shindo, Hiroyuki
    Takeda, Hideaki
    Matsumoto, Yuji
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 6442 - 6454
  • [4] Abnormal Entity-Aware Knowledge Graph Completion
    Sun, Ke
    Yu, Shuo
    Peng, Ciyuan
    Li, Xiang
    Naseriparsa, Mehdi
    Xia, Feng
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW, 2022, : 891 - 900
  • [5] Entity-Aware Social Media Reading Comprehension
    Liu, Hao
    Hong, Yu
    Zhu, Qiao-Ming
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2022, 13630 : 197 - 210
  • [6] Entity-Aware Language Model as an Unsupervised Reranker
    Rasooli, Mohammad Sadegh
    Parthasarathy, Sarangarajan
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 406 - 410
  • [7] DeepLife: An Entity-aware Search, Analytics and Exploration Platform for Health and Life Sciences
    Ernst, Patrick
    Siu, Amy
    Milchevski, Dragan
    Hoffart, Johannes
    Weikum, Gerhard
    PROCEEDINGS OF 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL-2016): SYSTEM DEMONSTRATIONS, 2016, : 19 - 24
  • [8] PoliToHFI at SemEval-2023 Task 6: Leveraging Entity-Aware and Hierarchical Transformers For Legal Entity Recognition and Court Judgment Prediction
    Benedetto, Irene
    Koudounas, Alkis
    Vaiani, Lorenzo
    Pastor, Eliana
    Baralis, Elena
    Cagliero, Luca
    Tarasconi, Francesco
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1401 - 1411
  • [9] Software Knowledge Entity Relation Extraction with Entity-Aware and Syntactic Dependency Structure Information
    Tang, Mingjing
    Li, Tong
    Wang, Wei
    Zhu, Rui
    Ma, Zifei
    Tang, Yahui
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [10] EASAL: Entity-Aware Subsequence-Based Active Learning for Named Entity Recognition
    Liu, Yang
    Hu, Jinpeng
    Chen, Zhihong
    Wan, Xiang
    Chang, Tsung-Hui
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 7, 2023, : 8897 - 8905