Entity-aware Transformers for Entity Search

被引:12
|
作者
Gerritse, Emma J. [1 ]
Hasibi, Faegheh [1 ]
de Vries, Arjen P. [1 ]
机构
[1] Radboud Univ Nijmegen, Nijmegen, Netherlands
来源
PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22) | 2022年
关键词
Entity retrieval; transformers; BERT; entity embeddings;
D O I
10.1145/3477495.3531971
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Pre-trained language models such as BERT have been a key ingredient to achieve state-of-the-art results on a variety of tasks in natural language processing and, more recently, also in information retrieval. Recent research even claims that BERT is able to capture factual knowledge about entity relations and properties, the information that is commonly obtained from knowledge graphs. This paper investigates the following question: Do BERT-based entity retrieval models benefit from additional entity information stored in knowledge graphs? To address this research question, we map entity embeddings into the same input space as a pre-trained BERT model and inject these entity embeddings into the BERT model. This entity-enriched language model is then employed on the entity retrieval task. We show that the entity-enriched BERT model improves effectiveness on entity-oriented queries over a regular BERT model, establishing a new state-of-the-art result for the entity retrieval task, with substantial improvements for complex natural language queries and queries requesting a list of entities with a certain property. Additionally, we show that the entity information provided by our entity-enriched model particularly helps queries related to less popular entities. Last, we observe empirically that the entity-enriched BERT models enable fine-tuning on limited training data, which otherwise would not be feasible due to the known instabilities of BERT in few-sample fine-tuning, thereby contributing to data-efficient training of BERT for entity search.
引用
收藏
页码:1455 / 1465
页数:11
相关论文
共 50 条
  • [21] On-the-Fly Entity-Aware Query Processing in the Presence of Linkage
    Ioannou, Ekaterini
    Nejdl, Wolfgang
    Niederee, Claudia
    Velegrakis, Yannis
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2010, 3 (01): : 429 - 438
  • [22] Show, Write, and Retrieve: Entity-aware Article Generation and Retrieval
    Zhang, Zhongping
    Gu, Yiwen
    Plummer, Bryan A.
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 8684 - 8704
  • [23] Entity-aware Collaborative Relation Network with Knowledge Graph for Recommendation
    Huang, Ruoran
    Han, Chuanqi
    Cui, Li
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3098 - 3102
  • [24] SEntFiN 1.0: Entity-aware sentiment analysis for financial news
    Sinha, Ankur
    Kedas, Satishwar
    Kumar, Rishu
    Malo, Pekka
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2022, 73 (09) : 1314 - 1335
  • [25] End-to-end entity-aware neural machine translation
    Xie, Shufang
    Xia, Yingce
    Wu, Lijun
    Huang, Yiqing
    Fan, Yang
    Qin, Tao
    MACHINE LEARNING, 2022, 111 (03) : 1181 - 1203
  • [26] Knowledge Base Entity Typing From Text via Entity-Aware Heterogeneous Graph Attention Network
    Xu, Bo
    Sun, Zhong
    Du, Ming
    Song, Hui
    Wang, Hongya
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [27] Entity-aware Multi-task Learning for Query Understanding at Walmart
    Peng, Zhiyuan
    Dave, Vachik
    McNabb, Nicole
    Sharnagat, Rahul
    Magnani, Alessandro
    Liao, Ciya
    Fang, Yi
    Rajanala, Sravanthi
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 4733 - 4742
  • [28] Semantic Relation Classification via Bidirectional LSTM Networks with Entity-Aware Attention Using Latent Entity Typing
    Lee, Joohong
    Seo, Sangwoo
    Choi, Yong Suk
    SYMMETRY-BASEL, 2019, 11 (06):
  • [29] Show, Interpret and Tell: Entity-Aware Contextualised Image Captioning in Wikipedia
    Nguyen, Khanh
    Furkan Biten, Ali
    Mafla, Andres
    Gomez, Lluis
    Karatzas, Dimosthenis
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1940 - 1948
  • [30] Editorial: Ubiquitous Computing Entity-aware Data Management on Mobile Devices
    Khan, Faheem
    Ullah, Rahat
    Laila, Umm e
    MOBILE NETWORKS & APPLICATIONS, 2024, 29 (02): : 398 - 400