Ad hoc retrieval via entity linking and semantic similarity

Cited by: 0
Authors
Faezeh Ensan
Weichang Du
Affiliations
[1] Ferdowsi University of Mashhad, Department of Computer Engineering
[2] University of New Brunswick, Faculty of Computer Science
Source
Knowledge and Information Systems | 2019 / Vol. 58
Keywords
Semantic search; Ad hoc retrieval; Entity linking; Semantic relatedness; Language models
DOI
Not available
Abstract
Semantic search has emerged as a possible way to address the challenges of traditional keyword-based retrieval systems, such as the vocabulary gap between the query and document spaces. In this paper, we propose a novel semantic retrieval framework that uses semantic entity linking systems to form a graph representation of documents and queries, where nodes represent concepts extracted from documents and edges represent semantic relatedness between those concepts. The core of our proposed work is a semantic-enabled language model that estimates the probability of generating query concepts given values assigned to document concepts. The semantic retrieval framework also provides a basis for interpolating keyword-based retrieval systems with the semantic-enabled language model. We conduct comprehensive experiments over several TREC document collections and analyze the performance of different configurations of the framework across multiple retrieval measures. Our experimental results show that the proposed semantic retrieval model has a synergistic impact on the results obtained through state-of-the-art keyword-based systems, and that semantic information can complement and enhance the performance of such retrieval models.
Pages: 551-583
Number of pages: 32