Towards Better Text Understanding and Retrieval through Kernel Entity Salience Modeling

被引:29
作者
Xiong, Chenyan [1 ]
Liu, Zhengzhong [1 ]
Callan, Jamie [1 ]
Liu, Tie-Yan [2 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Microsoft Res, Redmond, WA USA
来源
ACM/SIGIR PROCEEDINGS 2018 | 2018年
基金
美国国家科学基金会;
关键词
Text Understanding; Entity Salience; Entity-Oriented Search;
D O I
10.1145/3209978.3209982
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a Kernel Entity Salience Model (KESM) that improves text understanding and retrieval by better estimating entity salience (importance) in documents. KESM represents entities by knowledge enriched distributed representations, models the interactions between entities and words by kernels, and combines the kernel scores to estimate entity salience. The whole model is learned end-to-end using entity salience labels. The salience model also improves ad hoc search accuracy, providing effective ranking features by modeling the salience of query entities in candidate documents. Our experiments on two entity salience corpora and two TREC ad hoc search datasets demonstrate the effectiveness of KESM over frequency-based and feature-based methods. We also provide examples showing how KESM conveys its text understanding ability learned from entity salience to search.
引用
收藏
页码:575 / 584
页数:10
相关论文
共 31 条
[1]  
[Anonymous], 2010, WSDM 2010 P 3 ACM IN, DOI DOI 10.1145/1718487.1718542
[2]  
[Anonymous], 2010, Search engines: Information retrieval in practice
[3]  
[Anonymous], ARXIV10063498
[4]  
[Anonymous], 2002, P ACM SIGKDD KDD 200
[5]  
[Anonymous], P 10 ED LANG RES EV
[6]  
[Anonymous], 2013, ADV NEURAL INF PROCE
[7]  
[Anonymous], P 26 INT C WORLD WID
[8]   Semantic Search on Text and Knowledge Bases [J].
Bast, Hannah ;
Buchhold, Bjoern ;
Haussmann, Elmar .
FOUNDATIONS AND TRENDS IN INFORMATION RETRIEVAL, 2016, 10 (2-3) :120-+
[9]  
Bendersky M, 2011, PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), P605
[10]   Graph-based term weighting for information retrieval [J].
Blanco, Roi ;
Lioma, Christina .
INFORMATION RETRIEVAL, 2012, 15 (01) :54-92