Modeling positive and negative feedback for improving document retrieval

Cited by: 3

Authors:

Hao, Shufeng [1]

Shi, Chongyang [1]

Niu, Zhendong [1]

Cao, Longbing [2]

Affiliations:

[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing 100081, Peoples R China

[2] Univ Technol Sydney, Adv Analyt Inst, Sydney, NSW 2007, Australia

Source:

EXPERT SYSTEMS WITH APPLICATIONS | 2019 / Vol. 120

Funding:

National Natural Science Foundation of China;

Keywords:

Pseudo-relevance feedback; Negative feedback; Positive feedback; Language model

DOI:

10.1016/j.eswa.2018.11.035

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Pseudo-relevance feedback (PRF) has evident potential for enriching the representation of short queries. Traditional PRF methods treat top-ranked documents as feedback, since they are assumed to be relevant to the query. However, some of these feedback documents may actually distract from the query topic for a range of reasons and accordingly downgrade PRF system performance. Such documents constitute negative examples (negative feedback) but could also be valuable in retrieval. In this paper, a novel framework of query language model construction is proposed in order to improve retrieval performance by integrating both positive and negative feedback. First, an improvement-based method is proposed to automatically identify the types of feedback documents (i.e. positive or negative) according to whether the document enhances the retrieval's effectiveness. Subsequently, based on the learned positive and negative examples, the positive feedback models and the negative feedback models are estimated using an Expectation-Maximization algorithm with the assumptions: the positive term distribution is affected by the context term distribution and the negative term distribution is affected by both the positive term distribution and the context term distribution (such that the positive feedback model upgrades the rankings of relevant documents and the negative feedback model prunes the irrelevant documents from a query). Finally, a content-based representativeness criterion is proposed in order to obtain the representative negative feedback documents. Experiments conducted on the TREC collections demonstrate that our proposed approach results in better retrieval accuracy and robustness than baseline methods. (C) 2018 Elsevier Ltd. All rights reserved.
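
The abstract mentions estimating feedback models with an Expectation-Maximization algorithm. As a rough illustration of that kind of estimation, the sketch below implements the classical two-component mixture feedback model, in which each term occurrence in the feedback documents is drawn either from a feedback topic model or from a fixed background model. This is a simplification and not the authors' framework: it has no separate positive and negative models and no dependency between them. The function name `estimate_feedback_model`, the mixing weight `lam`, the iteration count, and the toy documents are all assumptions made for the example.

```python
from collections import Counter


def estimate_feedback_model(feedback_docs, background, lam=0.5, iters=30):
    """EM for a two-component mixture over feedback-document terms.

    Each term occurrence is assumed to come from the feedback topic model
    (probability 1 - lam) or from a fixed background model (probability lam).
    Hypothetical helper for illustration only.
    """
    counts = Counter(term for doc in feedback_docs for term in doc)
    vocab = list(counts)
    # Start from a uniform topic model over the observed vocabulary.
    theta = {w: 1.0 / len(vocab) for w in vocab}
    for _ in range(iters):
        # E-step: posterior probability that an occurrence of w
        # was generated by the topic model rather than the background.
        posterior = {}
        for w in vocab:
            topic = (1 - lam) * theta[w]
            posterior[w] = topic / (topic + lam * background.get(w, 1e-9))
        # M-step: re-estimate the topic model from expected topic counts.
        total = sum(counts[w] * posterior[w] for w in vocab)
        theta = {w: counts[w] * posterior[w] / total for w in vocab}
    return theta


# Toy data (assumed): "the" is frequent in the background collection.
docs = [["the", "retrieval", "feedback", "the"], ["the", "retrieval", "query"]]
background = {"the": 0.5, "retrieval": 0.01, "feedback": 0.01, "query": 0.01}
theta = estimate_feedback_model(docs, background)
for word, prob in sorted(theta.items(), key=lambda kv: -kv[1]):
    print(f"{word}: {prob:.3f}")
```

In this toy run the term "the" is the most frequent in the feedback documents, yet it ends up with less probability mass than "retrieval" because the background model already explains most of its occurrences. A negative feedback model in the spirit of the abstract would need an additional component that is itself conditioned on the positive and context distributions, which this sketch does not attempt.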


Pages: 253-261

Page count: 9
