Predicting Query Performance by Query-Drift Estimation

被引:100
作者
Shtok, Anna [1 ]
Kurland, Oren [1 ]
Carmel, David [2 ]
Raiber, Fiana [1 ]
Markovits, Gad [1 ]
机构
[1] Technion Israel Inst Technol, Fac Ind Engn & Management, IL-32000 Haifa, Israel
[2] IBM Res, Haifa Labs, IL-31905 Haifa, Israel
基金
以色列科学基金会;
关键词
Algorithms; Experimentation; Query-performance prediction; query drift; score distribution; MODELS;
D O I
10.1145/2180868.2180873
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Predicting query performance, that is, the effectiveness of a search performed in response to a query, is a highly important and challenging problem. We present a novel approach to this task that is based on measuring the standard deviation of retrieval scores in the result list of the documents most highly ranked. We argue that for retrieval methods that are based on document-query surface-level similarities, the standard deviation can serve as a surrogate for estimating the presumed amount of query drift in the result list, that is, the presence (and dominance) of aspects or topics not related to the query in documents in the list. Empirical evaluation demonstrates the prediction effectiveness of our approach for several retrieval models. Specifically, the prediction quality often transcends that of current state-of-the-art prediction methods.
引用
收藏
页数:35
相关论文
共 67 条
[1]  
ABDUL-JALEEL N., 2004, P TEXT RETR C TREC 1
[2]  
Amati G, 2004, LECT NOTES COMPUT SC, V2997, P127
[3]  
[Anonymous], 2006, P SIGIR
[4]  
[Anonymous], 2003, INFORM RETRIEVAL BOO
[5]  
[Anonymous], 1998, SIGIR 98 P 21 ANN IN, DOI DOI 10.1145/290941.291008
[6]  
[Anonymous], IR338 U MASS CTR INT
[7]   Modeling score distributions in information retrieval [J].
Arampatzis, Avi ;
Robertson, Stephen .
INFORMATION RETRIEVAL, 2011, 14 (01) :26-46
[8]   Where to Stop Reading a Ranked List? Threshold Optimization using Truncated Score Distributions [J].
Arampatzis, Avi ;
Kamps, Jaap ;
Robertson, Stephen .
PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, :524-531
[9]  
Aslam JA, 2007, LECT NOTES COMPUT SC, V4425, P198
[10]  
Bendersky M., 2011, P 4 ACM INT C WEB SE, P95, DOI DOI 10.1145/1935826.1935849