Rank-Biased Precision for Measurement of Retrieval Effectiveness

被引:336
作者
Moffat, Alistair [1 ]
Zobel, Justin [2 ]
机构
[1] Univ Melbourne, Dept Comp Sci & Software Engn, Melbourne, Vic 3010, Australia
[2] RMIT Univ, Melbourne, Vic, Australia
关键词
Experimentation; Measurement; Human Factors; Recall; precision; average precision; relevance; pooling; RELEVANCE; RECALL;
D O I
10.1145/1416950.1416952
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A range of methods for measuring the effectiveness of information retrieval systems has been proposed. These are typically intended to provide a quantitative single-value summary of a document ranking relative to a query. However, many of these measures have failings. For example, recall is not well founded as a measure of satisfaction, since the user of an actual system cannot judge recall. Average precision is derived from recall, and suffers from the same problem. In addition, average precision lacks key stability properties that are needed for robust experiments. In this article, we introduce a new effectiveness metric, rank-biased precision, that avoids these problems. Rank-biased precision is derived from a simple model of user behavior, is robust if answer rankings are extended to greater depths, and allows accurate quantification of experimental uncertainty, even when only partial relevance judgments are available.
引用
收藏
页数:27
相关论文
共 38 条
[1]  
[Anonymous], 2005, Experiment and Evaluation in Information Retrieval
[2]  
[Anonymous], 2007, P 30 ANN INT ACM SIG, DOI DOI 10.1145/1277741.1277756
[3]  
ASLAM JA, 2006, P 29 ANN INT ACM SIG, P541
[4]  
Borlund P., 1998, Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P324, DOI 10.1145/290941.291019
[5]  
Buckley C., 2004, Proceedings of Sheffield SIGIR 2004. The Twenty-Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P25, DOI 10.1145/1008992.1009000
[6]  
Buttcher S., 2007, P 30 ANN INT ACM SIG, P63, DOI DOI 10.1145/1277741.1277755
[7]  
Carterette B., 2006, Proceedings of the Twenty-Ninth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P268, DOI 10.1145/1148170.1148219
[8]   SELECTING A MEASURE OF RETRIEVAL EFFECTIVENESS [J].
COOPER, WS .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1973, 24 (02) :87-100
[9]  
COPPER WS, 1968, AM DOC, V19, P30
[10]  
Cormack G. V., 2006, Proceedings of the Twenty-Ninth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P533, DOI 10.1145/1148170.1148262