A Similarity Measure for Indefinite Rankings

被引:496
作者
Webber, William [1 ]
Moffat, Alistair [1 ]
Zobel, Justin [1 ]
机构
[1] Univ Melbourne, Dept Comp Sci & Software Engn, Melbourne, Vic 3010, Australia
基金
澳大利亚研究理事会;
关键词
Experimentation; Measurement; Human Factors; Rank correlation; probabilistic models; ranking; COMPARING RANKINGS;
D O I
10.1145/1852102.1852106
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ranked lists are encountered in research and daily life and it is often of interest to compare these lists even when they are incomplete or have only some members in common. An example is document rankings returned for the same query by different search engines. A measure of the similarity between incomplete rankings should handle nonconjointness, weight high ranks more heavily than low, and be monotonic with increasing depth of evaluation; but no measure satisfying all these criteria currently exists. In this article, we propose a new measure having these qualities, namely rank-biased overlap (RBO). The RBO measure is based on a simple probabilistic user model. It provides monotonicity by calculating, at a given depth of evaluation, a base score that is non-decreasing with additional evaluation, and a maximum score that is nonincreasing. An extrapolated score can be calculated between these bounds if a point estimate is required. RBO has a parameter which determines the strength of the weighting to top ranks. We extend RBO to handle tied ranks and rankings of different lengths. Finally, we give examples of the use of the measure in comparing the results produced by public search engines and in assessing retrieval systems in the laboratory.
引用
收藏
页数:38
相关论文
共 23 条
[1]  
[Anonymous], 1997, ART COMPUTER PROGRAM
[2]  
[Anonymous], 1948, RANK CORRELATION MET
[3]  
[Anonymous], 2008, INT ACM SIGIR C RES, DOI DOI 10.1145/1390334.1390435
[4]   Comparing rankings of search results on the Web [J].
Bar-Ilan, J .
INFORMATION PROCESSING & MANAGEMENT, 2005, 41 (06) :1511-1519
[5]   Methods for comparing rankings of search engine results [J].
Bar-Ilan, Judit ;
Mat-Hassan, Mazlita ;
Levene, Mark .
COMPUTER NETWORKS, 2006, 50 (10) :1448-1463
[6]   Rank correlation - An alternative measure [J].
Blest, DC .
AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2000, 42 (01) :101-111
[7]  
Buckley C., 2004, Proceedings of Sheffield SIGIR 2004. The Twenty-Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P506, DOI 10.1145/1008992.1009093
[8]   On Rank Correlation and the Distance Between Rankings [J].
Carterette, Ben .
PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, :436-443
[9]  
Cliff N., 1996, Ordinal methods for behavioral data analysis
[10]   Comparing top k lists [J].
Fagin, R ;
Kumar, R ;
Sivakumar, D .
SIAM JOURNAL ON DISCRETE MATHEMATICS, 2003, 17 (01) :134-160