On Rank Correlation and the Distance Between Rankings

被引:40
作者
Carterette, Ben [1 ]
机构
[1] Univ Delaware, Dept Comp & Informat Sci, Newark, DE 19716 USA
来源
PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL | 2009年
关键词
information retrieval; evaluation; rank correlation; distance measure;
D O I
10.1145/1571941.1572017
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Rank correlation statistics are useful for determining whether a there is a correspondence between two measurements, particularly when the measures themselves are of less interest than their relative ordering. Kendall's tau in particular has found use in Information Retrieval as a "meta-evaluation" measure: it has been used to compare evaluation measures., evaluate system rankings, and evaluate predicted performance. In the meta-evaluation domain, however, correlations between systems confound relationships between measurements, practically guaranteeing a positive and significant estimate of tau regardless of any actual correlation between the measurements. We introduce an alternative measure of distance between rankings that corrects this by explicitly accounting for correlations between systems over a sample of topics, and moreover has a probabilistic interpretation for use in a test of statistical significance. We validate our measure with theory, simulated data, and experiment.
引用
收藏
页码:436 / 443
页数:8
相关论文
共 17 条
[1]  
ALLAN J, 2007, P TREC, P67203
[2]  
[Anonymous], 2007, P 30 ANN INT ACM SIG, DOI DOI 10.1145/1277741.1277756
[3]  
[Anonymous], 2008, INT ACM SIGIR C RES, DOI [DOI 10.1145/1390334.1390435, 10.1145/1390334.1390435]
[4]  
Aslam Javed., 2003, P SIGIR, P361
[5]  
BOYD S, 2004, CONVEX OPTIMIZATION, P67203
[6]  
Buckley C., 2004, Proceedings of Sheffield SIGIR 2004. The Twenty-Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P25, DOI 10.1145/1008992.1009000
[7]  
CORMACK GV, 2007, P SIGIR, P837, DOI DOI 10.1145/1277741.1277934
[8]  
JOHNSON RA, 1982, APPL MULTIVARIATE ST, P67203
[9]  
KENDALL M, 1970, RANK CORRELATION MET, P67203
[10]  
Melucci M., 2007, SIGIR Forum, V41, P18, DOI 10.1145/1273221.1273223