Desirable Properties for Diversity and Truncated Effectiveness Metrics

被引:17
作者
Albahem, Ameer [1 ]
Spina, Damiano [1 ]
Scholer, Falk [1 ]
Moffat, Alistair [2 ]
Cavedon, Lawrence [1 ]
机构
[1] RMIT Univ, Melbourne, Vic, Australia
[2] Univ Melbourne, Melbourne, Vic, Australia
来源
ADCS'18: PROCEEDINGS OF THE 23RD AUSTRALASIAN DOCUMENT COMPUTING SYMPOSIUM | 2018年
基金
澳大利亚研究理事会;
关键词
Evaluation; Search result diversification; Axiomatic analysis;
D O I
10.1145/3291992.3291996
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A wide range of evaluation metrics have been proposed to measure the quality of search results, including in the presence of diversification. Some of these metrics have been adapted for use in search tasks with different complexities, such as where the search system returns lists of different lengths. Given the range of requirements, it can be difficult to compare the behavior of these metrics. In this work, we examine effectiveness metrics using a simple property-based approach. In particular, we present a case-analysis framework to define and study fundamental properties that seem integral to any evaluation metric. An example of a simple property is that a ranking with only one non-relevant document should never score lower than a ranking with two non-relevant documents. The framework facilitates quantifying the ability of metrics to satisfy properties, both separately and simultaneously, and to identify those cases where properties are violated. Our analysis shows that the Average Cube Test and Intent-Aware Average Precision are two metrics which fail to satisfy the desirable properties, and hence should be used with caution.
引用
收藏
页数:7
相关论文
共 30 条
[1]  
Agrawal R., 2009, P 2 ACM INT C WEB SE, DOI DOI 10.1145/1498759.1498766
[2]   Axiomatic Thinking for Information Retrieval - And Related Tasks [J].
Amigo, Enrique ;
Fang, Hui ;
Mizzaro, Stefano ;
Zhai, ChengXiang .
SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, :1419-1420
[3]   Are we on the Right Track? An Examination of Information Retrieval Methodologies [J].
Amigo, Enrique ;
Fang, Hui ;
Mizzaro, Stefano ;
Zhai, ChengXiang .
ACM/SIGIR PROCEEDINGS 2018, 2018, :997-1000
[4]   An Axiomatic Analysis of Diversity Evaluation Metrics: Introducing the Rank-Biased Utility Metric [J].
Amigo, Enrique ;
Spina, Damiano ;
Carrillo-de-Albornoz, Jorge .
ACM/SIGIR PROCEEDINGS 2018, 2018, :625-634
[5]  
Amigó E, 2013, SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, P643
[6]  
[Anonymous], 2008, P 2008 INT C WEB SEA, DOI [10.1145/1341531.1341545, 10.1145/1341531, DOI 10.1145/1341531.1341545]
[7]  
[Anonymous], 2009, CIKM, DOI DOI 10.1145/1645953.1646033
[8]  
Clarke CLA, 2009, LECT NOTES COMPUT SC, V5766, P188, DOI 10.1007/978-3-642-04417-5_17
[9]  
Clarke Charles L. A., 2012, P TREC
[10]  
Clarke CharlesL A., 2011, Proceedings of the fourth ACM international conference on Web search and data mining, P75