Measuring Constraint Violations in Information Retrieval

被引:2
作者
Cummins, Ronan [1 ]
O'Riordan, Colm [1 ]
机构
[1] Natl Univ Ireland, Digital Enterprise Res Inst, Galway, Ireland
来源
PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL | 2009年
关键词
Information Retrieval; Constraints; Axioms;
D O I
10.1145/1571941.1572096
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, an inductive approach to modelling term-weighting function correctness has provided a number of axioms (constraints), to which all good term-weighting functions should adhere. These constraints have been shown to be theoretically and empirically sound in a number of works [2, 3, 1]. It has been shown that when a term-weighting function breaks one or more of the constraints, it typically indicates sub-optimality of that function. This elegant inductive approach may more accurately model the human process of determining the relevance a document. It is intuitive that a person's notion of relevance changes as terms that are either on or off-topic are encountered in a given document. Ultimately, it would be desirable to be able to mathematically determine the performance of term-weighting functions without the need for test collections. Many modern term-weighting functions do riot, satisfy the constraints in an unconditional manner [3]. However, the degree to which these functions violate the constraints has not been investigated. A comparison between weighting functions from this perspective may shed light on the poor performance of certain functions in certain settings. Moreover, if a correlation exists between performance and the number of violations, measuring the degree of violation could help more accurately predict how a certain scheme will perform on a given collection.
引用
收藏
页码:722 / 723
页数:2
相关论文
共 3 条
[1]   An axiomatic comparison of learned term-weighting schemes in information retrieval: clarifications and extensions [J].
Cummins, Ronan ;
O'Riordan, Colm .
ARTIFICIAL INTELLIGENCE REVIEW, 2007, 28 (01) :51-68
[2]  
FANG H, 2004, SIGIR 04, P49, DOI DOI 10.1145/1008992.1009004
[3]  
FANG H, 2005, SIGIR 05, P480