Variations in relevance judgments and the measurement of retrieval effectiveness

被引：181

作者：

Voorhees, EM ^{[1
]}

机构：

[1] Natl Inst Stand & Technol, Gaithersburg, MD 20899 USA

来源：

INFORMATION PROCESSING & MANAGEMENT | 2000年 / 36卷 / 05期

关键词：

relevance; test collections; text retrieval evaluation; TREC;

D O I：

10.1016/S0306-4573(00)00010-8

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Test collections have traditionally been used by information retrieval researchers to improve their retrieval strategies. To be viable as a laboratory tool, a collection must reliably rank different retrieval variants according to their true effectiveness. In particular, the relative effectiveness of two retrieval strategies should be insensitive to modest changes in the relevant document set since individual relevance assessments are known to vary widely. The test collections developed in the TREC workshops have become the collections of choice in the retrieval research community. To verify their reliability, NIST investigated the effect changes in the relevance assessments have on the evaluation of retrieval results. Very high correlations were found among the rankings of systems Produced using different relevance judgment sets. The high correlations indicate that the comparative evaluation of retrieval performance is stable despite substantial differences in relevance judgments, and thus reaffirm the use of the TREC collections as laboratory tools. Published by Elsevier Science Ltd.

引用

页码：697 / 716

页数：20

共 18 条

[1] [Anonymous], 1970, 3 CRANF I TECHN
[2] [Anonymous], 1983, ENCY STAT SCI
[3] INFORMATION FILTERING AND INFORMATION-RETRIEVAL - 2 SIDES OF THE SAME COIN
BELKIN, NJ
CROFT, WB
[J]. COMMUNICATIONS OF THE ACM, 1992, 35 (12) : 29 - 38
[4] VARIATIONS IN RELEVANCE JUDGMENTS AND THE EVALUATION OF RETRIEVAL PERFORMANCE
BURGIN, R
[J]. INFORMATION PROCESSING & MANAGEMENT, 1992, 28 (05) : 619 - 627
[5] Cleverdon C., 1968, Factors determining the performance of indexing systems
[6] CORMACK GV, 1998, NIST SPECIAL PUBLICA, V500
[7] OPENING BLACK BOX OF RELEVANCE
CUADRA, CA
KATTER, RV
[J]. JOURNAL OF DOCUMENTATION, 1967, 23 (04) : 291 - &
[8] THE 1ST TEXT RETRIEVAL CONFERENCE (TREC-1) ROCKVILLE, MD, USA, 4-6 NOVEMBER, 1992
HARMAN, DK
[J]. INFORMATION PROCESSING & MANAGEMENT, 1993, 29 (04) : 411 - 414
[9] Harter SP, 1996, J AM SOC INFORM SCI, V47, P37, DOI 10.1002/(SICI)1097-4571(199601)47:1<37::AID-ASI4>3.0.CO
[10] 2-3

← 1 2 →