Test collection based evaluation of information retrieval systems

Cited by: 190
Authors
Sanderson M. [1]
Affiliation
[1] Information School, University of Sheffield, Sheffield
Source
Foundations and Trends in Information Retrieval | 2010 / Vol. 4 / Issue 04
DOI
10.1561/1500000009
Abstract
Use of test collections and evaluation measures to assess the effectiveness of information retrieval systems has its origins in work dating back to the early 1950s. Across the nearly 60 years since that work started, use of test collections is a de facto standard of evaluation. This monograph surveys the research conducted and explains the methods and measures devised for evaluation of retrieval systems, including a detailed look at the use of statistical significance testing in retrieval experimentation. This monograph reviews more recent examinations of the validity of the test collection approach and evaluation measures as well as outlining trends in current research exploiting query logs and live labs. At its core, the modern-day test collection is little different from the structures that the pioneering researchers in the 1950s and 1960s conceived of. This tutorial and review shows that despite its age, this long-standing evaluation method is still a highly valued tool for retrieval research. © 2010 M. Sanderson.
Pages: 247 - 375
Page count: 128
Related papers
50 in total
  • [41] Model for the evaluation of expansion techniques in information retrieval systems
    Aigrain, Philippe
    Longueville, Veronique
    Journal of the American Society for Information Science, 1994, 45 (04):
  • [42] A Test Collection for Interactive Lifelog Retrieval
    Gurrin, Cathal
    Schoeffmann, Klaus
    Joho, Hideo
    Munzer, Bernd
    Albatal, Rami
    Hopfgartner, Frank
    Zhou, Liting
    Dang-Nguyen, Duc-Tien
    MULTIMEDIA MODELING (MMM 2019), PT I, 2019, 11295 : 312 - 324
  • [43] Human information behaviour and design, development and evaluation of information retrieval systems
    Keshavarz, Hamid
    PROGRAM-ELECTRONIC LIBRARY AND INFORMATION SYSTEMS, 2008, 42 (04) : 391 - 401
  • [44] Evaluating the performance of information retrieval systems using test collections
    Clough, Paul
    Sanderson, Mark
    INFORMATION RESEARCH-AN INTERNATIONAL ELECTRONIC JOURNAL, 2013, 18 (02):
  • [45] Complex document information processing: Prototype, test collection, and evaluation
    Agam, G.
    Argamon, S.
    Frieder, O.
    Grossman, D.
    Lewis, D.
    DOCUMENT RECOGNITION AND RETRIEVAL XIII, 2006, 6067
  • [46] Partial Collection Replication for Information Retrieval
    Zhihong Lu
    Kathryn S. McKinley
    Information Retrieval, 2003, 6 : 159 - 198
  • [48] Recall-Oriented Evaluation for Information Retrieval Systems
    Audeh, Bissan
    Beaune, Philippe
    Beigbeder, Michel
    MULTIDISCIPLINARY INFORMATION RETRIEVAL, 2013, 8201 : 29 - 32
  • [49] Text retrieval conferences (TRECs): Providing a test-bed for information retrieval systems
    Harman, Donna
    Bulletin of the American Society for Information Science, 1998, 24 (04):
  • [50] MINICOMPUTER BASED INFORMATION-RETRIEVAL SYSTEMS
    VASTOLA, FJ
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1975, (169) : 15 - 15