Test collection based evaluation of information retrieval systems

被引:190
|
作者
Sanderson M. [1 ]
机构
[1] Information School, University of Sheffield, Sheffield
来源
Foundations and Trends in Information Retrieval | 2010年 / 4卷 / 04期
关键词
D O I
10.1561/1500000009
中图分类号
学科分类号
摘要
Use of test collections and evaluation measures to assess the effectiveness of information retrieval systems has its origins in work dating back to the early 1950s. Across the nearly 60 years since that work started, use of test collections is a de facto standard of evaluation. This monograph surveys the research conducted and explains the methods and measures devised for evaluation of retrieval systems, including a detailed look at the use of statistical significance testing in retrieval experimentation. This monograph reviews more recent examinations of the validity of the test collection approach and evaluation measures as well as outlining trends in current research exploiting query logs and live labs. At its core, the modern-day test collection is little different from the structures that the pioneering researchers in the 1950s and 1960s conceived of. This tutorial and review shows that despite its age, this long-standing evaluation method is still a highly valued tool for retrieval research. © 2010 M. Sanderson.
引用
收藏
页码:247 / 375
页数:128
相关论文
共 50 条
  • [1] Mahak: A test collection for evaluation of farsi information retrieval systems
    Esmaili, Kyumars Sheykh
    Abolhassani, Hassan
    Neshati, Mahmood
    Behrangi, Ehsan
    Rostami, Asreen
    Nasiri, Mojtaba Mohammadi
    2007 IEEE/ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1 AND 2, 2007, : 639 - +
  • [2] A METHODOLOGY FOR TEST AND EVALUATION OF INFORMATION RETRIEVAL SYSTEMS
    GOFFMAN, W
    NEWILL, VA
    INFORMATION STORAGE AND RETRIEVAL, 1966, 3 (01): : 19 - +
  • [3] Collection profiling for collection fusion in distributed information retrieval systems
    Lu, Chengye
    Xu, Yue
    Geva, Shlomo
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, 2007, 4798 : 279 - 288
  • [4] CURE: Collection for Urdu Information Retrieval Evaluation and Ranking
    Iqbal, Muntaha
    Tahir, Bilal
    Mehmood, Muhammad Amir
    2021 INTERNATIONAL CONFERENCE ON DIGITAL FUTURES AND TRANSFORMATIVE TECHNOLOGIES (ICODT2), 2021,
  • [5] Pooling-based continuous evaluation of information retrieval systems
    Tonon, Alberto
    Demartini, Gianluca
    Cudre-Mauroux, Philippe
    INFORMATION RETRIEVAL JOURNAL, 2015, 18 (05): : 445 - 472
  • [6] Information retrieval test collection for searching spontaneous Czech speech
    Ircing, Pavel
    Pecina, Pavel
    Oard, Douglas W.
    Wang, Jianqiang
    White, Ryen W.
    Hoidekr, Jan
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2007, 4629 : 439 - +
  • [7] Information Retrieval Using a Macedonian Test Collection for Question Answering
    Armenska, Jasmina
    Tomovski, Aleksandar
    Zdravkova, Katerina
    Pehcevski, Jovan
    ICT INNOVATIONS 2010, 2011, 83 : 205 - +
  • [8] Pooling-based continuous evaluation of information retrieval systems
    Alberto Tonon
    Gianluca Demartini
    Philippe Cudré-Mauroux
    Information Retrieval Journal, 2015, 18 : 445 - 472
  • [9] On the evaluation of Geographic Information Retrieval systems
    Palacio, Damien
    Cabanac, Guillaume
    Sallaberry, Christian
    Hubert, Gilles
    INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2010, 11 (02) : 91 - 109
  • [10] EVALUATION OF INFORMATION-RETRIEVAL SYSTEMS
    FARRADANE, J
    JOURNAL OF DOCUMENTATION, 1974, 30 (02) : 195 - 209