Document-based approach to improve the accuracy of pairwise comparison in evaluating information retrieval systems

被引:1
作者
Ravana, Sri Devi [1 ]
Taheri, Masumeh Sadat [1 ]
Rajagopal, Prabha [1 ]
机构
[1] Univ Malaya, Dept Informat Syst, Kuala Lumpur, Malaysia
关键词
Information retrieval; Document-based evaluation; Information retrieval evaluation; Pairwise comparison; Significance test; SIGN TEST; TIES;
D O I
10.1108/AJIM-12-2014-0171
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose - The purpose of this paper is to propose a method to have more accurate results in comparing performance of the paired information retrieval (IR) systems with reference to the current method, which is based on the mean effectiveness scores of the systems across a set of identified topics/queries. Design/methodology/approach - Based on the proposed approach, instead of the classic method of using a set of topic scores, the documents level scores are considered as the evaluation unit. These document scores are the defined document's weight, which play the role of the mean average precision (MAP) score of the systems as a significance test's statics. The experiments were conducted using the TREC 9 Web track collection. Findings - The p-values generated through the two types of significance tests, namely the Student's t-test and Mann-Whitney show that by using the document level scores as an evaluation unit, the difference between IR systems is more significant compared with utilizing topic scores. Originality/value - Utilizing a suitable test collection is a primary prerequisite for IR systems comparative evaluation. However, in addition to reusable test collections, having an accurate statistical testing is a necessity for these evaluations. The findings of this study will assist IR researchers to evaluate their retrieval systems and algorithms more accurately.
引用
收藏
页码:408 / 421
页数:14
相关论文
共 50 条
  • [21] An Information Retrieval Approach Based on Time Attributes
    Zheng, Xiaolin
    Zhou, Chaofei
    Shen, Yisu
    INFORMATION AND BUSINESS INTELLIGENCE, PT I, 2012, 267 : 78 - 85
  • [22] Using WordNet for Concept-Based Document Indexing in Information Retrieval
    Boubekeur, Fatiha
    Boughanem, Mohand
    Tamine, Lynda
    Daoud, Mariam
    SEMAPRO 2010: THE FOURTH INTERNATIONAL CONFERENCE ON ADVANCES IN SEMANTIC PROCESSING, 2010, : 151 - 157
  • [23] Pagico: Evaluating Wikipedia-based information retrieval in Portuguese
    Mota, Cristina
    Simoes, Alberto
    Freitas, Claudia
    Costa, Luis
    Santos, Diana
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 2015 - 2022
  • [24] Document Expansion Based on Clique for Markov Network Information Retrieval Model
    Gan, Lixin
    Tu, Wei
    2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 392 - 395
  • [25] CBDIR: Fast and Effective Content Based Document Information Retrieval System
    Cha, Moon Soo
    Kim, So Yeon
    Ha, Jae Hee
    Lee, Min-June
    Choi, Young-June
    Sohn, Kyung-Ah
    2015 IEEE/ACIS 14TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2015, : 203 - 208
  • [26] A new approach to clustering records in information retrieval systems
    Moghrabi, IAR
    Makholian, RA
    INFORMATION RETRIEVAL, 2000, 3 (02): : 105 - 126
  • [27] Instance cooperative memory to improve query expansion in Information Retrieval Systems
    Jéribi, L
    Rumpler, B
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2002, 8 (06): : 591 - 601
  • [28] Internet-based information and retrieval systems
    O'Leary, DE
    DECISION SUPPORT SYSTEMS, 1999, 27 (03) : 319 - 327
  • [29] Information granulation for Web based information retrieval support systems
    Yao, JT
    Yao, YY
    DATA MINING AND KNOWLEDGE DISCOVERY: TOOLS AND TECHNOLOGY V, 2003, 5098 : 138 - 146
  • [30] Category based customization approach for information retrieval
    Aihara, K
    Takasu, A
    USER MODELING 2001, PROCEEDINGS, 2001, 2109 : 207 - 209