A System for Efficient High-Recall Retrieval

被引:26
作者
Abualsaud, Mustafa [1 ]
Ghelani, Nimesh [1 ]
Zhang, Haotian [1 ]
Smucker, Mark D. [1 ]
Cormack, Gordon V. [1 ]
Grossman, Maura R. [1 ]
机构
[1] Univ Waterloo, Waterloo, ON, Canada
来源
ACM/SIGIR PROCEEDINGS 2018 | 2018年
基金
加拿大自然科学与工程研究理事会;
关键词
High-Recall; Electronic Discovery; Systematic Review;
D O I
10.1145/3209978.3210176
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The goal of high-recall information retrieval (HRIR) is to find all or nearly all relevant documents for a search topic. In this paper, we present the design of our system that affords efficient high-recall retrieval. HRIR systems commonly rely on iterative relevance feedback. Our system uses a state-of-the-art implementation of continuous active learning (CAL), and is designed to allow other feedback systems to be attached with little work. Our system allows users to judge documents as fast as possible with no perceptible interface lag. We also support the integration of a search engine for users who would like to interactively search and judge documents. In addition to detailing the design of our system, we report on user feedback collected as part of a 50 participants user study. While we have found that users find the most relevant documents when we restrict user interaction, a majority of participants prefer having flexibility in user interaction. Our work has implications on how to build effective assessment systems and what features of the system are believed to be useful by users.
引用
收藏
页码:1317 / 1320
页数:4
相关论文
共 11 条
[1]  
[Anonymous], 2017, TREC
[2]  
Cormack Gordon V., 2015, ABS150406868 CORR
[3]  
Cormack Gordon V, 2014, SIGIR
[4]  
Grossman Maura R, 2016, TREC
[5]  
Grossman Maura R, 2017, SIGIR
[6]  
Roegiest A., 2015, TREC
[7]  
Sandhaus Evan, 2008, NEW YORK TIMES ANNOT, DOI [10.35111/77ba-9x74, DOI 10.35111/77BA-9X74]
[8]  
Strohman T., 2005, P INT C INT AN, V2, P2
[9]  
Tan L., 2017, TREC
[10]  
Zhang Haotian, 2018, EVALUATING SENTENCE